English

PhotoNs-GPU:A GPU accelerated cosmological simulation code

Instrumentation and Methods for Astrophysics 2021-12-28 v1

Abstract

We present a GPU-accelerated cosmological simulation code, PhotoNs-GPU, based on algorithm of Particle Mesh Fast Multipole Method (PM-FMM), and focus on the GPU utilization and optimization. A proper interpolated method for truncated gravity is introduced to speed up the special functions in kernels. We verify the GPU code in mixed precision and different levels of interpolated method on GPU. A run with single precision is roughly two times faster that double precision for current practical cosmological simulations. But it could induce a unbiased small noise in power spectrum. Comparing with the CPU version of PhotoNs and Gadget-2, the efficiency of new code is significantly improved. Activated all the optimizations on the memory access, kernel functions and concurrency management, the peak performance of our test runs achieves 48% of the theoretical speed and the average performance approaches to 35% on GPU.

Keywords

Cite

@article{arxiv.2107.14008,
  title  = {PhotoNs-GPU:A GPU accelerated cosmological simulation code},
  author = {Qiao Wang and Chen Meng},
  journal= {arXiv preprint arXiv:2107.14008},
  year   = {2021}
}

Comments

Accepted for publication in RAA, 7 pages, 5 figures