English

Gravitational tree-code on graphics processing units: implementation in CUDA

Instrumentation and Methods for Astrophysics 2010-10-15 v1 Distributed, Parallel, and Cluster Computing

Abstract

We present a new very fast tree-code which runs on massively parallel Graphical Processing Units (GPU) with NVIDIA CUDA architecture. The tree-construction and calculation of multipole moments is carried out on the host CPU, while the force calculation which consists of tree walks and evaluation of interaction list is carried out on the GPU. In this way we achieve a sustained performance of about 100GFLOP/s and data transfer rates of about 50GB/s. It takes about a second to compute forces on a million particles with an opening angle of θ0.5\theta \approx 0.5. The code has a convenient user interface and is freely available for use\footnote{{\tt http://castle.strw.leidenuniv.nl/software/octgrav.html}}.

Keywords

Cite

@article{arxiv.1005.5384,
  title  = {Gravitational tree-code on graphics processing units: implementation in CUDA},
  author = {Evghenii Gaburov and Jeroen Bédorf and Simon Portegies Zwart},
  journal= {arXiv preprint arXiv:1005.5384},
  year   = {2010}
}

Comments

9 pages, 8 figures. Accepted for publication at International Conference on Computational Science 2010

R2 v1 2026-06-21T15:29:22.180Z