English
Related papers

Related papers: A GPU accelerated Barnes-Hut Tree Code for FLASH4

200 papers

We have developed a highly-tuned software library that accelerates the calculation of quadrupole terms in the Barnes-Hut tree code by use of a SIMD instruction set on the x86 architecture, Advanced Vector eXtensions 2 (AVX2). Our code is…

Instrumentation and Methods for Astrophysics · Physics 2019-02-13 Tetsushi Kodama , Tomoaki Ishiyama

We describe an OctTree algorithm for the MPI-parallel, adaptive mesh-refinement code {\sc FLASH}, which can be used to calculate the gas self-gravity, and also the angle-averaged local optical depth, for treating ambient diffuse radiation.…

Instrumentation and Methods for Astrophysics · Physics 2018-01-24 Richard Wünsch , Stefanie Walch , František Dinnbier , Anthony P. Whitworth

We present a new very fast tree-code which runs on massively parallel Graphical Processing Units (GPU) with NVIDIA CUDA architecture. The tree-construction and calculation of multipole moments is carried out on the host CPU, while the force…

Instrumentation and Methods for Astrophysics · Physics 2010-10-15 Evghenii Gaburov , Jeroen Bédorf , Simon Portegies Zwart

I describe here the performance of a parallel treecode with individual particle timesteps. The code is based on the Barnes-Hut algorithm and runs cosmological N-body simulations on parallel machines with a distributed memory architecture…

Astrophysics · Physics 2009-11-07 R. Valdarnini

I describe here the performances of a parallel treecode with individual particle timesteps. The code is based on the Barnes-Hut algorithm and runs cosmological N-body simulations on parallel machines with a distributed memory architecture…

Astrophysics · Physics 2007-05-23 R. Valdarnini

We present parallel algorithms for constructing and traversing sparse octrees on graphics processing units (GPUs). The algorithms are based on parallel-scan and sort methods. To test the performance and feasibility, we implemented them in…

Instrumentation and Methods for Astrophysics · Physics 2012-04-11 Jeroen Bédorf , Evghenii Gaburov , Simon Portegies Zwart

We describe a modified version of the NBODY6 code for simulating star clusters which greatly improves computational efficiency while sacrificing little in the way of accuracy. The distant force calculator is replaced by a GPU-enabled…

Instrumentation and Methods for Astrophysics · Physics 2021-11-22 Anthony D. Arnold , Holger Baumgardt , Long Wang

In this study, the gravitational octree code originally optimized for the Fermi, Kepler, and Maxwell GPU architectures is adapted to the Volta architecture. The Volta architecture introduces independent thread scheduling requiring either…

Mathematical Software · Computer Science 2018-11-08 Yohei Miki

We describe the implementation and performance of the ${\rm P^3T}$ (Particle-Particle Particle-Tree) scheme for simulating dense stellar systems. In ${\rm P^3T}$, the force experienced by a particle is split into short-range and long-range…

Instrumentation and Methods for Astrophysics · Physics 2015-06-16 Masaki Iwasawa , Simon Portegies Zwart , Junichiro Makino

We present a gravitational hierarchical N-body code that is designed to run efficiently on Graphics Processing Units (GPUs). All parts of the algorithm are executed on the GPU which eliminates the need for data transfer between the Central…

Instrumentation and Methods for Astrophysics · Physics 2012-04-12 Jeroen Bédorf , Evghenii Gaburov , Simon Portegies Zwart

We present the newly developed code, GAMER (GPU-accelerated Adaptive MEsh Refinement code), which has adopted a novel approach to improve the performance of adaptive mesh refinement (AMR) astrophysical simulations by a large factor with the…

Instrumentation and Methods for Astrophysics · Physics 2010-02-08 Hsi-Yu Schive , Yu-Chih Tsai , Tzihong Chiueh

To assess how future progress in gravitational microlensing computation at high optical depth will rely on both hardware and software solutions, we compare a direct inverse ray-shooting code implemented on a graphics processing unit (GPU)…

Instrumentation and Methods for Astrophysics · Physics 2015-05-19 N. F. Bate , C. J. Fluke , B. R. Barsdell , H. Garsden , G. F. Lewis

General-relativistic magnetohydrodynamic (GRMHD) simulations have revolutionized our understanding of black hole accretion. Here, we present a graphics processing unit (GPU) accelerated GRMHD code \hammer{} with multi-faceted optimizations…

High Energy Astrophysical Phenomena · Physics 2022-12-06 M. T. P. Liska , K. Chatterjee , D. Issa , D. Yoon , N. Kaaz , A. Tchekhovskoy , D. van Eijnatten , G. Musoke , C. Hesp , V. Rohoza , S. Markoff , A. Ingram , M. van der Klis

Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently is no piece of cake.…

Computational Physics · Physics 2012-08-14 Rio Yokota , Lorena Barba

The tree method is a widely implemented algorithm for collisionless $N$-body simulations in astrophysics well suited for GPU(s). Adopting hierarchical time stepping can accelerate $N$-body simulations; however, it is infrequently…

Instrumentation and Methods for Astrophysics · Physics 2016-11-09 Yohei Miki , Masayuki Umemura

We describe a parallel, cosmological N-body code based on a hybrid scheme using the particle-mesh (PM) and Barnes-Hut (BH) oct-tree algorithm. We call the algorithm GOTPM for Grid-of-Oct-Trees-Particle-Mesh. The code is parallelized using…

Astrophysics · Physics 2015-06-24 John Dubinski , Juhan Kim , Changbom Park , Robin Humble

We present an approach to molecular-dynamics simulations of ferrofluids on graphics processing units (GPUs). Our numerical scheme is based on a GPU-oriented modification of the Barnes-Hut (BH) algorithm designed to increase the parallelism…

Computational Physics · Physics 2013-04-30 A. Yu. Polyakov , T. V. Lyutyy , S. Denisov , V. V. Reva , P. Hanggi

In this paper, we present a novel massively parallel algorithm for accelerating the decision tree building procedure on GPUs (Graphics Processing Units), which is a crucial step in Gradient Boosted Decision Tree (GBDT) and random forests…

Machine Learning · Statistics 2017-06-27 Huan Zhang , Si Si , Cho-Jui Hsieh

We propose a hybrid tree algorithm for reducing calculation and communication cost of collision-less N-body simulations. The concept of our algorithm is that we split interaction force into two parts: hard-force from neighbor particles and…

Instrumentation and Methods for Astrophysics · Physics 2014-06-25 Tsuyoshi Watanabe , Naohito Nakasato

In this paper, we develop the first entirely graphic processing unit (GPU) based h-adaptive flux reconstruction (FR) method with linear trees. The adaptive solver fully operates on the GPU hardware, using a linear quadtree for two…

Fluid Dynamics · Physics 2023-06-13 Lai Wang , Freddie Witherden , Antony Jameson
‹ Prev 1 2 3 10 Next ›