English
Related papers

Related papers: Optimizing the Gravitational Tree Algorithm for Ma…

200 papers

Modeling of collisionless galactic systems is based on the N-body model, which requires large computational resources due to the long-range nature of gravitational forces. The most common method for calculating gravity is the TreeCode…

Computational Physics · Physics 2024-12-03 Nikolay M. Kuzmin , Danila S. Sirotin , Alexander V. Khoperskov

We present a new very fast tree-code which runs on massively parallel Graphical Processing Units (GPU) with NVIDIA CUDA architecture. The tree-construction and calculation of multipole moments is carried out on the host CPU, while the force…

Instrumentation and Methods for Astrophysics · Physics 2010-10-15 Evghenii Gaburov , Jeroen Bédorf , Simon Portegies Zwart

We propose a hybrid tree algorithm for reducing calculation and communication cost of collision-less N-body simulations. The concept of our algorithm is that we split interaction force into two parts: hard-force from neighbor particles and…

Instrumentation and Methods for Astrophysics · Physics 2014-06-25 Tsuyoshi Watanabe , Naohito Nakasato

We introduce our new binary tree code for neighbour search and gravitational force calculations in an N-particle system. The tree is built in a "top-down" fashion by "recursive coordinate bisection" where on each tree level we split the…

Instrumentation and Methods for Astrophysics · Physics 2011-11-24 Emanuel Gafton , Stephan Rosswog

We present parallel algorithms for constructing and traversing sparse octrees on graphics processing units (GPUs). The algorithms are based on parallel-scan and sort methods. To test the performance and feasibility, we implemented them in…

Instrumentation and Methods for Astrophysics · Physics 2012-04-11 Jeroen Bédorf , Evghenii Gaburov , Simon Portegies Zwart

This paper presents an octree construction method, called Cornerstone, that facilitates global domain decomposition and interactions between particles in mesh-free numerical simulations. Our method is based on algorithms developed for 3D…

Instrumentation and Methods for Astrophysics · Physics 2023-07-14 Sebastian Keller , Aurélien Cavelan , Rubén Cabezon , Lucio Mayer , Florina M. Ciorba

We present an implementation of the hierarchical tree algorithm on the individual timestep algorithm (the Hermite scheme) for collisional $N$-body simulations, running on GRAPE-9 system, a special-purpose hardware accelerator for…

Instrumentation and Methods for Astrophysics · Physics 2016-03-23 Toshiyuki Fukushige , Atsushi Kawai

This paper investigates the execution of tree-shaped task graphs using multiple processors. Each edge of such a tree represents some large data. A task can only be executed if all input and output data fit into memory, and a data can only…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-10-02 Lionel Eyraud-Dubois , Loris Marchal , Oliver Sinnen , Frédéric Vivien

The kd-tree is a fundamental tool in computer science. Among others, an application of the kd-tree search (oct-tree method) to fast evaluation of particle interactions and neighbor search is highly important since computational complexity…

Instrumentation and Methods for Astrophysics · Physics 2009-09-04 N. Nakasato

The wavelet tree has become a very useful data structure to efficiently represent and query large volumes of data in many different domains, from bioinformatics to geographic information systems. One problem with wavelet trees is their…

Data Structures and Algorithms · Computer Science 2016-10-20 José Fuentes-Sepúlveda , Erick Elejalde , Leo Ferres , Diego Seco

This paper investigates the execution of tree-shaped task graphs using multiple processors. Each edge of such a tree represents a large IO file. A task can only be executed if all input and output files fit into memory, and a file can only…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-10-10 Loris Marchal , Oliver Sinnen , Frédéric Vivien

We describe a parallel version of our tree-code for the simulation of self-gravitating systems in Astrophysics. It is based on a dynamic and adaptive method for the domain decomposition, which exploits the hierarchical data arrangement used…

Astrophysics · Physics 2007-05-23 P. Miocchi , R. Capuzzo-Dolcetta

The wavelet tree (Grossi et al. [SODA, 2003]) and wavelet matrix (Claude et al. [Inf. Syst., 47:15--32, 2015]) are compact indices for texts over an alphabet $[0,\sigma)$ that support rank, select and access queries in $O(\lg \sigma)$ time.…

Data Structures and Algorithms · Computer Science 2017-11-13 Johannes Fischer , Florian Kurpicz , Marvin Löbel

The forest-of-octrees approach to parallel adaptive mesh refinement and coarsening (AMR) has recently been demonstrated in the context of a number of large-scale PDE-based applications. Although linear octrees, which store only leaf…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-11-05 Tobin Isaac , Carsten Burstedde , Lucas C. Wilcox , Omar Ghattas

In this paper, we present a novel massively parallel algorithm for accelerating the decision tree building procedure on GPUs (Graphics Processing Units), which is a crucial step in Gradient Boosted Decision Tree (GBDT) and random forests…

Machine Learning · Statistics 2017-06-27 Huan Zhang , Si Si , Cho-Jui Hsieh

We present a gravitational hierarchical N-body code that is designed to run efficiently on Graphics Processing Units (GPUs). All parts of the algorithm are executed on the GPU which eliminates the need for data transfer between the Central…

Instrumentation and Methods for Astrophysics · Physics 2012-04-12 Jeroen Bédorf , Evghenii Gaburov , Simon Portegies Zwart

A recent work shows how we can optimize a tree based mode of operation for a rate 1 hash function. In particular, an algorithm and a theorem are presented for selecting a good tree topology in order to optimize both the running time and the…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-04-15 Kevin Atighehchi

In this paper, we study a parallel version of Galton-Watson processes for the random generation of tree-shaped structures. Random trees are useful in many situations (testing, binary search, simulation of physics phenomena,...) as attests…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-06-22 Olivier Bodini , Camille Coti , Julien David

Dynamic tree data structures maintain a forest while supporting insertion and deletion of edges and a broad set of queries in $O(\log n)$ time per operation. Such data structures are at the core of many modern algorithms. Recent work has…

Data Structures and Algorithms · Computer Science 2025-06-23 Humza Ikram , Andrew Brady , Daniel Anderson , Guy Blelloch

Nowadays, multiprocessing is mainstream with exponentially increasing number of processors. Load balancing is, therefore, a critical operation for the efficient execution of parallel algorithms. In this paper we consider the fundamental…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-03 Osama Talaat Ibrahim , Ahmed El-Mahdy
‹ Prev 1 2 3 10 Next ›