English
Related papers

Related papers: Implementation and performance of FDPS: A Framewor…

200 papers

In this paper, we describe the algorithms we implemented in FDPS to make efficient use of accelerator hardware such as GPGPUs. We have developed FDPS to make it possible for many researchers to develop their own high-performance parallel…

Instrumentation and Methods for Astrophysics · Physics 2020-02-12 Masaki Iwasawa , Daisuke Namekata , Keigo Nitadori , Kentaro Nomura , Long Wang , Miyuki Tsubouchi , Junichiro Makino

Numerical simulations based on particle methods have been widely used in various fields including astrophysics. To date, simulation softwares have been developed by individual researchers or research groups in each field, with a huge amount…

Instrumentation and Methods for Astrophysics · Physics 2018-06-27 Daisuke Namekata , Masaki Iwasawa , Keigo Nitadori , Ataru Tanikawa , Takayuki Muranushi , Long Wang , Natsuki Hosono , Kentaro Nomura , Junichiro Makino

Scalable and efficient numerical simulations continue to gain importance, as computation is firmly established as the third pillar of discovery, alongside theory and experiment. Meanwhile, the performance of computing hardware grows through…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-26 Pietro Incardona , Antonio Leo , Yaroslav Zaluzhnyi , Rajesh Ramaswamy , Ivo F. Sbalzarini

We present an algorithm for parallelising the TreePM code. We use both functional and domain decompositions. Functional decomposition is used to separate the computation of long range and short range forces, as well as the task of…

Astrophysics · Physics 2007-05-23 Suryadeep Ray , J. S. Bagla

We developed a portable code for dissipative particle dynamics (DPD) simulations. This Fortran program named CAMUS has a couple of notable features. One is the omission of constructing the so-called neighboring particles list, providing a…

Chemical Physics · Physics 2018-07-03 Hideo Doi , Koji Okuwaki , Takamitsu Naito , Sona Saitou , Yuji Mochizuki

The development of cost-effective highperformance parallel computing on multi-processor supercomputers makes it attractive to port excessively time consuming simulation software from personal computers (PC) to super computes. The power…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Ning Lu , Z. Todd Taylor , David P. Chassin , Ross T. Guttromson , R. Scott Studham

We introduce a natively distributed mini-application benchmark representative of plastic spiking neural network simulators. It can be used to measure performances of existing computing platforms and to drive the development of future…

This article introduces a highly parallel algorithm for molecular dynamics simulations with short-range forces on single node multi- and many-core systems. The algorithm is designed to achieve high parallel speedups for strongly…

Computational Physics · Physics 2013-11-20 R. Meyer

We present a computational algorithm for computing short range forces between particles. The algorithm has two distinguishing features. First, it is optimized for multi-processor computers, and will use as many processors as are available.…

Astrophysics · Physics 2008-02-03 Robert C. Ferrell , Edmund Bertschinger

A fast $N$-body code has been developed for simulating a stellar disk embedded in a live dark matter halo. In generating its Poisson solver, a self-consistent field (SCF) code which inherently possesses perfect scalability is incorporated…

Astrophysics of Galaxies · Physics 2023-05-10 Shunsuke Hozumi , Keigo Nitadori , Masaki Iwasawa

This article presents a depth-first search (DFS)-based algorithm for evaluating sensitivity gradients in the topology optimization of soft materials exhibiting complex deformation behavior. The algorithm is formulated using a time-dependent…

Computational Engineering, Finance, and Science · Computer Science 2025-04-11 Anurag Bhattacharyya

The current landscape of scientific research is widely based on modeling and simulation, typically with complexity in the simulation's flow of execution and parameterization properties. Execution flows are not necessarily straightforward…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-07-26 Eduardo Ponce , Brittany Stephenson , Suzanne Lenhart , Judy Day , Gregory D. Peterson

Traditional heterogeneous parallel algorithms, designed for heterogeneous clusters of workstations, are based on the assumption that the absolute speed of the processors does not depend on the size of the computational task. This assumption…

Distributed, Parallel, and Cluster Computing · Computer Science 2011-09-15 Alexey Lastovetsky , Ravi Reddy , Vladimir Rychkov , David Clarke

An adpative integration technique for time advancement of particle motion in the context of coupled computational fluid dynamics (CFD) - discrete element method (DEM) simulations is presented in this work. CFD-DEM models provide an accurate…

Computational Physics · Physics 2018-02-28 Hariswaran Sitaraman , Ray Grout

I describe here the performances of a parallel treecode with individual particle timesteps. The code is based on the Barnes-Hut algorithm and runs cosmological N-body simulations on parallel machines with a distributed memory architecture…

Astrophysics · Physics 2007-05-23 R. Valdarnini

Particle tracking in large-scale numerical simulations of turbulent flows presents one of the major bottlenecks in parallel performance and scaling efficiency. Here, we describe a particle tracking algorithm for large-scale parallel…

Fluid Dynamics · Physics 2022-05-31 Cristian C. Lalescu , Bérenger Bramas , Markus Rampp , Michael Wilczek

We introduce a particle-based simulation method for granular material in interactive frame rates. We divide the simulation into two decoupled steps. In the first step, a relatively small number of particles is accurately simulated with a…

Graphics · Computer Science 2023-08-04 Alexander Sommer , Ulrich Schwanecke , Elmar Schömer

Fluid flow simulation is a highly active area with applications in a wide range of engineering problems and interactive systems. Meshless methods like the Moving Particle Semi-implicit (MPS) are a great alternative to deal efficiently with…

An improved implementation of an N-body code for simulating collisionless cosmological dynamics is presented. TPM (Tree-Particle-Mesh) combines the PM method on large scales with a tree code to handle particle-particle interactions at small…

Astrophysics · Physics 2009-11-07 Paul Bode , Jeremiah P. Ostriker

We present a scalable dissipative particle dynamics simulation code, fully implemented on the Graphics Processing Units (GPUs) using a hybrid CUDA/MPI programming model, which achieves 10-30 times speedup on a single GPU over 16 CPU cores…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-12-01 Yu-Hang Tang , George Em Karniadakis
‹ Prev 1 2 3 10 Next ›