Related papers: Accelerating wave-propagation algorithms with adap…

Analysis and Performance Evaluation of Adjoint-Guided Adaptive Mesh Refinement for Linear Hyperbolic PDEs Using Clawpack

Adaptive mesh refinement (AMR) is often used when solving time-dependent partial differential equations using numerical methods. It enables time-varying regions of much higher resolution, which can be used to track discontinuities in the…

Numerical Analysis · Mathematics 2018-10-03 Brisa N Davis , Randall J LeVeque

Adjoint Methods for Guiding Adaptive Mesh Refinement in Wave Propagation Problems

One difficulty in developing numerical methods for hyperbolic systems of conservation laws is the fact that solutions often contain regions where much higher resolution is required than elsewhere in the domain, particularly since the…

Numerical Analysis · Mathematics 2015-11-12 Brisa N. Davis , Randall J. LeVeque

Adjoint Methods for Guiding Adaptive Mesh Refinement in Tsunami Modeling

One difficulty in developing numerical methods for tsunami modeling is the fact that solutions contain regions where much higher resolution is required than elsewhere in the domain, particularly since the solution may contain…

Numerical Analysis · Mathematics 2017-02-02 Brisa N. Davis , Randall J. LeVeque

Efficient Tsunami Modeling on Adaptive Grids with Graphics Processing Units (GPUs)

Solving the shallow water equations efficiently is critical to the study of natural hazards induced by tsunami and storm surge, since it provides more response time in an early warning system and allows more runs to be done for…

Computational Physics · Physics 2019-01-23 Xinsheng Qin , Randall LeVeque , Michael Motley

Accelerating Lattice QCD Simulations using GPUs

Solving discretized versions of the Dirac equation represents a large share of execution time in lattice Quantum Chromodynamics (QCD) simulations. Many high-performance computing (HPC) clusters use graphics processing units (GPUs) to offer…

High Energy Physics - Lattice · Physics 2024-07-02 Tilmann Matthaei

An efficient GPU-accelerated adaptive mesh refinement framework for high-fidelity compressible reactive flows modeling

This paper presents a heterogeneous adaptive mesh refinement (AMR) framework for efficient simulation of moderately stiff reactive problems. This framework features an elaborate subcycling-in-time algorithm along with a specialized…

Computational Physics · Physics 2025-06-04 Yuqi Wang , Yadong Zeng , Ralf Deiterding , Jianhan Liang

AGIPC: Adaptive In-Solve Algebraic Coarsening for GPU IPC

Implicit time integration is key to robustly simulating stiff materials and large deformations, but its performance is often dominated by repeatedly solving large linear systems. Adaptive coarsening can reduce this cost by concentrating…

Graphics · Computer Science 2026-05-07 Xuan Wang , Zhaofeng Luo , Minchen Li , Taku Komura , Kemeng Huang

An adaptive finite element multigrid solver using GPU acceleration

Adaptive finite elements combined with geometric multigrid solvers are one of the most efficient numerical methods for problems such as the instationary Navier-Stokes equations. Yet despite their efficiency, computations remain expensive…

Numerical Analysis · Mathematics 2025-12-23 Manuel Liebchen , Robert Jendersie , Utku Kaya , Christian Lessig , Thomas Richter

ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels

GPU-based HPC clusters are attracting more scientific application developers due to their extensive parallelism and energy efficiency. In order to achieve portability among a variety of multi/many core architectures, a popular choice for an…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-04-10 Ali TehraniJamsaz , Alok Mishra , Akash Dutta , Abid M. Malik , Barbara Chapman , Ali Jannesari

A Mixed Precision, Multi-GPU Design for Large-scale Top-K Sparse Eigenproblems

Graph analytics techniques based on spectral methods process extremely large sparse matrices with millions or even billions of non-zero values. Behind these algorithms lies the Top-K sparse eigenproblem, the computation of the largest…

Hardware Architecture · Computer Science 2022-01-20 Francesco Sgherzi , Alberto Parravicini , Marco Domenico Santambrogio

Task-Based Programming for Adaptive Mesh Refinement in Compressible Flow Simulations

High-order solvers for compressible flows are vital in scientific applications. Adaptive mesh refinement (AMR) is a key technique for reducing computational cost by concentrating resolution in regions of interest. In this work, we develop…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-08 Anjiang Wei , Hang Song , Mert Hidayetoglu , Elliott Slaughter , Sanjiva K. Lele , Alex Aiken

GPU-Accelerated Primal Heuristics for Mixed Integer Programming

We introduce a fusion of GPU accelerated primal heuristics for Mixed Integer Programming. Leveraging GPU acceleration enables exploration of larger search regions and faster iterations. A GPU-accelerated PDLP serves as an approximate LP…

Optimization and Control · Mathematics 2025-10-31 Akif Çördük , Piotr Sielski , Alice Boucher , Kumar Aatish

A Multi-GPU Programming Library for Real-Time Applications

We present MGPU, a C++ programming library targeted at single-node multi-GPU systems. Such systems combine disproportionate floating point performance with high data locality and are thus well suited to implement real-time algorithms. We…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-03-03 Sebastian Schaetz , Martin Uecker

Accelerating a Linear Programming Algorithm on AMD GPUs

Linear Programming (LP) is a foundational optimization technique with widespread applications in finance, energy trading, and supply chain logistics. However, traditional Central Processing Unit (CPU)-based LP solvers often struggle to meet…

Optimization and Control · Mathematics 2025-08-26 Xiyan Hu , Titus Parker , Connor Phillips , Yifa Yu

Adapting Atmospheric Chemistry Components for Efficient GPU Accelerators

Atmospheric models demand a lot of computational power and solving the chemical processes is one of its most computationally intensive components. This work shows how to improve the computational performance of the Multiscale Online…

Computational Physics · Physics 2025-01-03 Christian Guzman Ruiz , Matthew Dawson , Mario C. Acosta , Oriol Jorba , Eduardo Cesar Galobardes , Carlos Pérez García-Pando , Kim Serradell

Graphics Processing Unit acceleration of the Random Phase Approximation in the projector augmented wave method

The Random Phase Approximation (RPA) for correlation energy in the grid-based projector augmented wave (gpaw) code is accelerated by porting to the Graphics Processing Unit (GPU) architecture. The acceleration is achieved by grouping…

Computational Physics · Physics 2013-07-31 Jun Yan , Lin Li , Christopher O'Grady

Real-Time Computation of Parameter Fitting and Image Reconstruction Using Graphical Processing Units

In recent years graphical processing units (GPUs) have become a powerful tool in scientific computing. Their potential to speed up highly parallel applications brings the power of high performance computing to a wider range of users.…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-04-26 Uldis Locans , Andreas Adelmann , Andreas Suter , Jannis Fischer , Werner Lustermann , Gunther Dissertori , Qiulin Wang

Enabling Software Resilience in GPGPU Applications via Partial Thread Protection

Graphics Processing Units (GPUs) are widely used by various applications in a broad variety of fields to accelerate their computation but remain susceptible to transient hardware faults (soft errors) that can easily compromise application…

Software Engineering · Computer Science 2021-03-30 Lishan Yang , Bin Nie , Adwait Jog , Evgenia Smirni

Towards making the most of NLP-based device mapping optimization for OpenCL kernels

Nowadays, we are living in an era of extreme device heterogeneity. Despite the high variety of conventional CPU architectures, accelerator devices, such as GPUs and FPGAs, also appear in the foreground exploding the pool of available…

Machine Learning · Computer Science 2022-08-31 Petros Vavaroutsos , Ioannis Oroutzoglou , Dimosthenis Masouros , Dimitrios Soudris

Using hybrid GPU/CPU kernel splitting to accelerate spherical convolutions

We present a general method for accelerating by more than an order of magnitude the convolution of pixelated functions on the sphere with a radially-symmetric kernel. Our method splits the kernel into a compact real-space component and a…

Cosmology and Nongalactic Astrophysics · Physics 2015-03-31 P. M. Sutter , Benjamin D. Wandelt , Franz Elsner