Related papers: Scalable Parallel Numerical Constraint Solver Usin…

Scalable Parallel Numerical CSP Solver

We present a parallel solver for numerical constraint satisfaction problems (NCSPs) that can scale on a number of cores. Our proposed method runs worker solvers on the available cores and simultaneously the workers cooperate for the search…

Artificial Intelligence · Computer Science 2014-11-07 Daisuke Ishii , Kazuki Yoshizoe , Toyotaro Suzumura

GLB: Lifeline-based Global Load Balancing library in X10

We present GLB, a programming model and an associated implementation that can handle a wide range of irregular paral- lel programming problems running over large-scale distributed systems. GLB is applicable both to problems that are easily…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-12-20 Wei Zhang , Olivier Tardieu , David Grove , Benjamin Herta , Tomio Kamada , Vijay Saraswat , Mikio Takeuchi

Implicitly Parallel Neuromorphic Solver Design for Constraint Satisfaction Problems

Many real-life problems of practical importance -- spanning a wide range of applications from chip design to bioinformatics -- represent constraint satisfaction problems, where classical solvers have to rely on heuristic approximations due…

Emerging Technologies · Computer Science 2026-03-03 Recep Bugra Uludag , Ahmet Efe , Ismail Akturk , Ulya R Karpuzcu

A Variant of Concurrent Constraint Programming on GPU

The number of cores on graphical computing units (GPUs) is reaching thousands nowadays, whereas the clock speed of processors stagnates. Unfortunately, constraint programming solvers do not take advantage yet of GPU parallelism. One reason…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-07-26 Pierre Talbot , Frédéric Pinel , Pascal Bouvry

CuClarabel: GPU Acceleration for a Conic Optimization Solver

We present the GPU implementation of the general-purpose interior-point solver Clarabel for convex optimization problems with conic constraints. We introduce a mixed parallel computing strategy that processes linear constraints first, then…

Optimization and Control · Mathematics 2025-11-04 Yuwen Chen , Danny Tse , Parth Nobel , Paul Goulart , Stephen Boyd

Parallel algorithms for problems of cluster analysis with very large amount of data

In this paper we solve on GPUs massive problems with large amount of data, which are not appropriate for solution with the SIMD technology. For the given problem we consider a three-level parallelization. The multithreading of CPU is used…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-02-18 Natalya Litvinenko

Experimenting with Constraint Programming on GPU

The focus of my PhD thesis is on exploring parallel approaches to efficiently solve problems modeled by constraints and presenting a new proposal. Current solvers are very advanced; they are carefully designed to effectively manage the…

Artificial Intelligence · Computer Science 2019-09-23 Fabio Tardivo

Parallel degree computation for solution space of binomial systems with an application to the master space of $\mathcal{N}=1$ gauge theories

The problem of solving a system of polynomial equations is one of the most fundamental problems in applied mathematics. Among them, the problem of solving a system of binomial equations form a important subclass for which specialized…

Algebraic Geometry · Mathematics 2015-03-03 Tianran Chen , Dhagash Mehta

Parallel Sparse Matrix Solver on the GPU Applied to Simulation of Electrical Machines

Nowadays, several industrial applications are being ported to parallel architectures. In fact, these platforms allow acquire more performance for system modelling and simulation. In the electric machines area, there are many problems which…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-10-25 Antonio Wendell De Oliveira Rodrigues , Frédéric Guyomarch , Yvonnick Le Menach , Jean-Luc Dekeyser

Optimizing Fine-Grained Parallelism Through Dynamic Load Balancing on Multi-Socket Many-Core Systems

Achieving efficient task parallelism on many-core architectures is an important challenge. The widely used GNU OpenMP implementation of the popular OpenMP parallel programming model incurs high overhead for fine-grained, short-running tasks…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-20 Wenyi Wang , Maxime Gonthier , Poornima Nookala , Haochen Pan , Ian Foster , Ioan Raicu , Kyle Chard

TurboSAT: Gradient-Guided Boolean Satisfiability Accelerated on GPU-CPU Hybrid System

While accelerated computing has transformed many domains of computing, its impact on logical reasoning, specifically Boolean satisfiability (SAT), remains limited. State-of-the-art SAT solvers rely heavily on inherently sequential…

Logic in Computer Science · Computer Science 2025-11-12 Steve Dai , Cunxi Yu , Kalyan Krishnamani , Brucek Khailany

Scalable hierarchical parallel algorithm for the solution of super large-scale sparse linear equations

The parallel linear equations solver capable of effectively using 1000+ processors becomes the bottleneck of large-scale implicit engineering simulations. In this paper, we present a new hierarchical parallel master-slave-structural…

Computational Physics · Physics 2015-06-11 Ran Xu , Bin Liu , Yuan Dong

A scalable elliptic solver with task-based parallelism for the SpECTRE numerical relativity code

Elliptic partial differential equations must be solved numerically for many problems in numerical relativity, such as initial data for every simulation of merging black holes and neutron stars. Existing elliptic solvers can take multiple…

General Relativity and Quantum Cosmology · Physics 2022-04-19 Nils L. Vu , Harald P. Pfeiffer , Gabriel S. Bonilla , Nils Deppe , François Hébert , Lawrence E. Kidder , Geoffrey Lovelace , Jordan Moxon , Mark A. Scheel , Saul A. Teukolsky , William Throwe , Nikolas A. Wittek , Tom Włodarczyk

A New Parallel N-body Gravity Solver: TPM

We have developed a gravity solver based on combining the well developed Particle-Mesh (PM) method and TREE methods. It is designed for and has been implemented on parallel computer architectures. The new code can deal with tens of millions…

Astrophysics · Physics 2009-10-22 Guohong Xu

Hybrid Numerical Solvers for Massively Parallel Eigenvalue Computation and Their Benchmark with Electronic Structure Calculations

Optimally hybrid numerical solvers were constructed for massively parallel generalized eigenvalue problem (GEP).The strong scaling benchmark was carried out on the K computer and other supercomputers for electronic structure calculation…

Computational Physics · Physics 2016-02-10 Hiroto Imachi , Takeo Hoshi

Parallel Branch Model Predictive Control on GPUs

We present a parallel GPU-accelerated solver for branch Model Predictive Control problems. Based on iterative LQR methods, our solver exploits the tree-sparse structure and implements temporal parallelism using the parallel scan algorithm.…

Systems and Control · Electrical Eng. & Systems 2025-06-17 Luyao Zhang , Chenghuai Lin , Sergio Grammatico

GSPMD: General and Scalable Parallelization for ML Computation Graphs

We present GSPMD, an automatic, compiler-based parallelization system for common machine learning computations. It allows users to write programs in the same way as for a single device, then give hints through a few annotations on how to…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-12-28 Yuanzhong Xu , HyoukJoong Lee , Dehao Chen , Blake Hechtman , Yanping Huang , Rahul Joshi , Maxim Krikun , Dmitry Lepikhin , Andy Ly , Marcello Maggioni , Ruoming Pang , Noam Shazeer , Shibo Wang , Tao Wang , Yonghui Wu , Zhifeng Chen

A Parallel and Highly-Portable HPC Poisson Solver: Preconditioned Bi-CGSTAB with alpaka

This paper presents the design, implementation, and performance analysis of a parallel and GPU-accelerated Poisson solver based on the Preconditioned Bi-Conjugate Gradient Stabilized (Bi-CGSTAB) method. The implementation utilizes the MPI…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-13 Luca Pennati , Måns I. Andersson , Klaus Steiniger , Rene Widera , Tapish Narwal , Michael Bussmann , Stefano Markidis

Multi-GPU SNN Simulation with Static Load Balancing

We present a SNN simulator which scales to millions of neurons, billions of synapses, and 8 GPUs. This is made possible by 1) a novel, cache-aware spike transmission algorithm 2) a model parallel multi-GPU distribution scheme and 3) a…

Neural and Evolutionary Computing · Computer Science 2021-09-23 Dennis Bautembach , Iason Oikonomidis , Antonis Argyros

Simultaneous Solving of Batched Linear Programs on a GPU

Linear Programs (LPs) appear in a large number of applications and offloading them to a GPU is viable to gain performance. Existing work on offloading and solving an LP on a GPU suggests that there is performance gain generally on large…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-02-26 Amit Gurung , Rajarshi Ray