Computational Physics · Physics
A Batched GPU Methodology for Numerical Solutions of Partial Differential Equations
Enda Carroll, Andrew Gloster, Miguel D. Bustamante, Lennon Ó Náraigh
2021-07-13
Distributed, Parallel, and Cluster Computing · Computer Science
Simultaneous Solving of Batched Linear Programs on a GPU
Amit Gurung, Rajarshi Ray
2018-02-26
Distributed, Parallel, and Cluster Computing · Computer Science
Boosting Performance of Iterative Applications on GPUs: Kernel Batching with CUDA Graphs
Jonah Ekelund, Stefano Markidis, Ivy Peng
2025-05-01
Optimization and Control · Mathematics
Batched First-Order Methods for Parallel LP Solving in MIP
Nicolas Blin, Stefano Gualandi, Christopher Maes, Andrea Lodi +1
2026-01-30
Machine Learning · Computer Science
Efficient GPU implementation of randomized SVD and its applications
Łukasz Struski, Paweł Morkisz, Przemysław Spurek, Samuel Rodriguez Bernabeu +1
2024-03-13
Distributed, Parallel, and Cluster Computing · Computer Science
CuLDA_CGS: Solving Large-scale LDA Problems on GPUs
Xiaolong Xie, Yun Liang, Xiuhong Li, Wei Tan
2018-03-14
Distributed, Parallel, and Cluster Computing · Computer Science
Developing a High Performance Software Library with MPI and CUDA for Matrix Computations
Bogdan Oancea, Tudorel Andrei
2018-02-09
Distributed, Parallel, and Cluster Computing · Computer Science
Solving Large Rank-Deficient Linear Least-Squares Problems on Shared-Memory CPU Architectures and GPU Architectures
Mónica Chillarón, Gregorio Quintana-Ortí, Vicente Vidal, Per-Gunnar Martinsson
2024-08-13
Distributed, Parallel, and Cluster Computing · Computer Science
Efficient hybrid topology optimization using GPU and homogenization based multigrid approach
Arya Prakash Padhi, Souvik Chakraborty, Anupam Chakrabarti, Rajib Chowdhury
2022-02-01
Optimization and Control · Mathematics
cuHALLaR: A GPU Accelerated Low-Rank Augmented Lagrangian Method for Large-Scale Semidefinite Programming
Jacob M. Aguirre, Diego Cifuentes, Vincent Guigues, Renato D. C. Monteiro +2
2025-10-27
Distributed, Parallel, and Cluster Computing · Computer Science
Two-Dimensional Batch Linear Programming on the GPU
John Charlton, Steve Maddock, Paul Richmond
2019-02-14
Computational Engineering, Finance, and Science · Computer Science
Iterative Methods in GPU-Resident Linear Solvers for Nonlinear Constrained Optimization
Kasia Świrydowicz, Nicholson Koukpaizan, Maksudul Alam, Shaked Regev +2
2024-01-26
Distributed, Parallel, and Cluster Computing · Computer Science
Accelerating Matrix Multiplication: A Performance Comparison Between Multi-Core CPU and GPU
Mufakir Qamar Ansari, Mudabir Qamar Ansari
2025-07-30
Distributed, Parallel, and Cluster Computing · Computer Science
Solving Batched Linear Programs on GPU and Multicore CPU
Amit Gurung, Rajarshi Ray
2016-09-27
Mathematical Software · Computer Science
An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures
Shengguo Li, Francois-Henry Rouet, Jie Liu, Chun Huang +2
2016-12-27
Distributed, Parallel, and Cluster Computing · Computer Science
ML-Based Optimum Number of CUDA Streams for the GPU Implementation of the Tridiagonal Partition Method
Milena Veneva, Toshiyuki Imamura
2026-05-22
Distributed, Parallel, and Cluster Computing · Computer Science
Heterogeneous FPGA+GPU Embedded Systems: Challenges and Opportunities
Mohammad Hosseinabady, Mohd Amiruddin Bin Zainol, Jose Nunez-Yanez
2019-01-28
Distributed, Parallel, and Cluster Computing · Computer Science
Accelerating Bidiagonalization of Banded Matrices through Memory-Aware Bulge-Chasing on GPUs
Evelyne Ringoot, Rabab Alomairy, Alan Edelman
2026-01-14
Distributed, Parallel, and Cluster Computing · Computer Science
A Hybrid Multi-GPU Implementation of Simplex Algorithm with CPU Collaboration
Basilis Mamalis, Marios Perlitis
2022-11-22
Distributed, Parallel, and Cluster Computing · Computer Science
Faster and Cheaper: Parallelizing Large-Scale Matrix Factorization on GPUs
Wei Tan, Liangliang Cao, Liana Fong
2016-10-25
Mathematical Software · Computer Science
CUDACLAW: A high-performance programmable GPU framework for the solution of hyperbolic PDEs
H. Gorune Ohannessian, George Turkiyyah, Aron Ahmadia, David Ketcheson
2018-05-24