Related papers: GPU Accelerated Sparse Cholesky Factorization

An Efficient Solver for Sparse Linear Systems Based on Rank-Structured Cholesky Factorization

Direct factorization methods for the solution of large, sparse linear systems that arise from PDE discretizations are robust, but typically show poor time and memory scalability for large systems. In this paper, we describe an efficient…

Numerical Analysis · Computer Science 2015-07-21 Jeffrey N. Chadwick , David S. Bindel

Sparse Cholesky Factorization for Solving Nonlinear PDEs via Gaussian Processes

In recent years, there has been widespread adoption of machine learning-based approaches to automate the solving of partial differential equations (PDEs). Among these approaches, Gaussian processes (GPs) and kernel methods have garnered…

Numerical Analysis · Mathematics 2024-03-12 Yifan Chen , Houman Owhadi , Florian Schäfer

Fast Sparse Matrix Permutation for Mesh-Based Direct Solvers

We present a fast sparse matrix permutation algorithm tailored to linear systems arising from triangle meshes. Our approach produces nested-dissection-style permutations while significantly reducing permutation runtime overhead. Rather than…

Graphics · Computer Science 2026-02-03 Behrooz Zarebavami , Ahmed H. Mahmoud , Ana Dodik , Changcheng Yuan , Serban D. Porumbescu , John D. Owens , Maryam Mehri Dehnavi , Justin Solomon

Neural Acceleration of Incomplete Cholesky Preconditioners

The solution of a sparse system of linear equations is ubiquitous in scientific applications. Iterative methods, such as the Preconditioned Conjugate Gradient method (PCG), are normally chosen over direct methods due to memory and…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-04 Joshua Dennis Booth , Hongyang Sun , Trevor Garnett

Optimization of the Sparse Multi-Threaded Cholesky Factorization for A64FX

Sparse linear algebra routines are fundamental building blocks of a large variety of scientific applications. Direct solvers, which are methods for solving linear systems via the factorization of matrices into products of triangular…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-02-21 Valentin Le Fèvre , Tetsuzo Usui , Marc Casas

GPU-Accelerated Cholesky Factorization of Block Tridiagonal Matrices

This paper presents a GPU-accelerated framework for solving block tridiagonal linear systems that arise naturally in numerous real-time applications across engineering and scientific computing. Through a multi-stage permutation strategy…

Optimization and Control · Mathematics 2026-01-08 Roland Schwan , Daniel Kuhn , Colin N. Jones

Parallel Cholesky Factorization for Banded Matrices using OpenMP Tasks

Cholesky factorization is a widely used method for solving linear systems involving symmetric, positive-definite matrices, and can be an attractive choice in applications where a high degree of numerical stability is needed. One such…

Numerical Analysis · Mathematics 2023-05-09 Felix Liu , Albin Fredriksson , Stefano Markidis

sTiles: An Accelerated Computational Framework for Sparse Factorizations of Structured Matrices

This paper introduces sTiles, a GPU-accelerated framework for factorizing sparse structured symmetric matrices. By leveraging tile algorithms for fine-grained computations, sTiles uses a structure-aware task execution flow to handle…

Performance · Computer Science 2025-01-07 Esmail Abdul Fattah , Hatem Ltaief , Havard Rue , David Keyes

Parallel GPU-Accelerated Randomized Construction of Approximate Cholesky Preconditioners

We introduce a parallel algorithm to construct a preconditioner for solving a large, sparse linear system where the coefficient matrix is a Laplacian matrix (a.k.a., graph Laplacian). Such a linear system arises from applications such as…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-05-30 Tianyu Liang , Chao Chen , Yotam Yaniv , Hengrui Luo , David Tench , Xiaoye S. Li , Aydin Buluc , James Demmel

Efficient parallelization strategy for real-time FE simulations

This paper introduces an efficient and generic framework for finite-element simulations under an implicit time integration scheme. Being compatible with generic constitutive models, a fast matrix assembly method exploits the fact that…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-12 Ziqiu Zeng , Hadrien Courtecuisse

An Asynchronous Task-based Fan-Both Sparse Cholesky Solver

Systems of linear equations arise at the heart of many scientific and engineering applications. Many of these linear systems are sparse; i.e., most of the elements in the coefficient matrix are zero. Direct methods based on matrix…

Mathematical Software · Computer Science 2016-08-24 Mathias Jacquelin , Yili Zheng , Esmond Ng , Katherine Yelick

Solving Linear Systems on a GPU with Hierarchically Off-Diagonal Low-Rank Approximations

We are interested in solving linear systems arising from three applications: (1) kernel methods in machine learning, (2) discretization of boundary integral equations from mathematical physics, and (3) Schur complements formed in the…

Numerical Analysis · Mathematics 2022-08-15 Chao Chen , Per-Gunnar Martinsson

Developing robust incomplete Cholesky factorizations in half precision arithmetic

Incomplete factorizations have long been popular general-purpose algebraic preconditioners for solving large sparse linear systems of equations. Guaranteeing the factorization is breakdown free while computing a high quality preconditioner…

Numerical Analysis · Mathematics 2025-02-04 Jennifer Scott , Miroslav Tůma

Sparse inverse Cholesky factorization of dense kernel matrices by greedy conditional selection

Dense kernel matrices resulting from pairwise evaluations of a kernel function arise naturally in machine learning and statistics. Previous work in constructing sparse approximate inverse Cholesky factors of such matrices by minimizing…

Computation · Statistics 2025-05-12 Stephen Huan , Joseph Guinness , Matthias Katzfuss , Houman Owhadi , Florian Schäfer

Spectrum-Revealing Cholesky Factorization for Kernel Methods

Kernel methods represent some of the most popular machine learning tools for data analysis. Since exact kernel methods can be prohibitively expensive for large problems, reliable low-rank matrix approximations and high-performance…

Numerical Analysis · Mathematics 2018-04-17 Jianwei Xiao , Ming Gu

GLU3.0: Fast GPU-based Parallel Sparse LU Factorization for Circuit Simulation

LU factorization for sparse matrices is the most important computing step for many engineering and scientific computing problems such as circuit simulation. But parallelizing LU factorization with the Graphic Processing Units (GPU) still…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-14 Shaoyi Peng , Sheldon X. -D. Tan

Comparing the Performance of Heterogeneous Conjugate Gradient and Cholesky Solvers on Various Hardware Using SYCL

Many important real-world applications, such as System Identification with Gaussian Processes, involve solving linear systems with symmetric positive-definite matrices. The iterative CG method and direct solvers based on the Cholesky…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-14 Tim Thüring , Alexander Strack , Dirk Pflüger

GPU accelerated matrix factorization of large scale data using block based approach

Matrix Factorization (MF) on large scale data takes substantial time on a Central Processing Unit (CPU). While Graphical Processing Unit (GPU)s could expedite the computation of MF, the available memory on a GPU is finite. Leveraging GPUs…

Machine Learning · Computer Science 2023-04-28 Prasad Bhavana , Vineet Padmanabhan

A computational study of low precision incomplete Cholesky factorization preconditioners for sparse linear least-squares problems

Our interest lies in the robust and efficient solution of large sparse linear least-squares problems. In recent years, hardware developments have led to a surge in interest in exploiting mixed precision arithmetic within numerical linear…

Numerical Analysis · Mathematics 2025-04-11 Jennifer Scott , Miroslav Tůma

A Hybrid Direct-Iterative Method for Solving KKT Linear Systems

We propose a solution strategy for linear systems arising in interior method optimization, which is suitable for implementation on hardware accelerators such as graphical processing units (GPUs). The current gold standard for solving these…

Optimization and Control · Mathematics 2026-03-09 Shaked Regev , Nai-Yuan Chiang , Eric Darve , Cosmin G. Petra , Michael A. Saunders , Kasia Świrydowicz , Slaven Peleš