Related papers: A Nested Krylov Method Using Half-Precision Arithm…

Compressed Basis GMRES on High Performance GPUs

Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the performance of practical realizations of these methods is constrained by the…

Mathematical Software · Computer Science 2020-09-28 José I. Aliaga , Hartwig Anzt , Thomas Grützmacher , Enrique S. Quintana-Ortí , Andrés E. Tomás

Mixed Precision GMRES-based Iterative Refinement with Recycling

With the emergence of mixed precision capabilities in hardware, iterative refinement schemes for solving linear systems $Ax=b$ have recently been revisited and reanalyzed in the context of three or more precisions. These new analyses show…

Numerical Analysis · Mathematics 2022-02-17 Eda Oktay , Erin Carson

Performance evaluation of accelerated real and complex multiple-precision sparse matrix-vector multiplication

Sparse matrices have recently played a significant and impactful role in scientific computing, including artificial intelligence-related fields. According to historical studies on sparse matrix--vector multiplication (SpMV), Krylov subspace…

Numerical Analysis · Mathematics 2024-12-24 Tomonori Kouya

Multistage Mixed Precision Iterative Refinement

Low precision arithmetic, in particular half precision floating point arithmetic, is now available in commercial hardware. Using lower precision can offer significant savings in computation and communication costs with proportional savings…

Numerical Analysis · Mathematics 2021-11-16 Eda Oktay , Erin Carson

Accelerating Geometric Multigrid Preconditioning with Half-Precision Arithmetic on GPUs

With the hardware support for half-precision arithmetic on NVIDIA V100 GPUs, high-performance computing applications can benefit from lower precision at appropriate spots to speed up the overall execution time. In this paper, we investigate…

Mathematical Software · Computer Science 2020-07-16 Kyaw L. Oo , Andreas Vogel

Three-precision iterative refinement with parameter regularization and prediction for solving large sparse linear systems

This study presents a novel mixed-precision iterative refinement algorithm, GADI-IR, within the general alternating-direction implicit (GADI) framework, designed for efficiently solving large-scale sparse linear systems. By employing…

Numerical Analysis · Mathematics 2025-03-24 Jifeng Ge , Juan Zhang

On an integrated Krylov-ADI solver for large-scale Lyapunov equations

One of the most computationally expensive steps of the low-rank ADI method for large-scale Lyapunov equations is the solution of a shifted linear system at each iteration. We propose the use of the extended Krylov subspace method for this…

Numerical Analysis · Mathematics 2022-08-09 Peter Benner , Davide Palitta , Jens Saak

Improving the Performance of the GMRES Method using Mixed-Precision Techniques

The GMRES method is used to solve sparse, non-symmetric systems of linear equations arising from many scientific applications. The solver performance within a single node is memory bound, due to the low arithmetic intensity of its…

Numerical Analysis · Mathematics 2020-11-04 Neil Lindquist , Piotr Luszczek , Jack Dongarra

Iterative Methods at Lower Precision

Since numbers in the computer are represented with a fixed number of bits, loss of accuracy during calculation is unavoidable. At high precision where more bits (e.g. 64) are allocated to each number, round-off errors are typically small.…

Numerical Analysis · Mathematics 2022-10-11 Yizhou Chen , Xiaoyun Gong , Xiang Ji

An Introduction to Krylov Subspace Methods

Nowadays, many fields of study are have to deal with large and sparse data matrixes, but the most important issue is finding the inverse of these matrixes. Thankfully, Krylov subspace methods can be used in solving these types of problem.…

Optimization and Control · Mathematics 2018-11-26 Shitao Fan

Experimental Evaluation of Multiprecision Strategies for GMRES on GPUs

Support for lower precision computation is becoming more common in accelerator hardware due to lower power usage, reduced data movement and increased computational performance. However, computational science and engineering (CSE) problems…

Numerical Analysis · Mathematics 2021-05-18 Jennifer A. Loe , Christian A. Glusa , Ichitaro Yamazaki , Erik G. Boman , Sivasankaran Rajamanickam

Randomized sketched TT-GMRES for linear systems with tensor structure

In the last decade, tensors have shown their potential as valuable tools for various tasks in numerical linear algebra. While most of the research has been focusing on how to compress a given tensor in order to maintain information as well…

Numerical Analysis · Mathematics 2024-09-17 Alberto Bucci , Davide Palitta , Leonardo Robol

Avoiding breakdown in incomplete factorizations in low precision arithmetic

The emergence of low precision floating-point arithmetic in computer hardware has led to a resurgence of interest in the use of mixed precision numerical linear algebra. For linear systems of equations, there has been renewed enthusiasm for…

Numerical Analysis · Mathematics 2024-02-22 Jennifer Scott , Miroslav Tůma

Krylov-Type Methods for Tensor Computations

Several Krylov-type procedures are introduced that generalize matrix Krylov methods for tensor computations. They are denoted minimal Krylov recursion, maximal Krylov recursion, contracted tensor product Krylov recursion. It is proved that…

Numerical Analysis · Mathematics 2010-05-07 Berkant Savas , Lars Eldén

Pipelined, Flexible Krylov Subspace Methods

We present variants of the Conjugate Gradient (CG), Conjugate Residual (CR), and Generalized Minimal Residual (GMRES) methods which are both pipelined and flexible. These allow computation of inner products and norms to be overlapped with…

Numerical Analysis · Mathematics 2016-09-16 Patrick Sanan , Sascha M. Schnepp , Dave. A. May

A Study of Mixed Precision Strategies for GMRES on GPUs

Support for lower precision computation is becoming more common in accelerator hardware due to lower power usage, reduced data movement and increased computational performance. However, computational science and engineering (CSE) problems…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-09-06 Jennifer A. Loe , Christian A. Glusa , Ichitaro Yamazaki , Erik G. Boman , Sivasankaran Rajamanickam

Mixed precision solvers with half-precision floating point numbers for Lattice QCD on A64FX processor

We investigate the use of half-precision floating-point numbers (FP16) in mixed-precision linear solvers for lattice QCD simulations. Since the emergence of GPUs for general-purpose, mixed-precision algorithms that combine single-precision…

High Energy Physics - Lattice · Physics 2026-02-17 Issaku Kanamori , Hideo Matsufuru , Tatsumi Aoyama , Kazuyuki Kanaya , Yusuke Namekawa , Hidekatsu Nemura , Keigo Nitadori

General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of BiCGStab methods

Parallel implementations of Krylov subspace methods often help to accelerate the procedure of finding an approximate solution of a linear system. However, such parallelization coupled with asynchronous and out-of-order execution often…

Mathematical Software · Computer Science 2023-02-09 Roman Iakymchuk , Jose I. Aliaga

A method for accelerating low precision operations by sparse matrix multiplication

In recent years, the fervent demand for computational power across various domains has prompted hardware manufacturers to introduce specialized computing hardware aimed at enhancing computational capabilities. Particularly, the utilization…

Numerical Analysis · Mathematics 2024-03-12 Hongyaoxing Gu

A Stochastic Performance Model for Pipelined Krylov Methods

Pipelined Krylov methods seek to ameliorate the latency due to inner products necessary for projection by overlapping it with the computation associated with sparse matrix-vector multiplication. We clarify a folk theorem that this can only…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-02-17 Hannah Morgan , Matthew G. Knepley , Patrick Sanan , L. Ridgway Scott