English

A numerically stable communication-avoiding s-step GMRES algorithm

Numerical Analysis 2024-07-29 v3 Numerical Analysis

Abstract

Krylov subspace methods are extensively used in scientific computing to solve large-scale linear systems. However, the performance of these iterative Krylov solvers on modern supercomputers is limited by expensive communication costs. The ss-step strategy generates a series of ss Krylov vectors at a time to avoid communication. Asymptotically, the ss-step approach can reduce communication latency by a factor of ss. Unfortunately, due to finite-precision implementation, the step size has to be kept small for stability. In this work, we tackle the numerical instabilities encountered in the ss-step GMRES algorithm. By choosing an appropriate polynomial basis and block orthogonalization schemes, we construct a communication avoiding ss-step GMRES algorithm that automatically selects the optimal step size to ensure numerical stability. To further maximize communication savings, we introduce scaled Newton polynomials that can increase the step size ss to a few hundreds for many problems. An initial step size estimator is also developed to efficiently choose the optimal step size for stability. The guaranteed stability of the proposed algorithm is demonstrated using numerical experiments. In the process, we also evaluate how the choice of polynomial and preconditioning affects the stability limit of the algorithm. Finally, we show parallel scalability on more than 114,000 cores in a distributed-memory setting. Perfectly linear scaling has been observed in both strong and weak scaling studies with negligible communication costs.

Keywords

Cite

@article{arxiv.2303.08953,
  title  = {A numerically stable communication-avoiding s-step GMRES algorithm},
  author = {Zan Xu and Juan J. Alonso and Eric Darve},
  journal= {arXiv preprint arXiv:2303.08953},
  year   = {2024}
}

Comments

36 pages, 15 figures

R2 v1 2026-06-28T09:19:27.810Z