English

Implementing Communication-Optimal Parallel and Sequential QR Factorizations

Numerical Analysis 2008-09-16 v1

Abstract

We present parallel and sequential dense QR factorization algorithms for tall and skinny matrices and general rectangular matrices that both minimize communication, and are as stable as Householder QR. The sequential and parallel algorithms for tall and skinny matrices lead to significant speedups in practice over some of the existing algorithms, including LAPACK and ScaLAPACK, for example up to 6.7x over ScaLAPACK. The parallel algorithm for general rectangular matrices is estimated to show significant speedups over ScaLAPACK, up to 22x over ScaLAPACK.

Keywords

Cite

@article{arxiv.0809.2407,
  title  = {Implementing Communication-Optimal Parallel and Sequential QR Factorizations},
  author = {James Demmel and Laura Grigori and Mark Hoemmen and Julien Langou},
  journal= {arXiv preprint arXiv:0809.2407},
  year   = {2008}
}
R2 v1 2026-06-21T11:20:05.739Z