English

Communication-optimal parallel and sequential QR and LU factorizations: theory and practice

Numerical Analysis 2008-08-29 v3

Abstract

We present parallel and sequential dense QR factorization algorithms that are both optimal (up to polylogarithmic factors) in the amount of communication they perform, and just as stable as Householder QR. Our first algorithm, Tall Skinny QR (TSQR), factors m-by-n matrices in a one-dimensional (1-D) block cyclic row layout, and is optimized for m >> n. Our second algorithm, CAQR (Communication-Avoiding QR), factors general rectangular matrices distributed in a two-dimensional block cyclic layout. It invokes TSQR for each block column factorization.

Keywords

Cite

@article{arxiv.0806.2159,
  title  = {Communication-optimal parallel and sequential QR and LU factorizations: theory and practice},
  author = {James Demmel and Laura Grigori and Mark Hoemmen and Julien Langou},
  journal= {arXiv preprint arXiv:0806.2159},
  year   = {2008}
}
R2 v1 2026-06-21T10:50:09.141Z