English

Efficient volume sampling for row/column subset selection

Data Structures and Algorithms 2010-04-26 v1

Abstract

We give efficient algorithms for volume sampling, i.e., for picking kk-subsets of the rows of any given matrix with probabilities proportional to the squared volumes of the simplices defined by them and the origin (or the squared volumes of the parallelepipeds defined by these subsets of rows). This solves an open problem from the monograph on spectral algorithms by Kannan and Vempala. Our first algorithm for volume sampling kk-subsets of rows from an mm-by-nn matrix runs in O(kmnωlogn)O(kmn^{\omega} \log n) arithmetic operations and a second variant of it for (1+ϵ)(1+\epsilon)-approximate volume sampling runs in O(mnlogmk2/ϵ2+mlogωmk2ω+1/ϵ2ωlog(kϵ1logm))O(mn \log m \cdot k^{2}/\epsilon^{2} + m \log^{\omega} m \cdot k^{2\omega+1}/\epsilon^{2\omega} \cdot \log(k \epsilon^{-1} \log m)) arithmetic operations, which is almost linear in the size of the input (i.e., the number of entries) for small kk. Our efficient volume sampling algorithms imply several interesting results for low-rank matrix approximation.

Keywords

Cite

@article{arxiv.1004.4057,
  title  = {Efficient volume sampling for row/column subset selection},
  author = {Amit Deshpande and Luis Rademacher},
  journal= {arXiv preprint arXiv:1004.4057},
  year   = {2010}
}
R2 v1 2026-06-21T15:13:50.578Z