English

Efficient Frequent Directions Algorithm for Sparse Matrices

Data Structures and Algorithms 2016-02-18 v2

Abstract

This paper describes Sparse Frequent Directions, a variant of Frequent Directions for sketching sparse matrices. It resembles the original algorithm in many ways: both receive the rows of an input matrix An×dA^{n \times d} one by one in the streaming setting and compute a small sketch BR×dB \in R^{\ell \times d}. Both share the same strong (provably optimal) asymptotic guarantees with respect to the space-accuracy tradeoff in the streaming setting. However, unlike Frequent Directions which runs in O(nd)O(nd\ell) time regardless of the sparsity of the input matrix AA, Sparse Frequent Directions runs in O~(nnz(A)+n2)\tilde{O} (nnz(A)\ell + n\ell^2) time. Our analysis loosens the dependence on computing the Singular Value Decomposition (SVD) as a black box within the Frequent Directions algorithm. Our bounds require recent results on the properties of fast approximate SVD computations. Finally, we empirically demonstrate that these asymptotic improvements are practical and significant on real and synthetic data.

Keywords

Cite

@article{arxiv.1602.00412,
  title  = {Efficient Frequent Directions Algorithm for Sparse Matrices},
  author = {Mina Ghashami and Edo Liberty and Jeff M. Phillips},
  journal= {arXiv preprint arXiv:1602.00412},
  year   = {2016}
}
R2 v1 2026-06-22T12:40:38.202Z