English

Information-theoretically Optimal Sparse PCA

Information Theory 2014-05-06 v2 math.IT Statistics Theory Statistics Theory

Abstract

Sparse Principal Component Analysis (PCA) is a dimensionality reduction technique wherein one seeks a low-rank representation of a data matrix with additional sparsity constraints on the obtained representation. We consider two probabilistic formulations of sparse PCA: a spiked Wigner and spiked Wishart (or spiked covariance) model. We analyze an Approximate Message Passing (AMP) algorithm to estimate the underlying signal and show, in the high dimensional limit, that the AMP estimates are information-theoretically optimal. As an immediate corollary, our results demonstrate that the posterior expectation of the underlying signal, which is often intractable to compute, can be obtained using a polynomial-time scheme. Our results also effectively provide a single-letter characterization of the sparse PCA problem.

Keywords

Cite

@article{arxiv.1402.2238,
  title  = {Information-theoretically Optimal Sparse PCA},
  author = {Yash Deshpande and Andrea Montanari},
  journal= {arXiv preprint arXiv:1402.2238},
  year   = {2014}
}

Comments

5 pages, 1 figure, conference

R2 v1 2026-06-22T03:05:01.563Z