Related papers: Augmented sparse principal component analysis for …

Minimax bounds for sparse PCA with noisy high-dimensional data

We study the problem of estimating the leading eigenvectors of a high-dimensional population covariance matrix based on independent Gaussian observations. We establish a lower bound on the minimax risk of estimators under the $l_2$ loss, in…

Statistics Theory · Mathematics 2012-03-06 Aharon Birnbaum , Iain M. Johnstone , Boaz Nadler , Debashis Paul

Minimax Rates of Estimation for Sparse PCA in High Dimensions

We study sparse principal components analysis in the high-dimensional setting, where $p$ (the number of variables) can be much larger than $n$ (the number of observations). We prove optimal, non-asymptotic lower and upper bounds on the…

Machine Learning · Statistics 2012-02-07 Vincent Q. Vu , Jing Lei

De-biased sparse PCA: Inference and testing for eigenstructure of large covariance matrices

Sparse principal component analysis (sPCA) has become one of the most widely used techniques for dimensionality reduction in high-dimensional datasets. The main challenge underlying sPCA is to estimate the first vector of loadings of the…

Methodology · Statistics 2018-02-01 Jana Janková , Sara van de Geer

Sparse PCA: Optimal rates and adaptive estimation

Principal component analysis (PCA) is one of the most commonly used statistical procedures with a wide range of applications. This paper considers both minimax and adaptive estimation of the principal subspace in the high dimensional…

Statistics Theory · Mathematics 2014-01-08 T. Tony Cai , Zongming Ma , Yihong Wu

Rate-optimal posterior contraction for sparse PCA

Principal component analysis (PCA) is possibly one of the most widely used statistical tools to recover a low-rank structure of the data. In the high-dimensional settings, the leading eigenvector of the sample covariance can be nearly…

Statistics Theory · Mathematics 2015-04-06 Chao Gao , Harrison H. Zhou

Quantifying the Estimation Error of Principal Components

Principal component analysis is an important pattern recognition and dimensionality reduction tool in many applications. Principal components are computed as eigenvectors of a maximum likelihood covariance $\widehat{\Sigma}$ that…

Statistics Theory · Mathematics 2017-10-30 Raphael Hauser , Raul Kangro , Jüri Lember , Heinrich Matzinger

Sparse principal component analysis and iterative thresholding

Principal component analysis (PCA) is a classical dimension reduction method which projects data onto the principal subspace spanned by the leading eigenvectors of the covariance matrix. However, it behaves poorly when the number of…

Statistics Theory · Mathematics 2013-05-27 Zongming Ma

High-dimensional analysis of semidefinite relaxations for sparse principal components

Principal component analysis (PCA) is a classical method for dimensionality reduction based on extracting the dominant eigenvectors of the sample covariance matrix. However, PCA is well known to behave poorly in the ``large $p$, small $n$''…

Statistics Theory · Mathematics 2009-08-26 Arash A. Amini , Martin J. Wainwright

Residual Component Analysis: Generalising PCA for more flexible inference in linear-Gaussian models

Probabilistic principal component analysis (PPCA) seeks a low dimensional representation of a data set in the presence of independent spherical Gaussian noise. The maximum likelihood solution for the model is an eigenvalue problem on the…

Machine Learning · Computer Science 2012-06-22 Alfredo Kalaitzis , Neil Lawrence

Orthogonal Sparse PCA and Covariance Estimation via Procrustes Reformulation

The problem of estimating sparse eigenvectors of a symmetric matrix attracts a lot of attention in many applications, especially those with high dimensional data set. While classical eigenvectors can be obtained as the solution of a…

Machine Learning · Statistics 2016-11-03 Konstantinos Benidis , Ying Sun , Prabhu Babu , Daniel P. Palomar

Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices

Estimating a covariance matrix and its associated principal components is a fundamental problem in contemporary statistics. While optimal estimation procedures have been developed with well-understood properties, the increasing demand for…

Statistics Theory · Mathematics 2024-09-30 T. Tony Cai , Dong Xia , Mengyue Zha

The High-Dimensional Asymptotics of Principal Component Regression

We study principal components regression (PCR) in an asymptotic high-dimensional regression setting, where the number of data points is proportional to the dimension. We derive exact limiting formulas for the estimation and prediction…

Statistics Theory · Mathematics 2025-09-18 Alden Green , Elad Romanov

Sparse principal component analysis via axis-aligned random projections

We introduce a new method for sparse principal component analysis, based on the aggregation of eigenvector information from carefully-selected axis-aligned random projections of the sample covariance matrix. Unlike most alternative…

Methodology · Statistics 2019-05-07 Milana Gataric , Tengyao Wang , Richard J. Samworth

Statistical and computational trade-offs in estimation of sparse principal components

In recent years, sparse principal component analysis has emerged as an extremely popular dimension reduction technique for high-dimensional data. The theoretical challenge, in the simplest case, is to estimate the leading eigenvector of a…

Statistics Theory · Mathematics 2016-09-29 Tengyao Wang , Quentin Berthet , Richard J. Samworth

Minimax sparse principal subspace estimation in high dimensions

We study sparse principal components analysis in high dimensions, where $p$ (the number of variables) can be much larger than $n$ (the number of observations), and analyze the problem of estimating the subspace spanned by the principal…

Statistics Theory · Mathematics 2014-01-06 Vincent Q. Vu , Jing Lei

Beyond Regularization: Inherently Sparse Principal Component Analysis

Sparse principal component analysis (sparse PCA) is a widely used technique for dimensionality reduction in multivariate analysis, addressing two key limitations of standard PCA. First, sparse PCA can be implemented in high-dimensional low…

Methodology · Statistics 2025-10-07 Jan O. Bauer

PAC-Bayesian bounds for Principal Component Analysis in Hilbert spaces

Based on some new robust estimators of the covariance matrix, we propose stable versions of Principal Component Analysis (PCA) and we qualify it independently of the dimension of the ambient space. We first provide a robust estimator of the…

Statistics Theory · Mathematics 2015-11-20 Ilaria Giulini

Sparse Principal Component Analysis with missing observations

In this paper, we study the problem of sparse Principal Component Analysis (PCA) in the high-dimensional setting with missing observations. Our goal is to estimate the first principal component when we only have access to partial…

Statistics Theory · Mathematics 2012-06-04 Karim Lounici

Sparse PCA: Convex Relaxations, Algorithms and Applications

Given a sample covariance matrix, we examine the problem of maximizing the variance explained by a linear combination of the input variables while constraining the number of nonzero coefficients in this combination. This is known as sparse…

Optimization and Control · Mathematics 2010-12-24 Youwei Zhang , Alexandre d'Aspremont , Laurent El Ghaoui

Sparse PCA through Low-rank Approximations

We introduce a novel algorithm that computes the $k$-sparse principal component of a positive semidefinite matrix $A$. Our algorithm is combinatorial and operates by examining a discrete set of special vectors lying in a low-dimensional…

Machine Learning · Statistics 2014-05-09 Dimitris S. Papailiopoulos , Alexandros G. Dimakis , Stavros Korokythakis