Related papers: A randomized algorithm for principal component ana…

Generalized Low Rank Models

Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal,…

Machine Learning · Statistics 2015-05-06 Madeleine Udell, Corinne Horn, Reza Zadeh, Stephen Boyd

An algorithm for the principal component analysis of large data sets

Recently popularized randomized methods for principal component analysis (PCA) efficiently and reliably produce nearly optimal accuracy --- even on parallel processors --- unlike the classical (deterministic) alternatives. We adapt one of…

Computation · Statistics 2011-12-23 Nathan Halko, Per-Gunnar Martinsson, Yoel Shkolnisky, Mark Tygert

A random version of principal component analysis in data clustering

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance-correlation matrix of the analyzed data. However to properly work with high-dimensional data, PCA poses severe mathematical…

Quantitative Methods · Quantitative Biology 2018-10-18 Luigi Leonardo Palese

Near-Optimal Stochastic Approximation for Online Principal Component Estimation

Principal component analysis (PCA) has been a prominent tool for high-dimensional data analysis. Online algorithms that estimate the principal component by processing streaming data are of tremendous practical and theoretical interests.…

Optimization and Control · Mathematics 2017-10-09 Chris Junchi Li, Mengdi Wang, Han Liu, Tong Zhang

Robust PCA via Nonconvex Rank Approximation

Numerous applications in data mining and machine learning require recovering a matrix of minimal rank. Robust principal component analysis (RPCA) is a general framework for handling this kind of problems. Nuclear norm based convex surrogate…

Computer Vision and Pattern Recognition · Computer Science 2016-11-17 Zhao Kang, Chong Peng, Qiang Cheng

Approximation Algorithms for Sparse Principal Component Analysis

Principal component analysis (PCA) is a widely used dimension reduction technique in machine learning and multivariate statistics. To improve the interpretability of PCA, various approaches to obtain sparse principal direction loadings have…

Data Structures and Algorithms · Computer Science 2021-06-07 Agniva Chowdhury, Petros Drineas, David P. Woodruff, Samson Zhou

An implementation of a randomized algorithm for principal component analysis

Recent years have witnessed intense development of randomized methods for low-rank approximation. These methods target principal component analysis (PCA) and the calculation of truncated singular value decompositions (SVD). The present…

Computation · Statistics 2017-01-02 Arthur Szlam, Yuval Kluger, Mark Tygert

Maximum Margin Principal Components

Principal Component Analysis (PCA) is a very successful dimensionality reduction technique, widely used in predictive modeling. A key factor in its widespread use in this domain is the fact that the projection of a dataset onto its first…

Machine Learning · Statistics 2017-05-19 Xianghui Luo, Robert J. Durrant

Near-Optimal Algorithms for Differentially-Private Principal Components

Principal components analysis (PCA) is a standard tool for identifying good low-dimensional approximations to data in high dimension. Many data sets of interest contain private or sensitive information about individuals. Algorithms which…

Machine Learning · Statistics 2013-08-09 Kamalika Chaudhuri, Anand D. Sarwate, Kaushik Sinha

Optimal Sparse Linear Auto-Encoders and Sparse PCA

Principal components analysis (PCA) is the optimal linear auto-encoder of data, and it is often used to construct features. Enforcing sparsity on the principal components can promote better generalization, while improving the…

Machine Learning · Computer Science 2015-02-25 Malik Magdon-Ismail, Christos Boutsidis

Sparse PCA through Low-rank Approximations

We introduce a novel algorithm that computes the $k$-sparse principal component of a positive semidefinite matrix $A$. Our algorithm is combinatorial and operates by examining a discrete set of special vectors lying in a low-dimensional…

Machine Learning · Statistics 2014-05-09 Dimitris S. Papailiopoulos, Alexandros G. Dimakis, Stavros Korokythakis

Conformalized Robust Principal Component Analysis

Robust principal component analysis (RPCA) is a widely used technique for recovering low-rank structure from matrices with missing entries and sparse, possibly large-magnitude corruptions. Although numerous algorithms achieve accurate point…

Methodology · Statistics 2026-03-17 Liangliang Yuan, Lei Wang, Quan Kong, Liuhua Peng

Low-Rank Principal Eigenmatrix Analysis

Sparse PCA is a widely used technique for high-dimensional data analysis. In this paper, we propose a new method called low-rank principal eigenmatrix analysis. Different from sparse PCA, the dominant eigenvectors are allowed to be dense…

Machine Learning · Statistics 2019-04-30 Krishna Balasubramanian, Elynn Y. Chen, Jianqing Fan, Xiang Wu

A Fast Algorithm for a Weighted Low Rank Approximation

Matrix low rank approximation including the classical PCA and the robust PCA (RPCA) method have been applied to solve the background modeling problem in video analysis. Recently, it has been demonstrated that a special weighted low rank…

Optimization and Control · Mathematics 2017-03-21 Aritra Dutta, Xin Li

Sketching for Principal Component Regression

Principal component regression (PCR) is a useful method for regularizing linear regression. Although conceptually simple, straightforward implementations of PCR have high computational costs and so are inappropriate when learning with large…

Numerical Analysis · Mathematics 2019-03-08 Liron Mor-Yosef, Haim Avron

Fast Randomized PCA for Sparse Data

Principal component analysis (PCA) is widely used for dimension reduction and embedding of real data in social network analysis, information retrieval, and natural language processing, etc. In this work we propose a fast randomized PCA…

Machine Learning · Computer Science 2018-10-17 Xu Feng, Yuyang Xie, Mingye Song, Wenjian Yu, Jie Tang

Sparse PCA: Optimal rates and adaptive estimation

Principal component analysis (PCA) is one of the most commonly used statistical procedures with a wide range of applications. This paper considers both minimax and adaptive estimation of the principal subspace in the high dimensional…

Statistics Theory · Mathematics 2014-01-08 T. Tony Cai, Zongming Ma, Yihong Wu

Sparse Principal Components Analysis

Principal components analysis (PCA) is a classical method for the reduction of dimensionality of data in the form of n observations (or cases) of a vector with p variables. For a simple model of factor analysis type, it is proved that…

Statistics Theory · Mathematics 2009-01-29 Iain M Johnstone, Arthur Yu Lu

On the Estimation of Coherence

Low-rank matrix approximations are often used to help scale standard machine learning algorithms to large-scale problems. Recently, matrix coherence has been used to characterize the ability to extract global information from a subset of…

Machine Learning · Statistics 2010-09-07 Mehryar Mohri, Ameet Talwalkar

Sparse PCA With Multiple Components

Sparse Principal Component Analysis (sPCA) is a cardinal technique for obtaining combinations of features, or principal components (PCs), that explain the variance of high-dimensional datasets in an interpretable manner. This involves…

Optimization and Control · Mathematics 2025-12-02 Ryan Cory-Wright, Jean Pauphilet