English
Related papers

Related papers: Oja's Algorithm for Streaming Sparse PCA

200 papers

This work provides improved guarantees for streaming principle component analysis (PCA). Given $A_1, \ldots, A_n\in \mathbb{R}^{d\times d}$ sampled independently from distributions satisfying $\mathbb{E}[A_i] = \Sigma$ for $\Sigma \succeq…

Machine Learning · Computer Science 2016-03-29 Prateek Jain , Chi Jin , Sham M. Kakade , Praneeth Netrapalli , Aaron Sidford

We study streaming principal component analysis (PCA), that is to find, in $O(dk)$ space, the top $k$ eigenvectors of a $d\times d$ hidden matrix $\bf \Sigma$ with online vectors drawn from covariance matrix $\bf \Sigma$. We provide…

Optimization and Control · Mathematics 2017-04-18 Zeyuan Allen-Zhu , Yuanzhi Li

Since its inception in 1982, Oja's algorithm has become an established method for streaming principle component analysis (PCA). We study the problem of streaming PCA, where the data-points are sampled from an irreducible, aperiodic, and…

Statistics Theory · Mathematics 2023-06-21 Syamantak Kumar , Purnamrita Sarkar

Principal Component Analysis (PCA) is a widely used technique in machine learning, data analysis and signal processing. With the increase in the size and complexity of datasets, it has become important to develop low-space usage algorithms…

Machine Learning · Computer Science 2023-03-09 Yichuan Deng , Zhao Song , Zifan Wang , Han Zhang

Low-precision streaming PCA estimates the top principal component in a streaming setting under limited precision. We establish an information-theoretic lower bound on the quantization resolution required to achieve a target accuracy for the…

Machine Learning · Computer Science 2025-10-28 Sanjoy Dasgupta , Syamantak Kumar , Shourya Pandey , Purnamrita Sarkar

We propose a novel statistical inference framework for streaming principal component analysis (PCA) using Oja's algorithm, enabling the construction of confidence intervals for individual entries of the estimated eigenvector. Most existing…

Statistics Theory · Mathematics 2025-07-22 Syamantak Kumar , Shourya Pandey , Purnamrita Sarkar

Oja's rule [Oja, Journal of mathematical biology 1982] is a well-known biologically-plausible algorithm using a Hebbian-type synaptic update rule to solve streaming principal component analysis (PCA). Computational neuroscientists have…

Neurons and Cognition · Quantitative Biology 2020-06-19 Chi-Ning Chou , Mien Brabeeba Wang

We analyze Oja's algorithm for streaming $k$-PCA and prove that it achieves performance nearly matching that of an optimal offline algorithm. Given access to a sequence of i.i.d. $d \times d$ symmetric matrices, we show that Oja's algorithm…

Data Structures and Algorithms · Computer Science 2021-02-09 De Huang , Jonathan Niles-Weed , Rachel Ward

In this paper we analyze the behavior of the Oja's algorithm for online/streaming principal component subspace estimation. It is proved that with high probability it performs an efficient, gap-free, global convergence rate to approximate an…

Machine Learning · Computer Science 2024-03-06 Xin Liang

Oja's algorithm has been the cornerstone of streaming methods in Principal Component Analysis (PCA) since it was first proposed in 1982. However, Oja's algorithm does not have a standardized choice of learning rate (step size) that both…

Machine Learning · Statistics 2019-11-04 Amelia Henriksen , Rachel Ward

We study the dynamics of an online algorithm for learning a sparse leading eigenvector from samples generated from a spiked covariance model. This algorithm combines the classical Oja's method for online PCA with an element-wise…

Information Theory · Computer Science 2016-09-09 Chuang Wang , Yue M. Lu

Oja's algorithm of principal component analysis (PCA) has been one of the methods utilized in practice to reduce dimension. In this paper, we focus on the convergence property of the discrete algorithm. To realize that, we view the…

Numerical Analysis · Mathematics 2023-01-05 Jian-Guo Liu , Zibu Liu

We consider streaming, one-pass principal component analysis (PCA), in the high-dimensional regime, with limited memory. Here, $p$-dimensional samples are presented sequentially, and the goal is to produce the $k$-dimensional subspace that…

Machine Learning · Statistics 2013-07-02 Ioannis Mitliagkas , Constantine Caramanis , Prateek Jain

In streaming PCA, we see a stream of vectors $x_1, \dotsc, x_n \in \mathbb{R}^d$ and want to estimate the top eigenvector of their covariance matrix. This is easier if the spectral ratio $R = \lambda_1 / \lambda_2$ is large. We ask: how…

Data Structures and Algorithms · Computer Science 2024-08-21 Eric Price , Zhiyang Xun

Principal component analysis (PCA) is a classical dimension reduction method which projects data onto the principal subspace spanned by the leading eigenvectors of the covariance matrix. However, it behaves poorly when the number of…

Statistics Theory · Mathematics 2013-05-27 Zongming Ma

In this paper, we propose to adopt the diffusion approximation tools to study the dynamics of Oja's iteration which is an online stochastic gradient descent method for the principal component analysis. Oja's iteration maintains a running…

Machine Learning · Statistics 2018-08-30 Chris Junchi Li , Mengdi Wang , Han Liu , Tong Zhang

In this paper we propose a new algorithm for streaming principal component analysis. With limited memory, small devices cannot store all the samples in the high-dimensional regime. Streaming principal component analysis aims to find the…

Machine Learning · Statistics 2018-02-16 Puyudi Yang , Cho-Jui Hsieh , Jane-Ling Wang

Sparse Principal Component Analysis (sPCA) is a cardinal technique for obtaining combinations of features, or principal components (PCs), that explain the variance of high-dimensional datasets in an interpretable manner. This involves…

Optimization and Control · Mathematics 2025-12-02 Ryan Cory-Wright , Jean Pauphilet

Principal component analysis (PCA) is one of the most powerful tools in machine learning. The simplest method for PCA, the power iteration, requires $\mathcal O(1/\Delta)$ full-data passes to recover the principal component of a matrix with…

Optimization and Control · Mathematics 2017-07-11 Christopher De Sa , Bryan He , Ioannis Mitliagkas , Christopher Ré , Peng Xu

We consider streaming principal component analysis when the stochastic data-generating model is subject to perturbations. While existing models assume a fixed covariance, we adopt a robust perspective where the covariance matrix belongs to…

Machine Learning · Statistics 2022-10-13 Daniel Bienstock , Minchan Jeong , Apurv Shukla , Se-Young Yun
‹ Prev 1 2 3 10 Next ›