English
Related papers

Related papers: Robust Streaming PCA

200 papers

Principal Component Analysis (PCA) is a widely used technique in machine learning, data analysis and signal processing. With the increase in the size and complexity of datasets, it has become important to develop low-space usage algorithms…

Machine Learning · Computer Science 2023-03-09 Yichuan Deng , Zhao Song , Zifan Wang , Han Zhang

Since its inception in 1982, Oja's algorithm has become an established method for streaming principle component analysis (PCA). We study the problem of streaming PCA, where the data-points are sampled from an irreducible, aperiodic, and…

Statistics Theory · Mathematics 2023-06-21 Syamantak Kumar , Purnamrita Sarkar

Low-precision streaming PCA estimates the top principal component in a streaming setting under limited precision. We establish an information-theoretic lower bound on the quantization resolution required to achieve a target accuracy for the…

Machine Learning · Computer Science 2025-10-28 Sanjoy Dasgupta , Syamantak Kumar , Shourya Pandey , Purnamrita Sarkar

Oja's rule [Oja, Journal of mathematical biology 1982] is a well-known biologically-plausible algorithm using a Hebbian-type synaptic update rule to solve streaming principal component analysis (PCA). Computational neuroscientists have…

Neurons and Cognition · Quantitative Biology 2020-06-19 Chi-Ning Chou , Mien Brabeeba Wang

We propose a novel statistical inference framework for streaming principal component analysis (PCA) using Oja's algorithm, enabling the construction of confidence intervals for individual entries of the estimated eigenvector. Most existing…

Statistics Theory · Mathematics 2025-07-22 Syamantak Kumar , Shourya Pandey , Purnamrita Sarkar

This work provides improved guarantees for streaming principle component analysis (PCA). Given $A_1, \ldots, A_n\in \mathbb{R}^{d\times d}$ sampled independently from distributions satisfying $\mathbb{E}[A_i] = \Sigma$ for $\Sigma \succeq…

Machine Learning · Computer Science 2016-03-29 Prateek Jain , Chi Jin , Sham M. Kakade , Praneeth Netrapalli , Aaron Sidford

Kernel principal component analysis (KPCA) provides a concise set of basis vectors which capture non-linear structures within large data sets, and is a central tool in data analysis and learning. To allow for non-linear relations, typically…

Data Structures and Algorithms · Computer Science 2015-12-17 Mina Ghashami , Daniel Perry , Jeff M. Phillips

We provide a new robust convergence analysis of the well-known power method for computing the dominant singular vectors of a matrix that we call the noisy power method. Our result characterizes the convergence behavior of the algorithm when…

Data Structures and Algorithms · Computer Science 2015-02-05 Moritz Hardt , Eric Price

Fair Principal Component Analysis (PCA) is a problem setting where we aim to perform PCA while making the resulting representation fair in that the projected distributions, conditional on the sensitive attributes, match one another.…

Machine Learning · Statistics 2023-10-31 Junghyun Lee , Hanseul Cho , Se-Young Yun , Chulhee Yun

We study the Principal Component Analysis (PCA) problem in the distributed and streaming models of computation. Given a matrix $A \in R^{m \times n},$ a rank parameter $k < rank(A)$, and an accuracy parameter $0 < \epsilon < 1$, we want to…

Data Structures and Algorithms · Computer Science 2016-07-13 Christos Boutsidis , David P. Woodruff , Peilin Zhong

We study streaming principal component analysis (PCA), that is to find, in $O(dk)$ space, the top $k$ eigenvectors of a $d\times d$ hidden matrix $\bf \Sigma$ with online vectors drawn from covariance matrix $\bf \Sigma$. We provide…

Optimization and Control · Mathematics 2017-04-18 Zeyuan Allen-Zhu , Yuanzhi Li

In this paper we analyze the behavior of the Oja's algorithm for online/streaming principal component subspace estimation. It is proved that with high probability it performs an efficient, gap-free, global convergence rate to approximate an…

Machine Learning · Computer Science 2024-03-06 Xin Liang

Oja's algorithm for Streaming Principal Component Analysis (PCA) for $n$ data-points in a $d$ dimensional space achieves the same sin-squared error $O(r_{\mathsf{eff}}/n)$ as the offline algorithm in $O(d)$ space and $O(nd)$ time and a…

Statistics Theory · Mathematics 2025-03-12 Syamantak Kumar , Purnamrita Sarkar

Stochastic optimization naturally arises in machine learning. Efficient algorithms with provable guarantees, however, are still largely missing, when the objective function is nonconvex and the data points are dependent. This paper studies…

Machine Learning · Computer Science 2018-10-02 Minshuo Chen , Lin Yang , Mengdi Wang , Tuo Zhao

We propose a robust principal component analysis (RPCA) framework to recover low-rank and sparse matrices from temporal observations. We develop an online version of the batch temporal algorithm in order to process larger datasets or…

Machine Learning · Statistics 2022-08-04 Hong-Lan Botterman , Julien Roussel , Thomas Morzadec , Ali Jabbari , Nicolas Brunel

Principal component analysis (PCA) has been a prominent tool for high-dimensional data analysis. Online algorithms that estimate the principal component by processing streaming data are of tremendous practical and theoretical interests.…

Optimization and Control · Mathematics 2017-10-09 Chris Junchi Li , Mengdi Wang , Han Liu , Tong Zhang

We study principal component analysis (PCA), where given a dataset in $\mathbb{R}^d$ from a distribution, the task is to find a unit vector $v$ that approximately maximizes the variance of the distribution after being projected along $v$.…

Machine Learning · Computer Science 2023-05-05 Ilias Diakonikolas , Daniel M. Kane , Ankit Pensia , Thanasis Pittas

Principal Component Analysis is a novel way of of dimensionality reduction. This problem essentially boils down to finding the top k eigen vectors of the data covariance matrix. A considerable amount of literature is found on algorithms…

Machine Learning · Computer Science 2019-01-08 Jian Vora

In multivariate time series classification, although current sequence analysis models have excellent classification capabilities, they show significant shortcomings when dealing with long sequence multivariate data, such as prolonged…

Machine Learning · Computer Science 2024-10-29 Enshuo Yan , Huachuan Wang , Weihao Xia

We analyze the Accelerated Noisy Power Method, an algorithm for Principal Component Analysis in the setting where only inexact matrix-vector products are available, which can arise for instance in decentralized PCA. While previous works…

Machine Learning · Statistics 2026-02-04 Pierre Aguié , Mathieu Even , Laurent Massoulié
‹ Prev 1 2 3 10 Next ›