English
Related papers

Related papers: Memory Limited, Streaming PCA

200 papers

In this paper we propose a new algorithm for streaming principal component analysis. With limited memory, small devices cannot store all the samples in the high-dimensional regime. Streaming principal component analysis aims to find the…

Machine Learning · Statistics 2018-02-16 Puyudi Yang , Cho-Jui Hsieh , Jane-Ling Wang

Principal Component Analysis (PCA) is a widely used technique in machine learning, data analysis and signal processing. With the increase in the size and complexity of datasets, it has become important to develop low-space usage algorithms…

Machine Learning · Computer Science 2023-03-09 Yichuan Deng , Zhao Song , Zifan Wang , Han Zhang

Principal component analysis (PCA) is a fundamental dimension reduction tool in statistics and machine learning. For large and high-dimensional data, computing the PCA (i.e., the singular vectors corresponding to a number of dominant…

Data Structures and Algorithms · Computer Science 2017-04-26 Wenjian Yu , Yu Gu , Jian Li , Shenghua Liu , Yaohang Li

Principal component analysis (PCA) is a classical method for dimensionality reduction based on extracting the dominant eigenvectors of the sample covariance matrix. However, PCA is well known to behave poorly in the ``large $p$, small $n$''…

Statistics Theory · Mathematics 2009-08-26 Arash A. Amini , Martin J. Wainwright

Principal component analysis (PCA) aims at estimating the direction of maximal variability of a high-dimensional dataset. A natural question is: does this task become easier, and estimation more accurate, when we exploit additional…

Information Theory · Computer Science 2014-06-19 Andrea Montanari , Emile Richard

In this paper, we propose an acceleration scheme for online memory-limited PCA methods. Our scheme converges to the first $k>1$ eigenvectors in a single data pass. We provide empirical convergence results of our scheme based on the spiked…

Machine Learning · Statistics 2018-07-18 Salaheddin Alakkari , John Dingliana

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance-correlation matrix of the analyzed data. However to properly work with high-dimensional data, PCA poses severe mathematical…

Quantitative Methods · Quantitative Biology 2018-10-18 Luigi Leonardo Palese

Fair Principal Component Analysis (PCA) is a problem setting where we aim to perform PCA while making the resulting representation fair in that the projected distributions, conditional on the sensitive attributes, match one another.…

Machine Learning · Statistics 2023-10-31 Junghyun Lee , Hanseul Cho , Se-Young Yun , Chulhee Yun

In the current context of data explosion, online techniques that do not require storing all data in memory are indispensable to routinely perform tasks like principal component analysis (PCA). Recursive algorithms that update the PCA with…

Machine Learning · Statistics 2015-11-13 Hervé Cardot , David Degras

Oja's algorithm for Streaming Principal Component Analysis (PCA) for $n$ data-points in a $d$ dimensional space achieves the same sin-squared error $O(r_{\mathsf{eff}}/n)$ as the offline algorithm in $O(d)$ space and $O(nd)$ time and a…

Statistics Theory · Mathematics 2025-03-12 Syamantak Kumar , Purnamrita Sarkar

We study the problem of recovering the subspace spanned by the first $k$ principal components of $d$-dimensional data under the streaming setting, with a memory bound of $O(kd)$. Two families of algorithms are known for this problem. The…

Machine Learning · Statistics 2015-10-13 Chun-Liang Li , Hsuan-Tien Lin , Chi-Jen Lu

Principal Component Analysis (PCA) has been widely used for dimensionality reduction and feature extraction. Robust PCA (RPCA), under different robust distance metrics, such as l1-norm and l2, p-norm, can deal with noise or outliers to some…

Machine Learning · Computer Science 2021-06-29 Zhao Kang , Hongfei Liu , Jiangxin Li , Xiaofeng Zhu , Ling Tian

Sparse Principal Component Analysis (PCA) methods are efficient tools to reduce the dimension (or the number of variables) of complex data. Sparse principal components (PCs) are easier to interpret than conventional PCs, because most…

Statistics Theory · Mathematics 2011-04-22 Dan Shen , Haipeng Shen , J. S. Marron

Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications. In this paper, an online distributed algorithm is proposed for recovering the principal eigenspaces. We further…

Machine Learning · Statistics 2019-05-20 Davoud Ataee Tarzanagh , Mohamad Kazem Shirani Faradonbeh , George Michailidis

Principal component analysis (PCA) is a classical dimension reduction method which projects data onto the principal subspace spanned by the leading eigenvectors of the covariance matrix. However, it behaves poorly when the number of…

Statistics Theory · Mathematics 2013-05-27 Zongming Ma

Principal component analysis (PCA) is one of the most powerful tools in machine learning. The simplest method for PCA, the power iteration, requires $\mathcal O(1/\Delta)$ full-data passes to recover the principal component of a matrix with…

Optimization and Control · Mathematics 2017-07-11 Christopher De Sa , Bryan He , Ioannis Mitliagkas , Christopher Ré , Peng Xu

Mining useful clusters from high dimensional data has received significant attention of the computer vision and pattern recognition community in the recent years. Linear and non-linear dimensionality reduction has played an important role…

Computer Vision and Pattern Recognition · Computer Science 2016-05-25 Nauman Shahid , Nathanael Perraudin , Vassilis Kalofolias , Gilles Puy , Pierre Vandergheynst

For many modern applications in science and engineering, data are collected in a streaming fashion carrying time-varying information, and practitioners need to process them with a limited amount of memory and computational resources in a…

Machine Learning · Statistics 2018-06-13 Laura Balzano , Yuejie Chi , Yue M. Lu

Principal component analysis (PCA) is a widely used unsupervised dimensionality reduction technique in machine learning, applied across various fields such as bioinformatics, computer vision and finance. However, when the response variables…

Applications · Statistics 2025-06-25 Theodosios Papazoglou , Guosheng Yin

We present a federated, asynchronous, and $(\varepsilon, \delta)$-differentially private algorithm for PCA in the memory-limited setting. Our algorithm incrementally computes local model updates using a streaming procedure and adaptively…

Machine Learning · Computer Science 2020-10-26 Andreas Grammenos , Rodrigo Mendoza-Smith , Jon Crowcroft , Cecilia Mascolo
‹ Prev 1 2 3 10 Next ›