Related papers: Provable Model-Parallel Distributed Principal Comp…

200 papers

Principal Component Analysis (PCA) aims to find subspaces spanned by the so-called principal components that best represent the variance in the dataset. The deflation method is a popular meta-algorithm that sequentially finds individual…

Machine Learning · Computer Science 2024-05-30 Fangshuo Liao, Junhyung Lyle Kim, Cruz Barnum, Anastasios Kyrillidis

Principal Component Analysis (PCA) is the workhorse tool for dimensionality reduction in this era of big data. While often overlooked, the purpose of PCA is not only to reduce data dimensionality, but also to yield features that are…

Machine Learning · Computer Science 2021-11-30 Arpita Gang, Waheed U. Bajwa

Principal component analysis (PCA) is fundamental to statistical machine learning. It extracts latent principal factors that contribute to the most variation of the data. When data are stored across multiple machines, however, communication…

Computation · Statistics 2018-01-11 Jianqing Fan, Dong Wang, Kaizheng Wang, Ziwei Zhu

We study distributed principal component analysis (PCA) in high-dimensional settings under the spiked model. In such regimes, sample eigenvectors can deviate significantly from population ones, introducing a persistent bias. Existing…

Methodology · Statistics 2025-05-29 Weiming Li, Zeng Li, Siyu Wang, Yanqing Yin, Junpeng Zhu

Principal Component Analysis (PCA) is a fundamental data preprocessing tool in the world of machine learning. While PCA is often thought of as a dimensionality reduction method, the purpose of PCA is actually two-fold: dimension reduction…

Machine Learning · Computer Science 2023-01-25 Arpita Gang, Waheed U. Bajwa

We present a novel view on principal component analysis (PCA) as a competitive game in which each approximate eigenvector is controlled by a player whose goal is to maximize their own utility function. We analyze the properties of this PCA…

Machine Learning · Computer Science 2021-03-18 Ian Gemp, Brian McWilliams, Claire Vernade, Thore Graepel

We study the distributed computing setting in which there are multiple servers, each holding a set of points, who wish to compute functions on the union of their point sets. A key task in this setting is Principal Component Analysis (PCA),…

Machine Learning · Computer Science 2014-12-24 Maria-Florina Balcan, Vandana Kanchanapally, Yingyu Liang, David Woodruff

We present a federated, asynchronous, and $(\varepsilon, \delta)$-differentially private algorithm for PCA in the memory-limited setting. Our algorithm incrementally computes local model updates using a streaming procedure and adaptively…

Machine Learning · Computer Science 2020-10-26 Andreas Grammenos, Rodrigo Mendoza-Smith, Jon Crowcroft, Cecilia Mascolo

The growing size of modern data sets brings many challenges to the existing statistical estimation approaches, which calls for new distributed methodologies. This paper studies distributed estimation for a fundamental statistical machine…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-02-04 Xi Chen, Jason D. Lee, He Li, Yun Yang

We consider principal component analysis (PCA) in decomposable Gaussian graphical models. We exploit the prior information in these models in order to distribute its computation. For this purpose, we reformulate the problem in the sparse…

Machine Learning · Statistics 2015-05-13 Ami Wiesel, Alfred O. Hero

The Principal Component Analysis (PCA) is a data dimensionality reduction technique well-suited for processing data from sensor networks. It can be applied to tasks like compression, event detection, and event recognition. This technique is…

Networking and Internet Architecture · Computer Science 2010-03-13 Yann-Aël Le Borgne, Sylvain Raybaud, Gianluca Bontempi

Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications. In this paper, an online distributed algorithm is proposed for recovering the principal eigenspaces. We further…

Machine Learning · Statistics 2019-05-20 Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, George Michailidis

Principal component analysis (PCA), a ubiquitous dimensionality reduction technique in signal processing, searches for a projection matrix that minimizes the mean squared error between the reduced dataset and the original one. Since…

Machine Learning · Computer Science 2022-08-25 Guilherme Dean Pelegrina, Leonardo Tomazeli Duarte

Principal component analysis (PCA) is a widely used technique for dimension reduction. As datasets continue to grow in size, distributed-PCA (DPCA) has become an active research area. A key challenge in DPCA lies in efficiently aggregating…

Machine Learning · Statistics 2024-10-02 Zhi-Yu Jou, Su-Yun Huang, Hung Hung, Shinto Eguchi

Due to the rapid growth of smart agents such as weakly connected computational nodes and sensors, developing decentralized algorithms that can perform computations on local agents becomes a major research direction. This paper considers the…

Machine Learning · Computer Science 2021-02-09 Haishan Ye, Tong Zhang

Principal component analysis (PCA) is a statistical technique commonly used in multivariate data analysis. However, PCA can be difficult to interpret and explain since the principal components (PCs) are linear combinations of the original…

Mathematical Software · Computer Science 2013-12-24 W. Liu, H. Zhang, D. Tao, Y. Wang, K. Lu

As tensors become widespread in modern data analysis, Tucker low-rank Principal Component Analysis (PCA) has become essential for dimensionality reduction and structural discovery in tensor datasets. Motivated by the common scenario where…

Methodology · Statistics 2025-04-08 Elynn Chen, Xi Chen, Wenbo Jing, Yichen Zhang

This paper describes some applications of an incremental implementation of the principal component analysis (PCA). The algorithm updates the transformation coefficients matrix on-line for each new sample, without the need to keep all the…

Machine Learning · Statistics 2019-08-14 Vittorio Lippi, Giacomo Ceccarelli

Distributed algorithms and theories are called for in this era of big data. Under weaker local signal-to-noise ratios, we improve upon the celebrated one-round distributed principal component analysis (PCA) algorithm designed in the spirit…

Methodology · Statistics 2025-07-01 ZeYu Li, Xinsheng Zhang, Wang Zhou

We propose a stable version of Principal Component Analysis (PCA) in the general framework of a separable Hilbert space. It consists in interpreting the projection on the first eigenvectors as a step function applied to the spectrum of the…

Statistics Theory · Mathematics 2017-04-03 Ilaria Giulini