Related papers: Learning a Factor Model via Regularized PCA

Regularised PCA to denoise and visualise data

Principal component analysis (PCA) is a well-established method commonly used to explore and visualise data. A classical PCA model is the fixed effect model where data are generated as a fixed structure of low rank corrupted by noise. Under…

Methodology · Statistics 2013-05-13 Marie Verbanck , Julie Josse , François Husson

Efficient fair PCA for fair representation learning

We revisit the problem of fair principal component analysis (PCA), where the goal is to learn the best low-rank linear approximation of the data that obfuscates demographic information. We propose a conceptually simple approach that allows…

Machine Learning · Statistics 2023-02-28 Matthäus Kleindessner , Michele Donini , Chris Russell , Muhammad Bilal Zafar

Large-dimensional Factor Analysis with Weighted PCA

Principal component analysis (PCA) is arguably the most widely used approach for large-dimensional factor analysis. While it is effective when the factors are sufficiently strong, it can be inconsistent when the factors are weak and/or the…

Methodology · Statistics 2025-08-22 Zhongyuan Lyu , Ming Yuan

Robust Principal Component Analysis on Graphs

Principal Component Analysis (PCA) is the most widely used tool for linear dimensionality reduction and clustering. Still it is highly sensitive to outliers and does not scale well with respect to the number of data samples. Robust PCA…

Computer Vision and Pattern Recognition · Computer Science 2015-04-24 Nauman Shahid , Vassilis Kalofolias , Xavier Bresson , Michael Bronstein , Pierre Vandergheynst

Generalized Low Rank Models

Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal,…

Machine Learning · Statistics 2015-05-06 Madeleine Udell , Corinne Horn , Reza Zadeh , Stephen Boyd

Principal Component Analysis versus Factor Analysis

The article discusses selected problems related to both principal component analysis (PCA) and factor analysis (FA). In particular, both types of analysis were compared. A vector interpretation for both PCA and FA has also been proposed.…

Machine Learning · Computer Science 2021-10-22 Zenon Gniazdowski

A Generalization of Principal Component Analysis

Conventional principal component analysis (PCA) finds a principal vector that maximizes the sum of second powers of principal components. We consider a generalized PCA that aims at maximizing the sum of an arbitrary convex function of…

Machine Learning · Computer Science 2019-11-19 Samuele Battaglino , Erdem Koyuncu

Principal Component Analysis: A Natural Approach to Data Exploration

Principal component analysis (PCA) is often used for analyzing data in the most diverse areas. In this work, we report an integrated approach to several theoretical and practical aspects of PCA. We start by providing, in an intuitive and…

Computational Engineering, Finance, and Science · Computer Science 2021-06-09 Felipe L. Gewers , Gustavo R. Ferreira , Henrique F. de Arruda , Filipi N. Silva , Cesar H. Comin , Diego R. Amancio , Luciano da F. Costa

An algorithm for the principal component analysis of large data sets

Recently popularized randomized methods for principal component analysis (PCA) efficiently and reliably produce nearly optimal accuracy --- even on parallel processors --- unlike the classical (deterministic) alternatives. We adapt one of…

Computation · Statistics 2011-12-23 Nathan Halko , Per-Gunnar Martinsson , Yoel Shkolnisky , Mark Tygert

Fixed-order PCA: Theory for Overestimated Factor Models

We develop asymptotic theory for principal component analysis (PCA) of a high-dimensional factor model in which the working dimension $R$ is fixed and only required to satisfy $R \ge r$, where $r$ is the true number of factors. Building on…

Statistics Theory · Mathematics 2026-05-19 Yuan Liao , Xin Tong , Wanjie Wang , Dacheng Xiu

Generalized probabilistic principal component analysis of correlated data

Principal component analysis (PCA) is a well-established tool in machine learning and data processing. The principal axes in PCA were shown to be equivalent to the maximum marginal likelihood estimator of the factor loading matrix in a…

Methodology · Statistics 2019-10-25 Mengyang Gu , Weining Shen

A random version of principal component analysis in data clustering

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance-correlation matrix of the analyzed data. However to properly work with high-dimensional data, PCA poses severe mathematical…

Quantitative Methods · Quantitative Biology 2018-10-18 Luigi Leonardo Palese

FAST-PCA: A Fast and Exact Algorithm for Distributed Principal Component Analysis

Principal Component Analysis (PCA) is a fundamental data preprocessing tool in the world of machine learning. While PCA is often thought of as a dimensionality reduction method, the purpose of PCA is actually two-fold: dimension reduction…

Machine Learning · Computer Science 2023-01-25 Arpita Gang , Waheed U. Bajwa

Projected principal component analysis in factor models

This paper introduces a Projected Principal Component Analysis (Projected-PCA), which employs principal component analysis to the projected (smoothed) data matrix onto a given linear space spanned by covariates. When it applies to…

Methodology · Statistics 2016-01-18 Jianqing Fan , Yuan Liao , Weichen Wang

Sparse Principal Components Analysis

Principal components analysis (PCA) is a classical method for the reduction of dimensionality of data in the form of n observations (or cases) of a vector with p variables. For a simple model of factor analysis type, it is proved that…

Statistics Theory · Mathematics 2009-01-29 Iain M Johnstone , Arthur Yu Lu

Modal Principal Component Analysis

Principal component analysis (PCA) is a widely used method for data processing, such as for dimension reduction and visualization. Standard PCA is known to be sensitive to outliers, and thus, various robust PCA methods have been proposed.…

Machine Learning · Statistics 2020-08-11 Keishi Sando , Hideitsu Hino

Robust PCA via Regularized REAPER with a Matrix-Free Proximal Algorithm

Principal component analysis (PCA) is known to be sensitive to outliers, so that various robust PCA variants were proposed in the literature. A recent model, called REAPER, aims to find the principal components by solving a convex…

Numerical Analysis · Mathematics 2021-03-19 Robert Beinert , Gabriele Steidl

Robust high dimensional factor models with applications to statistical machine learning

Factor models are a class of powerful statistical models that have been widely used to deal with dependent measurements that arise frequently from various applications from genomics and neuroscience to economics and finance. As data are…

Methodology · Statistics 2018-08-14 Jianqing Fan , Kaizheng Wang , Yiqiao Zhong , Ziwei Zhu

Supervised PCA: A Multiobjective Approach

Methods for supervised principal component analysis (SPCA) aim to incorporate label information into principal component analysis (PCA), so that the extracted features are more useful for a prediction task of interest. Prior work on SPCA…

Machine Learning · Statistics 2022-08-18 Alexander Ritchie , Laura Balzano , Daniel Kessler , Chandra S. Sripada , Clayton Scott

Non-negative Principal Component Analysis: Message Passing Algorithms and Sharp Asymptotics

Principal component analysis (PCA) aims at estimating the direction of maximal variability of a high-dimensional dataset. A natural question is: does this task become easier, and estimation more accurate, when we exploit additional…

Information Theory · Computer Science 2014-06-19 Andrea Montanari , Emile Richard