English
Related papers

Related papers: Covariance and PCA for Categorical Variables

200 papers

A system with many degrees of freedom can be characterized by a covariance matrix; principal components analysis (PCA) focuses on the eigenvalues of this matrix, hoping to find a lower dimensional description. But when the spectrum is…

Biological Physics · Physics 2017-04-26 Serena Bradde , William Bialek

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance-correlation matrix of the analyzed data. However to properly work with high-dimensional data, PCA poses severe mathematical…

Quantitative Methods · Quantitative Biology 2018-10-18 Luigi Leonardo Palese

Principal components analysis (PCA) is a classical method for the reduction of dimensionality of data in the form of n observations (or cases) of a vector with p variables. For a simple model of factor analysis type, it is proved that…

Statistics Theory · Mathematics 2009-01-29 Iain M Johnstone , Arthur Yu Lu

Principal component analysis (PCA) is often used for analyzing data in the most diverse areas. In this work, we report an integrated approach to several theoretical and practical aspects of PCA. We start by providing, in an intuitive and…

Computational Engineering, Finance, and Science · Computer Science 2021-06-09 Felipe L. Gewers , Gustavo R. Ferreira , Henrique F. de Arruda , Filipi N. Silva , Cesar H. Comin , Diego R. Amancio , Luciano da F. Costa

Probabilistic principal component analysis (PPCA) seeks a low dimensional representation of a data set in the presence of independent spherical Gaussian noise, Sigma = (sigma^2)*I. The maximum likelihood solution for the model is an…

Machine Learning · Statistics 2011-06-23 Alfredo A. Kalaitzis , Neil D. Lawrence

Probabilistic principal component analysis (PPCA) seeks a low dimensional representation of a data set in the presence of independent spherical Gaussian noise. The maximum likelihood solution for the model is an eigenvalue problem on the…

Machine Learning · Computer Science 2012-06-22 Alfredo Kalaitzis , Neil Lawrence

We propose a multiple imputation method based on principal component analysis (PCA) to deal with incomplete continuous data. To reflect the uncertainty of the parameters from one imputation to the next, we use a Bayesian treatment of the…

Methodology · Statistics 2015-08-20 Vincent Audigier , François Husson , Julie Josse

In this brief note, we formulate Principal Component Analysis (PCA) over datasets consisting not of points but of distributions, characterized by their location and covariance. Just like the usual PCA on points can be equivalently derived…

Machine Learning · Statistics 2023-06-26 Vlad Niculae

Canonical correlation analysis (CCA) is a classic statistical method for discovering latent co-variation that underpins two or more observed random vectors. Several extensions and variations of CCA have been proposed that have strengthened…

Machine Learning · Computer Science 2023-12-22 Paris A. Karakasis , Nicholas D. Sidiropoulos

Principal component analysis (PCA) is a dimensionality reduction method in data analysis that involves diagonalizing the covariance matrix of the dataset. Recently, quantum algorithms have been formulated for PCA based on diagonalizing a…

Quantum Physics · Physics 2022-10-26 Max Hunter Gordon , M. Cerezo , Lukasz Cincio , Patrick J. Coles

The literature provides strong evidence that stock prices can be predicted from past price data. Principal component analysis (PCA) is a widely used mathematical technique for dimensionality reduction and analysis of data by identifying a…

Mathematical Finance · Quantitative Finance 2018-03-15 Mahsa Ghorbani , Edwin K. P. Chong

Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal,…

Machine Learning · Statistics 2015-05-06 Madeleine Udell , Corinne Horn , Reza Zadeh , Stephen Boyd

Principal Components Analysis (PCA) is a common way to study the sources of variation in a high-dimensional data set. Typically, the leading principal components are used to understand the variation in the data or to reduce the dimension of…

Covariance matrix estimation and principal component analysis (PCA) are two cornerstones of multivariate analysis. Classic textbook solutions perform poorly when the dimension of the data is of a magnitude similar to the sample size, or…

Statistics Theory · Mathematics 2014-06-25 Olivier Ledoit , Michael Wolf

Principal component analysis (PCA) is a widely used unsupervised dimensionality reduction technique in machine learning, applied across various fields such as bioinformatics, computer vision and finance. However, when the response variables…

Applications · Statistics 2025-06-25 Theodosios Papazoglou , Guosheng Yin

Principal component analysis (PCA) defines a reduced space described by PC axes for a given multidimensional-data sequence to capture the variations of the data. In practice, we need multiple data sequences that accurately obey individual…

Methodology · Statistics 2021-04-19 Ikuo Fukuda , Kei Moritsugu

Principal Component Analysis (PCA) is a well known procedure to reduce intrinsic complexity of a dataset, essentially through simplifying the covariance structure or the correlation structure. We introduce a novel algebraic, model-based…

Methodology · Statistics 2021-12-09 Martin Schlather , Felix Reinbott

Principal Component Analysis (PCA) is one of the most commonly used statistical methods for data exploration, and for dimensionality reduction wherein the first few principal components account for an appreciable proportion of the…

Methodology · Statistics 2024-01-11 Caren Marzban , Ulvi Yurtsever , Michael Richman

Principal component analysis (PCA) is arguably the most widely used approach for large-dimensional factor analysis. While it is effective when the factors are sufficiently strong, it can be inconsistent when the factors are weak and/or the…

Methodology · Statistics 2025-08-22 Zhongyuan Lyu , Ming Yuan

A set of curves or images of similar shape is an increasingly common functional data set collected in the sciences. Principal Component Analysis (PCA) is the most widely used technique to decompose variation in functional data. However, the…

Methodology · Statistics 2009-09-29 Rima Izem , J. S. Marron
‹ Prev 1 2 3 10 Next ›