English
Related papers

Related papers: A Bayesian factor analysis model for high-dimensio…

200 papers

Principal component analysis (PCA) is very popular to perform dimension reduction. The selection of the number of significant components is essential but often based on some practical heuristics depending on the application. Only few works…

Machine Learning · Statistics 2017-09-19 Clément Elvira , Pierre Chainais , Nicolas Dobigeon

Big data is transforming our world, revolutionizing operations and analytics everywhere, from financial engineering to biomedical sciences. The complexity of big data often makes dimension reduction techniques necessary before conducting…

Methodology · Statistics 2018-01-08 Jianqing Fan , Qiang Sun , Wen-Xin Zhou , Ziwei Zhu

Factors models are routinely used to analyze high-dimensional data in both single-study and multi-study settings. Bayesian inference for such models relies on Markov Chain Monte Carlo (MCMC) methods which scale poorly as the number of…

Methodology · Statistics 2025-04-29 Blake Hansen , Alejandra Avalos-Pacheco , Massimiliano Russo , Roberta De Vito

Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal,…

Machine Learning · Statistics 2015-05-06 Madeleine Udell , Corinne Horn , Reza Zadeh , Stephen Boyd

We present a Bayesian model selection approach to estimate the intrinsic dimensionality of a high-dimensional dataset. To this end, we introduce a novel formulation of the probabilisitic principal component analysis model based on a…

Methodology · Statistics 2019-05-22 Charles Bouveyron , Pierre Latouche , Pierre-Alexandre Mattei

Principal component analysis (PCA) is perhaps the most widely used method for data dimensionality reduction. A key question in PCA is deciding how many factors to retain. This manuscript describes a new approach to automatically selecting…

Methodology · Statistics 2026-02-10 Enes Makalic , Daniel F. Schmidt

Principal component analysis (PCA) is arguably the most widely used approach for large-dimensional factor analysis. While it is effective when the factors are sufficiently strong, it can be inconsistent when the factors are weak and/or the…

Methodology · Statistics 2025-08-22 Zhongyuan Lyu , Ming Yuan

Principal component analysis (PCA) is often used to analyze multivariate data together with cluster analysis, which depends on the number of principal components used. It is therefore important to determine the number of significant…

Applications · Statistics 2024-09-19 Joshua C. Macdonald , Javier Blanco-Portillo , Marcus W. Feldman , Yoav Ram

This is a tutorial and survey paper on factor analysis, probabilistic Principal Component Analysis (PCA), variational inference, and Variational Autoencoder (VAE). These methods, which are tightly related, are dimensionality reduction and…

Machine Learning · Statistics 2022-05-25 Benyamin Ghojogh , Ali Ghodsi , Fakhri Karray , Mark Crowley

Gaussian factor models have proven widely useful for parsimoniously characterizing dependence in multivariate data. There is a rich literature on their extension to mixed categorical and continuous variables, using latent Gaussian variables…

Methodology · Statistics 2013-01-14 Jared S. Murray , David B. Dunson , Lawrence Carin , Joseph E. Lucas

Sparse versions of principal component analysis (PCA) have imposed themselves as simple, yet powerful ways of selecting relevant features of high-dimensional data in an unsupervised manner. However, when several sparse principal components…

Machine Learning · Statistics 2019-05-22 Charles Bouveyron , Pierre Latouche , Pierre-Alexandre Mattei

We propose a multiple imputation method based on principal component analysis (PCA) to deal with incomplete continuous data. To reflect the uncertainty of the parameters from one imputation to the next, we use a Bayesian treatment of the…

Methodology · Statistics 2015-08-20 Vincent Audigier , François Husson , Julie Josse

Principal components analysis (PCA) is a classical method for the reduction of dimensionality of data in the form of n observations (or cases) of a vector with p variables. For a simple model of factor analysis type, it is proved that…

Statistics Theory · Mathematics 2009-01-29 Iain M Johnstone , Arthur Yu Lu

Detecting associations between microbial compositions and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with…

Principal Component Analysis (PCA) is a very successful dimensionality reduction technique, widely used in predictive modeling. A key factor in its widespread use in this domain is the fact that the projection of a dataset onto its first…

Machine Learning · Statistics 2017-05-19 Xianghui Luo , Robert J. Durrant

This paper introduces a Projected Principal Component Analysis (Projected-PCA), which employs principal component analysis to the projected (smoothed) data matrix onto a given linear space spanned by covariates. When it applies to…

Methodology · Statistics 2016-01-18 Jianqing Fan , Yuan Liao , Weichen Wang

The advances of next-generation sequencing technology have accelerated study of the microbiome and stimulated the high throughput profiling of metagenomes. The large volume of sequenced data has encouraged the rise of various studies for…

Methodology · Statistics 2019-04-30 Qiwei Li , Shuang Jiang , Andrew Y. Koh , Guanghua Xiao , Xiaowei Zhan

Principal component analysis (PCA) is one of the most widely used dimension reduction and multivariate statistical techniques. From a probabilistic perspective, PCA seeks a low-dimensional representation of data in the presence of…

Machine Learning · Computer Science 2021-01-06 Chihao Zhang , Kuo Gai , Shihua Zhang

This paper considers the estimation and inference of the low-rank components in high-dimensional matrix-variate factor models, where each dimension of the matrix-variates ($p \times q$) is comparable to or greater than the number of…

Statistics Theory · Mathematics 2022-10-20 Elynn Y. Chen , Jianqing Fan

Principal Component Analysis (PCA) is a dimension reduction technique. It produces inconsistent estimators when the dimensionality is moderate to high, which is often the problem in modern large-scale applications where algorithm…

Computation · Statistics 2016-01-29 Qiaoya Zhang , Yiyuan She
‹ Prev 1 2 3 10 Next ›