English
Related papers

Related papers: Probabilistic Count Matrix Factorization for Singl…

200 papers

Single-cell gene expression data are often characterized by large matrices, where the number of cells may be lower than the number of genes of interest. Factorization models have emerged as powerful tools to condense the available…

Methodology · Statistics 2023-05-22 Antonio Canale , Luisa Galtarossa , Davide Risso , Lorenzo Schiavon , Giovanni Toto

Single-cell RNA-seq provides detailed molecular snapshots of individual cells but is notoriously noisy. Variability stems from biological differences and technical factors, such as amplification bias and limited RNA capture efficiency,…

Machine Learning · Computer Science 2026-02-27 Victor Chardès

Traditional principal component analysis (PCA) is well known in high-dimensional data analysis, but it requires to express data by a matrix with observations to be continuous. To overcome the limitations, a new method called flexible PCA…

Methodology · Statistics 2021-08-17 Tonglin Zhang , Baijian Yang , Qianqian Song , Jing Su

Matrix factorization methods - including Factor analysis (FA), and Principal Components Analysis (PCA) - are widely used for inferring and summarizing structure in multivariate data. Many matrix factorization methods exist, corresponding to…

Methodology · Statistics 2021-05-04 Wei Wang , Matthew Stephens

Principal components analysis (PCA) is a classical method for the reduction of dimensionality of data in the form of n observations (or cases) of a vector with p variables. For a simple model of factor analysis type, it is proved that…

Statistics Theory · Mathematics 2009-01-29 Iain M Johnstone , Arthur Yu Lu

Single-cell RNA sequencing (scRNA-seq) has revolutionized our ability to analyze gene expression at the cellular level. By providing data on gene expression for each individual cell, scRNA-seq generates large datasets with thousands of…

Computational Complexity · Computer Science 2025-02-11 Md Romizul Islam , Swakkhar Shatabda

Factors models are routinely used to analyze high-dimensional data in both single-study and multi-study settings. Bayesian inference for such models relies on Markov Chain Monte Carlo (MCMC) methods which scale poorly as the number of…

Methodology · Statistics 2025-04-29 Blake Hansen , Alejandra Avalos-Pacheco , Massimiliano Russo , Roberta De Vito

Dimension reduction techniques are among the most essential analytical tools in the analysis of high-dimensional data. Generalized principal component analysis (PCA) is an extension to standard PCA that has been widely used to identify…

Dimensionality reduction is critical across various domains of science including neuroscience. Probabilistic Principal Component Analysis (PPCA) is a prominent dimensionality reduction method that provides a probabilistic approach unlike…

Machine Learning · Computer Science 2025-09-24 Han-Lin Hsieh , Maryam M. Shanechi

Network models provide a powerful framework for analysing single-cell count data, facilitating the characterisation of cellular identities, disease mechanisms, and developmental trajectories. However, uncertainty modeling in unsupervised…

Genomics · Quantitative Biology 2026-04-27 Shanshan Ren , Thomas E. Bartlett , Lina Gerontogianni , Swati Chandna

In this paper we analyze approximate methods for undertaking a principal components analysis (PCA) on large data sets. PCA is a classical dimension reduction method that involves the projection of the data onto the subspace spanned by the…

Machine Learning · Statistics 2017-08-16 Darren Homrighausen , Daniel J. McDonald

Cellular differentiation is governed by gene regulatory networks, the high-dimensional stochastic biochemical systems that determine the transcriptional landscape and mediate cellular responses to signals and perturbations. Although…

Molecular Networks · Quantitative Biology 2026-04-29 Suryanarayana Maddu , Victor Chardès , Michael J. Shelley

This paper considers the estimation and inference of the low-rank components in high-dimensional matrix-variate factor models, where each dimension of the matrix-variates ($p \times q$) is comparable to or greater than the number of…

Statistics Theory · Mathematics 2022-10-20 Elynn Y. Chen , Jianqing Fan

We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing. In the model, each cell has a low-dimensional latent representation. Additional latent variables account for…

Machine Learning · Computer Science 2018-01-18 Romain Lopez , Jeffrey Regier , Michael Cole , Michael Jordan , Nir Yosef

Single-cell RNA-seq data are challenging because of the sparseness of the read counts, the tiny expression of many relevant genes, and the variability in the efficiency of RNA extraction for different cells. We consider a simple…

Methodology · Statistics 2020-02-10 Silvia Giulia Galfre' , Francesco Morandin

Single-cell RNA sequencing (scRNA-seq) is a relatively new technology that has stimulated enormous interest in statistics, data science, and computational biology due to the high dimensionality, complexity, and large scale associated with…

Machine Learning · Statistics 2023-10-25 Yuta Hozumi , Guo-Wei Wei

Factor analysis (FA) and principal component analysis (PCA) are popular statistical methods for summarizing and explaining the variability in multivariate datasets. By default, FA and PCA assume the number of components or factors to be…

Methodology · Statistics 2022-05-17 Chetkar Jha , Ian Barnett

We present a technique to perform dimensionality reduction on data that is subject to uncertainty. Our method is a generalization of traditional principal component analysis (PCA) to multivariate probability distributions. In comparison to…

Machine Learning · Computer Science 2019-10-14 Jochen Görtler , Thilo Spinner , Dirk Streeb , Daniel Weiskopf , Oliver Deussen

This is a tutorial and survey paper on factor analysis, probabilistic Principal Component Analysis (PCA), variational inference, and Variational Autoencoder (VAE). These methods, which are tightly related, are dimensionality reduction and…

Machine Learning · Statistics 2022-05-25 Benyamin Ghojogh , Ali Ghodsi , Fakhri Karray , Mark Crowley

Principal component analysis (PCA) is a well-established tool in machine learning and data processing. The principal axes in PCA were shown to be equivalent to the maximum marginal likelihood estimator of the factor loading matrix in a…

Methodology · Statistics 2019-10-25 Mengyang Gu , Weining Shen
‹ Prev 1 2 3 10 Next ›