English
Related papers

Related papers: A factor mixture analysis model for multivariate b…

200 papers

Graphical models are commonly used tools for modeling multivariate random variables. While there exist many convenient multivariate distributions such as Gaussian distribution for continuous data, mixed data with the presence of discrete…

Machine Learning · Statistics 2014-04-30 Jianqing Fan , Han Liu , Yang Ning , Hui Zou

Finite mixture models have become a popular tool for clustering. Amongst other uses, they have been applied for clustering longitudinal data and clustering high-dimensional data. In the latter case, a latent Gaussian mixture model is…

Methodology · Statistics 2018-04-17 Vanessa S. E. Bierling , Paul D. McNicholas

We describe and analyze a broad class of mixture models for real-valued multivariate data in which the probability density of observations within each component of the model is represented as an arbitrary combination of basis functions.…

Methodology · Statistics 2025-02-28 M. E. J. Newman

Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting…

Methodology · Statistics 2019-08-29 Panagiotis Papastamoulis

We introduce a novel class of Bayesian mixtures for normal linear regression models which incorporates a further Gaussian random component for the distribution of the predictor variables. The proposed cluster-weighted model aims to…

Methodology · Statistics 2026-05-26 Panagiotis Papastamoulis , Konstantinos Perrakis

In this study, we develop a latent factor model for analysing high-dimensional binary data. Specifically, a standard probit model is used to describe the regression relationship between the observed binary data and the continuous latent…

Methodology · Statistics 2024-04-15 Jiaxin Shi , Yuan Gao , Rui Pan , Hansheng Wang

We propose to learn latent graphical models when data have mixed variables and missing values. This model could be used for further data analysis, including regression, classification, ranking etc. It also could be used for imputing missing…

Methodology · Statistics 2015-11-17 Xiao Li , Jinzhu Jia , Yuan Yao

Two key challenges in modern statistical applications are the large amount of information recorded per individual, and that such data are often not collected all at once but in batches. These batch effects can be complex, causing…

Applications · Statistics 2019-05-21 Alejandra Avalos-Pacheco , David Rossell , Richard S. Savage

We propose a mixture of latent trait models with common slope parameters (MCLT) for model-based clustering of high-dimensional binary data, a data type for which few established methods exist. Recent work on clustering of binary data, based…

Methodology · Statistics 2017-10-09 Yang Tang , Ryan P. Browne , Paul D. McNicholas

Clustered data is ubiquitous in a variety of scientific fields. In this paper, we propose a flexible and interpretable modeling approach, called grouped heterogenous mixture modeling, for clustered data, which models cluster-wise…

Methodology · Statistics 2020-02-10 Shonosuke Sugasawa

Mixture models postulate the overall population as a mixture of finite subpopulations with unobserved membership. Fitting mixture models usually requires large sample sizes and combining data from multiple sites can be beneficial. However,…

Methodology · Statistics 2025-12-19 Xiaokang Liu , Rui Duan , Raymond J. Carroll , Yang Ning , Yong Chen

A multimodal system with Poisson, Gaussian, and multinomial observations is considered. A generative graphical model that combines multiple modalities through common factor loadings is proposed. In this model, latent factors are like…

Applications · Statistics 2015-08-04 Yasin Yilmaz , Alfred O. Hero

Model-based clustering of moderate or large dimensional data is notoriously difficult. We propose a model for simultaneous dimensionality reduction and clustering by assuming a mixture model for a set of latent scores, which are then linked…

Methodology · Statistics 2024-06-04 Lorenzo Ghilotti , Mario Beraha , Alessandra Guglielmi

Recent advances in engineering technologies have enabled the collection of a large number of longitudinal features. This wealth of information presents unique opportunities for researchers to investigate the complex nature of diseases and…

Methodology · Statistics 2023-11-27 Zihang Lu , Noirrit Kiran Chandra

In the analysis of observational data in social sciences and businesses, it is difficult to obtain a "(quasi) single-source dataset" in which the variables of interest are simultaneously observed. Instead, multiple-source datasets are…

Methodology · Statistics 2021-09-02 Masaki Mitsuhiro , Takahiro Hoshino

In this paper, we propose a general framework for combining evidence of varying quality to estimate underlying binary latent variables in the presence of restrictions imposed to respect the scientific context. The resulting algorithms…

Methodology · Statistics 2018-08-28 Zhenke Wu , Livia Casciola-Rosen , Antony Rosen , Scott L. Zeger

Clustering methods with dimension reduction have been receiving considerable wide interest in statistics lately and a lot of methods to simultaneously perform clustering and dimension reduction have been proposed. This work presents a novel…

Methodology · Statistics 2014-06-17 Michio Yamamoto , Kenichi Hayashi

This paper develops a two-part finite mixture quantile regression model for semi-continuous longitudinal data. The proposed methodology allows heterogeneity sources that influence the model for the binary response variable, to influence…

Methodology · Statistics 2021-07-19 Antonello Maruotti , Luca Merlo , Lea Petrella

This paper develops a Bayesian graphical model for fusing disparate types of count data. The motivating application is the study of bacterial communities from diverse high dimensional features, in this case transcripts, collected from…

In many applications, data can be heterogeneous in the sense of spanning latent groups with different underlying distributions. When predictive models are applied to such data the heterogeneity can affect both predictive performance and…

Machine Learning · Statistics 2022-05-04 Thomas Lartigue , Sach Mukherjee
‹ Prev 1 2 3 10 Next ›