Related papers: Bayesian Clustering Factor Models

Bayesian Dynamic Clustering Factor Models

We propose novel Bayesian Dynamic Clustering Factor Models (BDCFM) for the analysis of multivariate longitudinal data. BDCFM combines factor models with hidden Markov models to concomitantly perform dimension reduction, clustering, and…

Methodology · Statistics 2025-05-28 Tsering Dolkar , Marco A. R. Ferreira , Hwasoo Shin , Allison N. Tegge

A Sparse Factor Model for Clustering High-Dimensional Longitudinal Data

Recent advances in engineering technologies have enabled the collection of a large number of longitudinal features. This wealth of information presents unique opportunities for researchers to investigate the complex nature of diseases and…

Methodology · Statistics 2023-11-27 Zihang Lu , Noirrit Kiran Chandra

Clustering Multivariate Data using Factor Analytic Bayesian Mixtures with an Unknown Number of Components

Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting…

Methodology · Statistics 2019-08-29 Panagiotis Papastamoulis

A multiscale Bayesian nonparametric framework for partial hierarchical clustering

In recent years, there has been a growing demand to discern clusters of subjects in datasets characterized by a large set of features. Often, these clusters may be highly variable in size and present partial hierarchical structures. In this…

Methodology · Statistics 2024-07-01 Lorenzo Schiavon , Mattia Stival

A Bayesian Semiparametric Factor Analysis Model for Subtype Identification

Disease subtype identification (clustering) is an important problem in biomedical research. Gene expression profiles are commonly utilized to infer disease subtypes, which often lead to biologically meaningful insights into disease. Despite…

Methodology · Statistics 2016-09-27 Jiehuan Sun , Joshua L. Warren , Hongyu Zhao

A unified framework for model-based clustering, linear regression and multiple cluster structure detection

A general framework for dealing with both linear regression and clustering problems is described. It includes Gaussian clusterwise linear regression analysis with random covariates and cluster analysis via Gaussian mixture models with…

Methodology · Statistics 2015-10-13 Giuliano Galimberti , Annamaria Manisi , Gabriele Soffritti

Model-based clustering of categorical data based on the Hamming distance

A model-based approach is developed for clustering categorical data with no natural ordering. The proposed method exploits the Hamming distance to define a family of probability mass functions to model the data. The elements of this family…

Methodology · Statistics 2024-07-02 Raffaele Argiento , Edoardo Filippi-Mazzola , Lucia Paci

Bayesian clustering of high-dimensional data via latent repulsive mixtures

Model-based clustering of moderate or large dimensional data is notoriously difficult. We propose a model for simultaneous dimensionality reduction and clustering by assuming a mixture model for a set of latent scores, which are then linked…

Methodology · Statistics 2024-06-04 Lorenzo Ghilotti , Mario Beraha , Alessandra Guglielmi

Bayesian Cluster Weighted Gaussian Models

We introduce a novel class of Bayesian mixtures for normal linear regression models which incorporates a further Gaussian random component for the distribution of the predictor variables. The proposed cluster-weighted model aims to…

Methodology · Statistics 2026-05-26 Panagiotis Papastamoulis , Konstantinos Perrakis

Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions

We propose a novel method for multiple clustering that assumes a co-clustering structure (partitions in both rows and columns of the data matrix) in each view. The new method is applicable to high-dimensional data. It is based on a…

Machine Learning · Statistics 2019-07-03 Tomoki Tokuda , Junichiro Yoshimoto , Yu Shimizu , Shigeru Toki , Go Okada , Masahiro Takamura , Tetsuya Yamamoto , Shinpei Yoshimura , Yasumasa Okamoto , Shigeto Yamawaki , Kenji Doya

Bayesian Bi-clustering Methods with Applications in Computational Biology

Bi-clustering is a useful approach in analyzing biological data when observations come from heterogeneous groups and have a large number of features. We outline a general Bayesian approach in tackling bi-clustering problems in moderate to…

Applications · Statistics 2021-02-11 Han Yan , Jiexing Wu , Yang Li , Jun S. Liu

Bayesian Consensus Clustering

The task of clustering a set of objects based on multiple sources of data arises in several modern applications. We propose an integrative statistical model that permits a separate clustering of the objects for each data source. These…

Machine Learning · Statistics 2015-12-01 Eric F. Lock , David B. Dunson

Distributed Bayesian clustering using finite mixture of mixtures

In many modern applications, there is interest in analyzing enormous data sets that cannot be easily moved across computers or loaded into memory on a single computer. In such settings, it is very common to be interested in clustering.…

Computation · Statistics 2020-05-15 Hanyu Song , Yingjian Wang , David B. Dunson

Model-based clustering based on sparse finite Gaussian mixtures

In the framework of Bayesian model-based clustering based on a finite mixture of Gaussian distributions, we present a joint approach to estimate the number of mixture components and identify cluster-relevant variables simultaneously as well…

Methodology · Statistics 2016-06-23 Gertraud Malsiner-Walli , Sylvia Frühwirth-Schnatter , Bettina Grün

A Bayesian approach for clustering skewed data using mixtures of multivariate normal-inverse Gaussian distributions

Non-Gaussian mixture models are gaining increasing attention for mixture model-based clustering particularly when dealing with data that exhibit features such as skewness and heavy tails. Here, such a mixture distribution is presented,…

Computation · Statistics 2020-05-07 Yuan Fang , Dimitris Karlis , Sanjeena Subedi

Bayesian Sparse Gaussian Mixture Model in High Dimensions

We study the sparse high-dimensional Gaussian mixture model when the number of clusters is allowed to grow with the sample size. A minimax lower bound for parameter estimation is established, and we show that a constrained maximum…

Statistics Theory · Mathematics 2024-02-26 Dapeng Yao , Fangzheng Xie , Yanxun Xu

Identifying Mixtures of Mixtures Using Bayesian Estimation

The use of a finite mixture of normal distributions in model-based clustering allows to capture non-Gaussian data clusters. However, identifying the clusters from the normal components is challenging and in general either achieved by…

Methodology · Statistics 2016-06-21 Gertraud Malsiner-Walli , Sylvia Frühwirth-Schnatter , Bettina Grün

Optimal Bayesian estimators for latent variable cluster models

In cluster analysis interest lies in probabilistically capturing partitions of individuals, items or observations into groups, such that those belonging to the same group share similar attributes or relational profiles. Bayesian posterior…

Methodology · Statistics 2017-03-23 Riccardo Rastelli , Nial Friel

Sparse group factor analysis for biclustering of multiple data sources

Motivation: Modelling methods that find structure in data are necessary with the current large volumes of genomic data, and there have been various efforts to find subsets of genes exhibiting consistent patterns over subsets of treatments.…

Machine Learning · Computer Science 2016-09-15 Kerstin Bunte , Eemeli Leppäaho , Inka Saarinen , Samuel Kaski

Bayesian Clustering via Fusing of Localized Densities

Bayesian clustering typically relies on mixture models, with each component interpreted as a different cluster. After defining a prior for the component parameters and weights, Markov chain Monte Carlo (MCMC) algorithms are commonly used to…

Methodology · Statistics 2024-07-30 Alexander Dombowsky , David B. Dunson