Related papers: Predictor-Informed Bayesian Nonparametric Clusteri…

A Common Atom Model for the Bayesian Nonparametric Analysis of Nested Data

The use of high-dimensional data for targeted therapeutic interventions requires new ways to characterize the heterogeneity observed across subgroups of a specific population. In particular, models for partially exchangeable data are needed…

Methodology · Statistics 2020-08-18 Francesco Denti, Federico Camerlenghi, Michele Guindani, Antonietta Mira

Flexible clustering via hidden hierarchical Dirichlet priors

The Bayesian approach to inference stands out for naturally allowing borrowing information across heterogeneous populations, with different samples possibly sharing the same distribution. A popular Bayesian nonparametric model for…

Methodology · Statistics 2022-01-25 Antonio Lijoi, Igor Prünster, Giovanni Rebaudo

Cluster Prediction for Opinion Dynamics from Partial Observations

We present a Bayesian approach to predict the clustering of opinions for a system of interacting agents from partial observations. The Bayesian formulation overcomes the unobservability of the system and quantifies the uncertainty in the…

Computation · Statistics 2020-12-22 Zehong Zhang, Fei Lu

Non-parametric Multi-Partitions Clustering

In the framework of model-based clustering, a model, called multi-partitions clustering, allowing several latent class variables has been proposed. This model assumes that the distribution of the observed data can be factorized into several…

Methodology · Statistics 2023-01-09 Marie du Roy de Chaumaray, Vincent Vandewalle

Bayesian clustering using random effects models and predictive projections

Linear mixed models are widely used for analyzing hierarchically structured data involving missingness and unbalanced study designs. We consider a Bayesian clustering method that combines linear mixed models and predictive projections. For…

Methodology · Statistics 2021-07-07 Yinan Mao, David J. Nott

Model-based clustering of categorical data based on the Hamming distance

A model-based approach is developed for clustering categorical data with no natural ordering. The proposed method exploits the Hamming distance to define a family of probability mass functions to model the data. The elements of this family…

Methodology · Statistics 2024-07-02 Raffaele Argiento, Edoardo Filippi-Mazzola, Lucia Paci

Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions

We propose a novel method for multiple clustering that assumes a co-clustering structure (partitions in both rows and columns of the data matrix) in each view. The new method is applicable to high-dimensional data. It is based on a…

Machine Learning · Statistics 2019-07-03 Tomoki Tokuda, Junichiro Yoshimoto, Yu Shimizu, Shigeru Toki, Go Okada, Masahiro Takamura, Tetsuya Yamamoto, Shinpei Yoshimura, Yasumasa Okamoto, Shigeto Yamawaki, Kenji Doya

A Class of Dependent Random Distributions Based on Atom Skipping

We propose the Plaid Atoms Model (PAM), a novel Bayesian nonparametric model for grouped data. Founded on an idea of 'atom skipping', PAM is part of a well-established category of models that generate dependent random distributions and…

Methodology · Statistics 2024-01-02 Dehua Bi, Yuan Ji

Nested Atoms Model with Application to Clustering Big Population-Scale Single-Cell Data

We consider the problem of clustering nested or hierarchical data, where observations are grouped and there are both group-level and observation-level variables. In our motivating OneK1K dataset, observations consist of single-cell…

Methodology · Statistics 2026-04-14 Arhit Chakrabarti, Yang Ni, Yuchao Jiang, Bani K. Mallick

Optimal Bayesian clustering using non-negative matrix factorization

Bayesian model-based clustering is a widely applied procedure for discovering groups of related observations in a dataset. These approaches use Bayesian mixture models, estimated with MCMC, which provide posterior samples of the model…

Methodology · Statistics 2018-09-24 Ketong Wang, Michael D. Porter

A Bayesian Approach to Restricted Latent Class Models for Scientifically-Structured Clustering of Multivariate Binary Outcomes

In this paper, we propose a general framework for combining evidence of varying quality to estimate underlying binary latent variables in the presence of restrictions imposed to respect the scientific context. The resulting algorithms…

Methodology · Statistics 2018-08-28 Zhenke Wu, Livia Casciola-Rosen, Antony Rosen, Scott L. Zeger

Mixture of multilayer stochastic block models for multiview clustering

In this work, we propose an original method for aggregating multiple clusterings coming from different sources of information. Each partition is encoded by a co-membership matrix between observations. Our approach uses a mixture of…

Machine Learning · Computer Science 2024-01-10 Kylliann De Santiago, Marie Szafranski, Christophe Ambroise

A Latent-Variable Bayesian Nonparametric Regression Model

We introduce a random partition model for Bayesian nonparametric regression. The model is based on infinitely-many disjoint regions of the range of a latent covariate-dependent Gaussian process. Given a realization of the process, the…

Methodology · Statistics 2013-01-04 George Karabatsos, Stephen G. Walker

BayesCPclust: A Bayesian Approach for Clustering Constant-Wise Change-Point Data

Change-point models deal with ordered data sequences. Their primary goal is to infer the locations where an aspect of the data sequence changes. In this paper, we propose and implement a nonparametric Bayesian model for clustering…

Methodology · Statistics 2025-02-12 Ana Carolina da Cruz, Camila P. E. de Souza

Advances in Bayesian random partition models: A comprehensive review

Clustering is a crucial task in various domains of knowledge, including medicine, epidemiology, genomics, environmental science, economics, and visual sciences, among others. Methodologies for inferring the number of clusters have often…

Methodology · Statistics 2025-05-26 Clara Grazian

Monitoring Adverse Events Through Bayesian Nonparametric Clustering Across Studies

We introduce a Bayesian nonparametric inference approach for aggregate adverse event (AE) monitoring across studies. The proposed model seamlessly integrates external data from historical trials to define a relevant background rate and…

Methodology · Statistics 2025-09-10 Shijie Yuan, Kevin Roberts, Noirrit Kiran Chandra, Yuan Ji, Peter Müller

A Bayesian Nonparametric Approach for Clustering Functional Trajectories over Time

Functional concurrent, or varying-coefficient, regression models are commonly used in biomedical and clinical settings to investigate how the relation between an outcome and observed covariate varies as a function of another covariate. In…

Methodology · Statistics 2024-10-10 Mingrui Liang, Matthew D. Koslovsky, Emily T. Hebert, Darla E. Kendzor, Marina Vannucci

Model Based Clustering for Mixed Data: clustMD

A model based clustering procedure for data of mixed type, clustMD, is developed using a latent variable model. It is proposed that a latent variable, following a mixture of Gaussian distributions, generates the observed data of mixed type.…

Methodology · Statistics 2015-11-06 Damien McParland, Isobel Claire Gormley

Clustering blood donors via mixtures of product partition models with covariates

Motivated by the problem of accurately predicting gap times between successive blood donations, we present here a general class of Bayesian nonparametric models for clustering. These models allow for prediction of new recurrences,…

Methodology · Statistics 2022-10-18 Raffaele Argiento, Riccardo Corradin, Alessandra Guglielmi, Ettore Lanzarone

Determinantal Clustering Processes - A Nonparametric Bayesian Approach to Kernel Based Semi-Supervised Clustering

Semi-supervised clustering is the task of clustering data points into clusters where only a fraction of the points are labelled. The true number of clusters in the data is often unknown and most models require this parameter as an input.…

Machine Learning · Computer Science 2013-09-27 Amar Shah, Zoubin Ghahramani