English

Fast sampling and model selection for Bayesian mixture models

Computation 2025-11-03 v2 Machine Learning

Abstract

We study Bayesian estimation of mixture models and argue in favor of fitting the marginal posterior distribution over component assignments directly, rather than Gibbs sampling from the joint posterior on components and parameters as is commonly done. Some previous authors have found the former approach to have slow mixing, but we show that, implemented correctly, it can achieve excellent performance. In particular, we describe a new Monte Carlo algorithm for sampling from the marginal posterior of a general integrable mixture that makes use of rejection-free sampling from the prior over component assignments to achieve excellent mixing times in typical applications, outperforming standard Gibbs sampling, in some cases by a wide margin. We demonstrate the approach with a selection of applications to Gaussian, Poisson, and categorical models.

Keywords

Cite

@article{arxiv.2501.07668,
  title  = {Fast sampling and model selection for Bayesian mixture models},
  author = {M. E. J. Newman},
  journal= {arXiv preprint arXiv:2501.07668},
  year   = {2025}
}

Comments

Additional material on algorithms and example applications in this version. Code and data available at https://www.umich.edu/~mejn/mixture