English

Two statistical problems for multivariate mixture distributions

Statistics Theory 2026-04-30 v6 Statistics Theory

Abstract

We address two important statistical problems: that of estimating mixtures of multivariate normal distributions and mixtures of tt-distributions based on univariate projections, and that of quantifying a discrepancy between mixture distributions induced by two model-based clusterings. In the second problem, rather than introducing a direct metric on partitions, we propose a model-based distributional discrepancy between the fitted mixture distributions associated with two clusterings. The results are based on an earlier work of the authors, where it was shown that mixtures of multivariate Gaussian or tt-distributions can be distinguished by projecting them onto a certain predetermined finite set of lines, the number of lines depending only on the total number of distributions involved and on the ambient dimension. We also compare our proposal with robust versions of the expectation-maximization method EM. In each case, we present algorithms for effecting the task, and compare them with existing methods by carrying out some simulations.

Keywords

Cite

@article{arxiv.2503.12147,
  title  = {Two statistical problems for multivariate mixture distributions},
  author = {Ricardo Fraiman and Leonardo Moreno and Thomas Ransford},
  journal= {arXiv preprint arXiv:2503.12147},
  year   = {2026}
}

Comments

41 pages, 12 figures

R2 v1 2026-06-28T22:22:00.503Z