Two statistical problems for multivariate mixture distributions
Abstract
We address two important statistical problems: that of estimating mixtures of multivariate normal distributions and mixtures of -distributions based on univariate projections, and that of quantifying a discrepancy between mixture distributions induced by two model-based clusterings. In the second problem, rather than introducing a direct metric on partitions, we propose a model-based distributional discrepancy between the fitted mixture distributions associated with two clusterings. The results are based on an earlier work of the authors, where it was shown that mixtures of multivariate Gaussian or -distributions can be distinguished by projecting them onto a certain predetermined finite set of lines, the number of lines depending only on the total number of distributions involved and on the ambient dimension. We also compare our proposal with robust versions of the expectation-maximization method EM. In each case, we present algorithms for effecting the task, and compare them with existing methods by carrying out some simulations.
Cite
@article{arxiv.2503.12147,
title = {Two statistical problems for multivariate mixture distributions},
author = {Ricardo Fraiman and Leonardo Moreno and Thomas Ransford},
journal= {arXiv preprint arXiv:2503.12147},
year = {2026}
}
Comments
41 pages, 12 figures