English

Scalable multiscale density estimation

Methodology 2014-10-29 v1

Abstract

Although Bayesian density estimation using discrete mixtures has good performance in modest dimensions, there is a lack of statistical and computational scalability to high-dimensional multivariate cases. To combat the curse of dimensionality, it is necessary to assume the data are concentrated near a lower-dimensional subspace. However, Bayesian methods for learning this subspace along with the density of the data scale poorly computationally. To solve this problem, we propose an empirical Bayes approach, which estimates a multiscale dictionary using geometric multiresolution analysis in a first stage. We use this dictionary within a multiscale mixture model, which allows uncertainty in component allocation, mixture weights and scaling factors over a binary tree. A computational algorithm is proposed, which scales efficiently to massive dimensional problems. We provide some theoretical support for this geometric density estimation (GEODE) method, and illustrate the performance through simulated and real data examples.

Keywords

Cite

@article{arxiv.1410.7692,
  title  = {Scalable multiscale density estimation},
  author = {Ye Wang and Antonio Canale and David Dunson},
  journal= {arXiv preprint arXiv:1410.7692},
  year   = {2014}
}

Comments

9 pages with references, 7 pages appendix, 7 figures

R2 v1 2026-06-22T06:38:57.928Z