English

Data Structures for Density Estimation

Data Structures and Algorithms 2023-06-21 v1 Machine Learning Machine Learning

Abstract

We study statistical/computational tradeoffs for the following density estimation problem: given kk distributions v1,,vkv_1, \ldots, v_k over a discrete domain of size nn, and sampling access to a distribution pp, identify viv_i that is "close" to pp. Our main result is the first data structure that, given a sublinear (in nn) number of samples from pp, identifies viv_i in time sublinear in kk. We also give an improved version of the algorithm of Acharya et al. (2018) that reports viv_i in time linear in kk. The experimental evaluation of the latter algorithm shows that it achieves a significant reduction in the number of operations needed to achieve a given accuracy compared to prior work.

Keywords

Cite

@article{arxiv.2306.11312,
  title  = {Data Structures for Density Estimation},
  author = {Anders Aamand and Alexandr Andoni and Justin Y. Chen and Piotr Indyk and Shyam Narayanan and Sandeep Silwal},
  journal= {arXiv preprint arXiv:2306.11312},
  year   = {2023}
}

Comments

To appear at ICML'23

R2 v1 2026-06-28T11:09:19.439Z