Optimal Coreset for Gaussian Kernel Density Estimation
Data Structures and Algorithms
2022-02-22 v5 Computational Geometry
Machine Learning
Abstract
Given a point set , the kernel density estimate of is defined as for any . We study how to construct a small subset of such that the kernel density estimate of is approximated by the kernel density estimate of . This subset is called a coreset. The main technique in this work is constructing a coloring on the point set by discrepancy theory and we leverage Banaszczyk's Theorem. When is a constant, our construction gives a coreset of size as opposed to the best-known result of . It is the first result to give a breakthrough on the barrier of factor even when .
Cite
@article{arxiv.2007.08031,
title = {Optimal Coreset for Gaussian Kernel Density Estimation},
author = {Wai Ming Tai},
journal= {arXiv preprint arXiv:2007.08031},
year = {2022}
}
Comments
Accepted for Symposium on Computational Geometry (SoCG) 2022