Related papers: Kernel Density Estimators in Large Dimensions

Hashing-Based-Estimators for Kernel Density in High Dimensions

Given a set of points $P\subset \mathbb{R}^{d}$ and a kernel $k$, the Kernel Density Estimate at a point $x\in\mathbb{R}^{d}$ is defined as $\mathrm{KDE}_{P}(x)=\frac{1}{|P|}\sum_{y\in P} k(x,y)$. We study the problem of designing a data…

Data Structures and Algorithms · Computer Science 2018-09-03 Moses Charikar, Paris Siminelakis

Data-Based Optimal Bandwidth for Kernel Density Estimation of Statistical Samples

It is a common practice to evaluate probability density function or matter spatial density function from statistical samples. Kernel density estimation is a frequently used method, but to select an optimal bandwidth of kernel estimation,…

Methodology · Statistics 2021-04-27 Zhen-Wei Li, Ping He

Kernel Density Estimation for Dynamical Systems

We study the density estimation problem with observations generated by certain dynamical systems that admit a unique underlying invariant Lebesgue density. Observations drawn from dynamical systems are not independent and moreover, usual…

Machine Learning · Statistics 2016-07-14 Hanyuan Hang, Ingo Steinwart, Yunlong Feng, Johan A. K. Suykens

Bandwidth selection for kernel estimation in mixed multi-dimensional spaces

Kernel estimation techniques, such as mean shift, suffer from one major drawback: the kernel bandwidth selection. The bandwidth can be fixed for the whole data set or can vary at each point. Automatic bandwidth selection becomes a real…

Computer Vision and Pattern Recognition · Computer Science 2011-11-10 Aurelie Bugeau, Patrick Pérez

Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Volume Dimension

We derive concentration inequalities for the supremum norm of the difference between a kernel density estimator (KDE) and its point-wise expectation that hold uniformly over the selection of the bandwidth and under weaker conditions on the…

Statistics Theory · Mathematics 2020-01-01 Jisu Kim, Jaehyeok Shin, Alessandro Rinaldo, Larry Wasserman

Improved Coresets for Kernel Density Estimates

We study the construction of coresets for kernel density estimates. That is, we show how to approximate the kernel density estimate described by a large point set with another kernel density estimate with a much smaller point set. For…

Machine Learning · Computer Science 2017-10-13 Jeff M. Phillips, Wai Ming Tai

High-Dimensional Change-Point Detection via Angular Kernel Statistics

We study change-point detection for high-dimensional data in regimes where inference must be performed from small batches of observations. Our primary focus is the high-dimensional, low sample size (HDLSS) regime, where the sequence length…

Methodology · Statistics 2026-05-26 Jyotishka Ray Choudhury, Yao Xie

Kernel Density Estimation through Density Constrained Near Neighbor Search

In this paper we revisit the kernel density estimation problem: given a kernel $K(x, y)$ and a dataset of $n$ points in high dimensional Euclidean space, prepare a data structure that can quickly output, given a query $q$, a…

Data Structures and Algorithms · Computer Science 2020-11-16 Moses Charikar, Michael Kapralov, Navid Nouri, Paris Siminelakis

Density Estimation on Rectifiable Sets

Kernel density estimation is a popular method for estimating unseen probability distributions. However, the convergence of these classical estimators to the true density slows down in high dimensions. Moreover, they do not define meaningful…

Statistics Theory · Mathematics 2025-05-30 Jack Kendrick

Estimation of the invariant measure of a multidimensional diffusion from noisy observations

We introduce a new approach for estimating the invariant density of a multidimensional diffusion when dealing with high-frequency observations blurred by independent noises. We consider the intermediate regime, where observations occur at…

Statistics Theory · Mathematics 2024-04-19 Raphaël Maillet, Grégoire Szymanski

Kernel Estimation in High-Energy Physics

Kernel Estimation provides an unbinned and non-parametric estimate of the probability density function from which a set of data is drawn. In the first section, after a brief discussion on parametric and non-parametric methods, the theory of…

High Energy Physics - Experiment · Physics 2009-10-31 Kyle S. Cranmer

Reducing bias in nonparametric density estimation via bandwidth dependent kernels: $L_1$ view

We define a new bandwidth-dependent kernel density estimator that improves existing convergence rates for the bias, and preserves that of the variation, when the error is measured in $L_1$. No additional assumptions are imposed to the…

Statistics Theory · Mathematics 2016-12-28 Kairat Mynbaev, Carlos Martins-Filho

A note on kernel density estimators with optimal bandwidths

We show that the cumulative distribution function corresponding to a kernel density estimator with optimal bandwidth lies outside any confidence interval, around the empirical distribution function, with probability tending to 1 as the…

Statistics Theory · Mathematics 2026-04-17 Nils Lid Hjort, Stephen G. Walker

Kernel Density Estimation Bias under Minimal Assumptions

Kernel Density Estimation is a very popular technique of approximating a density function from samples. The accuracy is generally well-understood and depends, roughly speaking, on the kernel decay and local smoothness of the true density.…

Statistics Theory · Mathematics 2019-01-03 Maciej Skorski

The Discrepancy Principle for Choosing Bandwidths in Kernel Density Estimation

We investigate the discrepancy principle for choosing smoothing parameters for kernel density estimation. The method is based on the distance between the empirical and estimated distribution functions. We prove some new positive and…

Statistics Theory · Mathematics 2015-03-19 Thoralf Mildenberger

Kernel Two-Sample Tests for Manifold Data

We present a study of a kernel-based two-sample test statistic related to the Maximum Mean Discrepancy (MMD) in the manifold data setting, assuming that high-dimensional observations are close to a low-dimensional manifold. We characterize…

Machine Learning · Statistics 2024-02-27 Xiuyuan Cheng, Yao Xie

Bandwidth selection for kernel density estimation with length-biased data

Length-biased data are a particular case of weighted data, which arise in many situations: biomedicine, quality control or epidemiology among others. In this paper we study the theoretical properties of kernel density estimation in the…

Methodology · Statistics 2017-07-11 María Isabel Borrajo, Wenceslao González-Manteiga, María Dolores Martínez-Miranda

Improved Density and Distribution Function Estimation

Given additional distributional information in the form of moment restrictions, kernel density and distribution function estimators with implied generalised empirical likelihood probabilities as weights achieve a reduction in variance due…

Methodology · Statistics 2019-10-08 Vitaliy Oryshchenko, Richard J. Smith

Kernel density estimation of a multidimensional efficiency profile

Kernel density estimation is a convenient way to estimate the probability density of a distribution given the sample of data points. However, it has certain drawbacks: proper description of the density using narrow kernels needs large data…

Data Analysis, Statistics and Probability · Physics 2015-02-27 Anton Poluektov

Bandwidth Selection for Weighted Kernel Density Estimation

In this paper, the authors propose to estimate the density of a targeted population with a weighted kernel density estimator (wKDE) based on a weighted sample. Bandwidth selection for wKDE is discussed. Three mean integrated squared…

Methodology · Statistics 2011-11-28 Bin Wang, Xiaofeng Wang