English
Related papers

Related papers: Indirect Cross-validation for Density Estimation

200 papers

In this paper we provide insight into the empirical properties of indirect cross-validation (ICV), a new method of bandwidth selection for kernel density estimators. First, we describe the method and report on the theoretical results used…

Methodology · Statistics 2008-12-02 Olga Y. Savchuk , Jeffrey D. Hart , Simon J. Sheather

Recent contributions to kernel smoothing show that the performance of cross-validated bandwidth selectors improve significantly from indirectness. Indirect crossvalidation first estimates the classical cross-validated bandwidth from a more…

One-sided cross-validation (OSCV) is a bandwidth selection method initially introduced by Hart and Yi (1998) in the context of smooth regression functions. Mart\'{\i}nez-Miranda et al. (2009) developed a version of OSCV for smooth density…

Methodology · Statistics 2017-03-16 Olga Y. Savchuk

Fully robust OSCV is a modification of the OSCV method that produces consistent bandwidth in the cases of smooth and nonsmooth regression functions. The current implementation of the method uses the kernel $H_I$ that is almost…

Methodology · Statistics 2016-03-01 Olga Y. Savchuk , Jeffrey D. Hart

This paper presents an intuitive application of multivariate kernel density estimation (KDE) for data correction. The method utilizes the expected value of the conditional probability density function (PDF) and a credible interval to…

Applications · Statistics 2025-09-19 Hai Bui , Mostafa Bakhoday-Paskyabi

Nonparametric estimation of copula density functions using kernel estimators presents significant challenges. One issue is the potential unboundedness of certain copula density functions at the corners of the unit square. Another is the…

Methodology · Statistics 2025-02-11 Mathias N. Muia , Olivia Atutey , Mahmud Hasan

In the this paper, the authors propose to estimate the density of a targeted population with a weighted kernel density estimator (wKDE) based on a weighted sample. Bandwidth selection for wKDE is discussed. Three mean integrated squared…

Methodology · Statistics 2011-11-28 Bin Wang , Xiaofeng Wang

Markov chain Monte Carlo samplers produce dependent streams of variates drawn from the limiting distribution of the Markov chain. With this as motivation, we introduce novel univariate kernel density estimators which are appropriate for the…

Methodology · Statistics 2016-07-29 Hang J. Kim , Steven N. MacEachern , Yoonsuh Jung

We present an efficient method to estimate cross-validation bandwidth parameters for kernel density estimation in very large datasets where ordinary cross-validation is rendered highly inefficient, both statistically and computationally.…

Methodology · Statistics 2016-09-02 Anirban Bhattacharya , Jeffrey D. Hart

A kernel density estimator (KDE) is one of the most popular non-parametric density estimators. In this paper we focus on a best bandwidth selection method for use in an analogue of a classical KDE using the tropical symmetric distance,…

Populations and Evolution · Quantitative Biology 2025-12-30 Ruriko Yoshida , Zhiwen Wang

A popular data-driven method for choosing the bandwidth in standard kernel regression is cross-validation. Even when there are outliers in the data, robust kernel regression can be used to estimate the unknown regression curve [Robust and…

Statistics Theory · Mathematics 2007-06-13 Denis Heng-Yan Leung

Averaging provides an alternative to bandwidth selection for density kernel estimation. We propose a procedure to combine linearly several kernel estimators of a density obtained from different, possibly data-driven, bandwidths. The method…

Statistics Theory · Mathematics 2019-11-05 O. Chernova , F. Lavancier , P. Rochet

With machine learning being a popular topic in current computational materials science literature, creating representations for compounds has become common place. These representations are rarely compared, as evaluating their performance -…

Machine Learning · Computer Science 2023-05-26 Samantha Durdy , Michael Gaultois , Vladimir Gusev , Danushka Bollegala , Matthew J. Rosseinsky

Length-biased data are a particular case of weighted data, which arise in many situations: biomedicine, quality control or epidemiology among others. In this paper we study the theoretical properties of kernel density estimation in the…

Allthough nonparametric kernel density estimation with bias reduce is nowadays a standard technique in explorative data-analysis, there is still a big dispute on how to assess the quality of the estimate and which choice of bandwidth is…

Methodology · Statistics 2019-03-26 Hamza Dhakera , El Hadji Demeb , Youssou Cissb

We present a methodology for model evaluation and selection where the sampling mechanism violates the i.i.d. assumption. Our methodology involves a formulation of the bias between the standard Cross-Validation (CV) estimator and the mean…

Methodology · Statistics 2025-03-14 Oren Yuval , Saharon Rosset

We consider the problem of bandwidth selection by cross-validation from a sequential point of view in a nonparametric regression model. Having in mind that in applications one often aims at estimation, prediction and change detection…

Statistics Theory · Mathematics 2018-03-20 Ansgar Steland

Common cross-validation (CV) methods like k-fold cross-validation or Monte-Carlo cross-validation estimate the predictive performance of a learner by repeatedly training it on a large portion of the given data and testing on the remaining…

Machine Learning · Computer Science 2021-11-30 Felix Mohr , Jan N. van Rijn

We define a new bandwidth-dependent kernel density estimator that improves existing convergence rates for the bias, and preserves that of the variation, when the error is measured in $L_1$. No additional assumptions are imposed to the…

Statistics Theory · Mathematics 2016-12-28 Kairat Mynbaev , Carlos Martins-Filho

This paper presents a method for hyperspectral image classification that uses support vector data description (SVDD) with the Gaussian kernel function. SVDD has been a popular machine learning technique for single-class classification, but…

Applications · Statistics 2019-04-08 Yuwei Liao , Deovrat Kakde , Arin Chaudhuri , Hansi Jiang , Carol Sadek , Seunghyun Kong
‹ Prev 1 2 3 10 Next ›