Related papers: Kernel density estimation-based sampling for neura…

Kernel density estimation based sampling for imbalanced class distribution

Imbalanced response variable distribution is a common occurrence in data science. In fields such as fraud detection, medical diagnostics, system intrusion detection and many others where abnormal behavior is rarely observed the data under…

Machine Learning · Computer Science 2019-11-21 Firuz Kamalov

Variational Weighting for Kernel Density Ratios

Kernel density estimation (KDE) is integral to a range of generative and discriminative tasks in machine learning. Drawing upon tools from the multidimensional calculus of variations, we derive an optimal weight function that reduces bias…

Machine Learning · Computer Science 2023-11-07 Sangwoong Yoon , Frank C. Park , Gunsu S Yun , Iljung Kim , Yung-Kyun Noh

Fast Kernel Density Estimation with Density Matrices and Random Fourier Features

Kernel density estimation (KDE) is one of the most widely used nonparametric density estimation methods. The fact that it is a memory-based method, i.e., it uses the entire training data set for prediction, makes it unsuitable for most…

Machine Learning · Computer Science 2022-08-08 Joseph A. Gallego , Juan F. Osorio , Fabio A. González

Robust Kernel Density Estimation

We propose a method for nonparametric density estimation that exhibits robustness to contamination of the training sample. This method achieves robustness by combining a traditional kernel density estimator (KDE) with ideas from classical…

Machine Learning · Statistics 2011-09-07 JooSeuk Kim , Clayton D. Scott

On Imbalanced Regression with Hoeffding Trees

Many real-world applications generate continuous data streams for regression. Hoeffding trees and their variants have a long-standing tradition due to their effectiveness, either alone or as base models in broader ensembles. Recent…

Machine Learning · Computer Science 2026-03-06 Pantia-Marina Alchirch , Dimitrios I. Diochnos

Robust Stochastic Configuration Networks with Kernel Density Estimation

Neural networks have been widely used as predictive models to fit data distribution, and they could be implemented through learning a collection of samples. In many applications, however, the given dataset may contain noisy samples or…

Neural and Evolutionary Computing · Computer Science 2017-05-30 Dianhui Wang , Ming Li

Partial Resampling of Imbalanced Data

Imbalanced data is a frequently encountered problem in machine learning. Despite a vast amount of literature on sampling techniques for imbalanced data, there is a limited number of studies that address the issue of the optimal sampling…

Machine Learning · Computer Science 2022-07-12 Firuz Kamalov , Amir F. Atiya , Dina Elreedy

Survey of resampling techniques for improving classification performance in unbalanced datasets

A number of classification problems need to deal with data imbalance between classes. Often it is desired to have a high recall on the minority class while maintaining a high precision on the majority class. In this paper, we review a…

Applications · Statistics 2016-08-23 Ajinkya More

An Empirical Analysis of the Efficacy of Different Sampling Techniques for Imbalanced Classification

Learning from imbalanced data is a challenging task. Standard classification algorithms tend to perform poorly when trained on imbalanced data. Some special strategies need to be adopted, either by modifying the data distribution or by…

Machine Learning · Computer Science 2022-08-26 Asif Newaz , Shahriar Hassan , Farhan Shahriyar Haq

Kernel-Based Enhanced Oversampling Method for Imbalanced Classification

This paper introduces a novel oversampling technique designed to improve classification performance on imbalanced datasets. The proposed method enhances the traditional SMOTE algorithm by incorporating convex combination and kernel-based…

Machine Learning · Computer Science 2025-04-15 Wenjie Li , Sibo Zhu , Zhijian Li , Hanlin Wang

Adaptive Kernel Density Estimation for Improved Sky Map Computation in Gamma-Ray Astronomy

We introduce an alternative method for the calculation of sky maps from data taken with gamma-ray telescopes. In contrast to the established method of smoothing the 2D histogram of reconstructed event directions with a static kernel, we…

High Energy Astrophysical Phenomena · Physics 2024-01-30 M. Holler , T. Mitterdorfer , S. Panny

Neural Network Based Undersampling Techniques

Class imbalance problem is commonly faced while developing machine learning models for real-life issues. Due to this problem, the fitted model tends to be biased towards the majority class data, which leads to lower precision, recall, AUC,…

Machine Learning · Computer Science 2019-08-20 Md. Adnan Arefeen , Sumaiya Tabassum Nimi , M Sohel Rahman

Dual-Tree Fast Gauss Transforms

Kernel density estimation (KDE) is a popular statistical technique for estimating the underlying density distribution with minimal assumptions. Although they can be shown to achieve asymptotic estimation optimality for any input…

Computation · Statistics 2011-02-15 Dongryeol Lee , Alexander G. Gray , Andrew W. Moore

Kernel Density Estimation for Multiclass Quantification

Several disciplines, like the social sciences, epidemiology, sentiment analysis, or market research, are interested in knowing the distribution of the classes in a population rather than the individual labels of the members thereof.…

Machine Learning · Computer Science 2024-01-04 Alejandro Moreo , Pablo González , Juan José del Coz

Kernel Density Estimation by Genetic Algorithm

This study proposes a data condensation method for multivariate kernel density estimation by genetic algorithm. First, our proposed algorithm generates multiple subsamples of a given size with replacement from the original sample. The…

Methodology · Statistics 2022-03-04 Kiheiji Nishida

A Tutorial on Kernel Density Estimation and Recent Advances

This tutorial provides a gentle introduction to kernel density estimation (KDE) and recent advances regarding confidence bands and geometric/topological features. We begin with a discussion of basic properties of KDE: the convergence rate…

Methodology · Statistics 2017-09-13 Yen-Chi Chen

Kernel Density Balancing

High-throughput chromatin conformation capture (Hi-C) data provide insights into the 3D structure of chromosomes, with normalization being a crucial pre-processing step. A common technique for normalization is matrix balancing, which…

Applications · Statistics 2025-06-17 John Park , Ning Hao , Yue Selena Niu , Ming Hu

Bandwidth Selection for Weighted Kernel Density Estimation

In the this paper, the authors propose to estimate the density of a targeted population with a weighted kernel density estimator (wKDE) based on a weighted sample. Bandwidth selection for wKDE is discussed. Three mean integrated squared…

Methodology · Statistics 2011-11-28 Bin Wang , Xiaofeng Wang

A Study imbalance handling by various data sampling methods in binary classification

The purpose of this research report is to present the our learning curve and the exposure to the Machine Learning life cycle, with the use of a Kaggle binary classification data set and taking to explore various techniques from…

Machine Learning · Computer Science 2021-05-25 Mohamed Hamama

A hybrid model of kernel density estimation and quantile regression for GEFCom2014 probabilistic load forecasting

We present a model for generating probabilistic forecasts by combining kernel density estimation (KDE) and quantile regression techniques, as part of the probabilistic load forecasting track of the Global Energy Forecasting Competition…

Applications · Statistics 2016-10-18 Stephen Haben , Georgios Giasemidis