English
Related papers

Related papers: Kernel density estimation-based sampling for neura…

200 papers

Imbalanced response variable distribution is a common occurrence in data science. In fields such as fraud detection, medical diagnostics, system intrusion detection and many others where abnormal behavior is rarely observed the data under…

Machine Learning · Computer Science 2019-11-21 Firuz Kamalov

Kernel density estimation (KDE) is integral to a range of generative and discriminative tasks in machine learning. Drawing upon tools from the multidimensional calculus of variations, we derive an optimal weight function that reduces bias…

Machine Learning · Computer Science 2023-11-07 Sangwoong Yoon , Frank C. Park , Gunsu S Yun , Iljung Kim , Yung-Kyun Noh

Kernel density estimation (KDE) is one of the most widely used nonparametric density estimation methods. The fact that it is a memory-based method, i.e., it uses the entire training data set for prediction, makes it unsuitable for most…

Machine Learning · Computer Science 2022-08-08 Joseph A. Gallego , Juan F. Osorio , Fabio A. González

We propose a method for nonparametric density estimation that exhibits robustness to contamination of the training sample. This method achieves robustness by combining a traditional kernel density estimator (KDE) with ideas from classical…

Machine Learning · Statistics 2011-09-07 JooSeuk Kim , Clayton D. Scott

Many real-world applications generate continuous data streams for regression. Hoeffding trees and their variants have a long-standing tradition due to their effectiveness, either alone or as base models in broader ensembles. Recent…

Machine Learning · Computer Science 2026-03-06 Pantia-Marina Alchirch , Dimitrios I. Diochnos

Neural networks have been widely used as predictive models to fit data distribution, and they could be implemented through learning a collection of samples. In many applications, however, the given dataset may contain noisy samples or…

Neural and Evolutionary Computing · Computer Science 2017-05-30 Dianhui Wang , Ming Li

Imbalanced data is a frequently encountered problem in machine learning. Despite a vast amount of literature on sampling techniques for imbalanced data, there is a limited number of studies that address the issue of the optimal sampling…

Machine Learning · Computer Science 2022-07-12 Firuz Kamalov , Amir F. Atiya , Dina Elreedy

A number of classification problems need to deal with data imbalance between classes. Often it is desired to have a high recall on the minority class while maintaining a high precision on the majority class. In this paper, we review a…

Applications · Statistics 2016-08-23 Ajinkya More

Learning from imbalanced data is a challenging task. Standard classification algorithms tend to perform poorly when trained on imbalanced data. Some special strategies need to be adopted, either by modifying the data distribution or by…

Machine Learning · Computer Science 2022-08-26 Asif Newaz , Shahriar Hassan , Farhan Shahriyar Haq

This paper introduces a novel oversampling technique designed to improve classification performance on imbalanced datasets. The proposed method enhances the traditional SMOTE algorithm by incorporating convex combination and kernel-based…

Machine Learning · Computer Science 2025-04-15 Wenjie Li , Sibo Zhu , Zhijian Li , Hanlin Wang

We introduce an alternative method for the calculation of sky maps from data taken with gamma-ray telescopes. In contrast to the established method of smoothing the 2D histogram of reconstructed event directions with a static kernel, we…

High Energy Astrophysical Phenomena · Physics 2024-01-30 M. Holler , T. Mitterdorfer , S. Panny

Class imbalance problem is commonly faced while developing machine learning models for real-life issues. Due to this problem, the fitted model tends to be biased towards the majority class data, which leads to lower precision, recall, AUC,…

Machine Learning · Computer Science 2019-08-20 Md. Adnan Arefeen , Sumaiya Tabassum Nimi , M Sohel Rahman

Kernel density estimation (KDE) is a popular statistical technique for estimating the underlying density distribution with minimal assumptions. Although they can be shown to achieve asymptotic estimation optimality for any input…

Computation · Statistics 2011-02-15 Dongryeol Lee , Alexander G. Gray , Andrew W. Moore

Several disciplines, like the social sciences, epidemiology, sentiment analysis, or market research, are interested in knowing the distribution of the classes in a population rather than the individual labels of the members thereof.…

Machine Learning · Computer Science 2024-01-04 Alejandro Moreo , Pablo González , Juan José del Coz

This study proposes a data condensation method for multivariate kernel density estimation by genetic algorithm. First, our proposed algorithm generates multiple subsamples of a given size with replacement from the original sample. The…

Methodology · Statistics 2022-03-04 Kiheiji Nishida

This tutorial provides a gentle introduction to kernel density estimation (KDE) and recent advances regarding confidence bands and geometric/topological features. We begin with a discussion of basic properties of KDE: the convergence rate…

Methodology · Statistics 2017-09-13 Yen-Chi Chen

High-throughput chromatin conformation capture (Hi-C) data provide insights into the 3D structure of chromosomes, with normalization being a crucial pre-processing step. A common technique for normalization is matrix balancing, which…

Applications · Statistics 2025-06-17 John Park , Ning Hao , Yue Selena Niu , Ming Hu

In the this paper, the authors propose to estimate the density of a targeted population with a weighted kernel density estimator (wKDE) based on a weighted sample. Bandwidth selection for wKDE is discussed. Three mean integrated squared…

Methodology · Statistics 2011-11-28 Bin Wang , Xiaofeng Wang

The purpose of this research report is to present the our learning curve and the exposure to the Machine Learning life cycle, with the use of a Kaggle binary classification data set and taking to explore various techniques from…

Machine Learning · Computer Science 2021-05-25 Mohamed Hamama

We present a model for generating probabilistic forecasts by combining kernel density estimation (KDE) and quantile regression techniques, as part of the probabilistic load forecasting track of the Global Energy Forecasting Competition…

Applications · Statistics 2016-10-18 Stephen Haben , Georgios Giasemidis
‹ Prev 1 2 3 10 Next ›