Related papers: Multiclass histogram-based thresholding using kern…

Kernel Density Estimation for Multiclass Quantification

Several disciplines, like the social sciences, epidemiology, sentiment analysis, or market research, are interested in knowing the distribution of the classes in a population rather than the individual labels of the members thereof.…

Machine Learning · Computer Science 2024-01-04 Alejandro Moreo , Pablo González , Juan José del Coz

A new interpoint distance-based clustering algorithm using kernel density estimation

A novel nonparametric clustering algorithm is proposed using the interpoint distances between the members of the data to reveal the inherent clustering structure existing in the given set of data, where we apply the classical nonparametric…

Methodology · Statistics 2024-09-02 Soumita Modak

Kernel Biclustering algorithm in Hilbert Spaces

Biclustering algorithms partition data and covariates simultaneously, providing new insights in several domains, such as analyzing gene expression to discover new biological functions. This paper develops a new model-free biclustering…

Methodology · Statistics 2022-08-09 Marcos Matabuena , J. C Vidal , Oscar Hernan Madrid Padilla , Dino Sejdinovic

Kernel density estimation based sampling for imbalanced class distribution

Imbalanced response variable distribution is a common occurrence in data science. In fields such as fraud detection, medical diagnostics, system intrusion detection and many others where abnormal behavior is rarely observed the data under…

Machine Learning · Computer Science 2019-11-21 Firuz Kamalov

A Novel Approach to Threshold Quantum Images by using Unsharp Measurements

We propose a hybrid quantum approach to threshold and binarize a grayscale image through unsharp measurements (UM) relying on image histogram. Generally, the histograms are characterized by multiple overlapping normal distributions…

Quantum Physics · Physics 2024-01-02 Ayan Barui , Mayukha Pal , Prasanta K. Panigrahi

Mixed-type Distance Shrinkage and Selection for Clustering via Kernel Metric Learning

Distance-based clustering and classification are widely used in various fields to group mixed numeric and categorical data. In many algorithms, a predefined distance measurement is used to cluster data points based on their dissimilarity.…

Machine Learning · Computer Science 2024-10-14 Jesse S. Ghashti , John R. J. Thompson

Kernel Density Estimation and Convolution Revisited

Kernel Density Estimation (KDE) is a cornerstone of nonparametric statistics, yet it remains sensitive to bandwidth choice, boundary bias, and computational inefficiency. This study revisits KDE through a principled convolutional framework,…

Methodology · Statistics 2025-10-24 Nicholas Tenkorang , Kwesi Appau Ohene-Obeng , Xiaogang Su

Fast Kernel Density Estimation with Density Matrices and Random Fourier Features

Kernel density estimation (KDE) is one of the most widely used nonparametric density estimation methods. The fact that it is a memory-based method, i.e., it uses the entire training data set for prediction, makes it unsuitable for most…

Machine Learning · Computer Science 2022-08-08 Joseph A. Gallego , Juan F. Osorio , Fabio A. González

Unsupervised Learning of Density Estimates with Topological Optimization

Kernel density estimation is a key component of a wide variety of algorithms in machine learning, Bayesian inference, stochastic dynamics and signal processing. However, the unsupervised density estimation technique requires tuning a…

Machine Learning · Computer Science 2025-12-17 Sunia Tanweer , Firas A. Khasawneh

Nonparametric density estimation with a parametric start

The traditional kernel density estimator of an unknown density is by construction completely nonparametric, in the sense that it has no preferences and will work reasonably well for all shapes. The present paper develops a class of…

Methodology · Statistics 2026-05-05 Nils Lid Hjort , Ingrid Kristine Glad

Kernel Density Machines

We introduce kernel density machines (KDM), an agnostic kernel-based framework for learning the Radon-Nikodym derivative (density) between probability measures under minimal assumptions. KDM applies to general measurable spaces and avoids…

Machine Learning · Statistics 2026-03-27 Andrea Della Vecchia , Damir Filipovic , Paul Schneider

Kernel Density Estimation by Stagewise Algorithm with a Simple Dictionary

This study proposes multivariate kernel density estimation by stagewise minimization algorithm based on $U$-divergence and a simple dictionary. The dictionary consists of an appropriate scalar bandwidth matrix and a part of the original…

Machine Learning · Statistics 2021-08-11 Kiheiji Nishida , Kanta Naito

Clustering by Deep Nearest Neighbor Descent (D-NND): A Density-based Parameter-Insensitive Clustering Method

Most density-based clustering methods largely rely on how well the underlying density is estimated. However, density estimation itself is also a challenging problem, especially the determination of the kernel bandwidth. A large bandwidth…

Machine Learning · Statistics 2015-12-08 Teng Qiu , Yongjie Li

Conditional Density Estimation with Histogram Trees

Conditional density estimation (CDE) goes beyond regression by modeling the full conditional distribution, providing a richer understanding of the data than just the conditional mean in regression. This makes CDE particularly useful in…

Machine Learning · Computer Science 2024-10-16 Lincen Yang , Matthijs van Leeuwen

Learning the kernel matrix by resampling

In this abstract paper, we introduce a new kernel learning method by a nonparametric density estimator. The estimator consists of a group of k-centroids clusterings. Each clustering randomly selects data points with randomly selected…

Machine Learning · Computer Science 2017-08-02 Xiao-Lei Zhang

Feature Selection For High-Dimensional Clustering

We present a nonparametric method for selecting informative features in high-dimensional clustering problems. We start with a screening step that uses a test for multimodality. Then we apply kernel density estimation and mode clustering to…

Statistics Theory · Mathematics 2014-06-10 Larry Wasserman , Martin Azizyan , Aarti Singh

A new measure for assessment of clustering based on kernel density estimation

A new clustering accuracy measure is proposed to determine the unknown number of clusters and to assess the quality of clustering of a data set given in any dimensional space. Our validity index applies the classical nonparametric…

Methodology · Statistics 2022-02-15 Soumita Modak

A Hybrid Approach To Hierarchical Density-based Cluster Selection

HDBSCAN is a density-based clustering algorithm that constructs a cluster hierarchy tree and then uses a specific stability measure to extract flat clusters from the tree. We show how the application of an additional threshold value can…

Databases · Computer Science 2021-01-22 Claudia Malzer , Marcus Baum

Resolution-Based Distillation for Efficient Histology Image Classification

Developing deep learning models to analyze histology images has been computationally challenging, as the massive size of the images causes excessive strain on all parts of the computing pipeline. This paper proposes a novel deep…

Image and Video Processing · Electrical Eng. & Systems 2021-01-13 Joseph DiPalma , Arief A. Suriawinata , Laura J. Tafe , Lorenzo Torresani , Saeed Hassanpour

Learning Kernel Tests Without Data Splitting

Modern large-scale kernel-based tests such as maximum mean discrepancy (MMD) and kernelized Stein discrepancy (KSD) optimize kernel hyperparameters on a held-out sample via data splitting to obtain the most powerful test statistics. While…

Machine Learning · Computer Science 2020-10-20 Jonas M. Kübler , Wittawat Jitkrittum , Bernhard Schölkopf , Krikamol Muandet