Related papers: Robust nearest-neighbor methods for classifying hi…

Distributionally Robust Weighted $k$-Nearest Neighbors

Learning a robust classifier from a few samples remains a key challenge in machine learning. A major thrust of research has been focused on developing $k$-nearest neighbor ($k$-NN) based algorithms combined with metric learning that…

Machine Learning · Statistics 2022-02-17 Shixiang Zhu, Liyan Xie, Minghe Zhang, Rui Gao, Yao Xie

Distributed Adaptive Nearest Neighbor Classifier: Algorithm and Theory

When data is of an extraordinarily large size or physically stored in different locations, the distributed nearest neighbor (NN) classifier is an attractive tool for classification. We propose a novel distributed adaptive NN classifier for…

Machine Learning · Statistics 2023-06-06 Ruiqi Liu, Ganggang Xu, Zuofeng Shang

Consistent Non-Parametric Methods for Maximizing Robustness

Learning classifiers that are robust to adversarial examples has received a great deal of recent attention. A major drawback of the standard robust learning framework is that there is an artificial robustness radius $r$ that applies to all…

Machine Learning · Computer Science 2023-01-19 Robi Bhattacharjee, Kamalika Chaudhuri

Finding Relevant Points for Nearest-Neighbor Classification

In nearest-neighbor classification problems, a set of $d$-dimensional training points are given, each with a known classification, and are used to infer unknown classifications of other points by using the same classification as the nearest…

Data Structures and Algorithms · Computer Science 2021-10-13 David Eppstein

Fast Nearest-Neighbor Classification using RNN in Domains with Large Number of Classes

In scenarios involving text classification where the number of classes is large (in multiples of 10000s) and training samples for each class are few and often verbose, nearest neighbor methods are effective but very slow in computing a…

Information Retrieval · Computer Science 2017-12-12 Gautam Singh, Gargi Dasgupta, Yu Deng

Rates of Convergence for Large-scale Nearest Neighbor Classification

Nearest neighbor is a popular class of classification methods with many desirable properties. For a large data set which cannot be loaded into the memory of a single machine due to computation, communication, privacy, or ownership…

Machine Learning · Statistics 2019-11-01 Xingye Qiao, Jiexin Duan, Guang Cheng

Extracting robust and accurate features via a robust information bottleneck

We propose a novel strategy for extracting features in supervised learning that can be used to construct a classifier which is more robust to small perturbations in the input space. Our method builds upon the idea of the information…

Machine Learning · Computer Science 2019-10-16 Ankit Pensia, Varun Jog, Po-Ling Loh

Robust Classification of High-Dimensional Data using Data-Adaptive Energy Distance

Classification of high-dimensional low sample size (HDLSS) data poses a challenge in a variety of real-world situations, such as gene expression studies, cancer research, and medical imaging. This article presents the development and…

Machine Learning · Statistics 2026-05-27 Jyotishka Ray Choudhury, Aytijhya Saha, Sarbojit Roy, Subhajit Dutta

k-Nearest Neighbour Classifiers: 2nd Edition (with Python examples)

Perhaps the most straightforward classifier in the arsenal of machine learning techniques is the Nearest Neighbour Classifier -- classification is achieved by identifying the nearest neighbours to a query example and using those neighbours…

Machine Learning · Computer Science 2021-08-10 Padraig Cunningham, Sarah Jane Delany

RNNP: A Robust Few-Shot Learning Approach

Learning from a few examples is an important practical aspect of training classifiers. Various works have examined this aspect quite well. However, all existing approaches assume that the few examples provided are always correctly labeled.…

Computer Vision and Pattern Recognition · Computer Science 2020-11-24 Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri

An Extensive Experimental Study on the Cluster-based Reference Set Reduction for speeding-up the k-NN Classifier

The k-Nearest Neighbor (k-NN) classification algorithm is one of the most widely-used lazy classifiers because of its simplicity and ease of implementation. It is considered to be an effective classifier and has many applications. However,…

Machine Learning · Computer Science 2014-02-13 Stefanos Ougiaroglou, Georgios Evangelidis, Dimitris A. Dervos

Robust Classification via Support Vector Machines

Classification models are very sensitive to data uncertainty, and finding robust classifiers that are less sensitive to data uncertainty has raised great interest in the machine learning literature. This paper aims to construct robust…

Machine Learning · Statistics 2022-03-01 Vali Asimit, Ioannis Kyriakou, Simone Santoni, Salvatore Scognamiglio, Rui Zhu

Rates of Convergence for Nearest Neighbor Classification

Nearest neighbor methods are a popular class of nonparametric estimators with several desirable properties, such as adaptivity to different distance scales in different regions of space. Prior work on convergence rates for nearest neighbor…

Machine Learning · Computer Science 2014-07-03 Kamalika Chaudhuri, Sanjoy Dasgupta

Learning Deep Nearest Neighbor Representations Using Differentiable Boundary Trees

Nearest neighbor (kNN) methods have been gaining popularity in recent years in light of advances in hardware and efficiency of algorithms. There is a plethora of methods to choose from today, each with their own advantages and…

Machine Learning · Computer Science 2017-03-01 Daniel Zoran, Balaji Lakshminarayanan, Charles Blundell

A new hashing based nearest neighbors selection technique for big datasets

KNN has a reputation as the world's simplest yet efficient supervised learning algorithm, used for either classification or regression. KNN prediction efficiency highly depends on the size of its training data but when this training data…

Machine Learning · Computer Science 2021-07-01 Jude Tchaye-Kondi, Yanlong Zhai, Liehuang Zhu

On the Resistance of Nearest Neighbor to Random Noisy Labels

Nearest neighbor has always been one of the most appealing non-parametric approaches in machine learning, pattern recognition, computer vision, etc. Previous empirical studies partly show that nearest neighbor is resistant to noise, yet…

Machine Learning · Computer Science 2018-09-14 Wei Gao, Bin-Bin Yang, Zhi-Hua Zhou

Robust Gradient Descent Estimation for Tensor Models under Heavy-Tailed Distributions

Low-rank tensor models are widely used in statistics. However, most existing methods rely heavily on the assumption that data follows a sub-Gaussian distribution. To address the challenges associated with heavy-tailed distributions…

Methodology · Statistics 2025-09-16 Xiaoyu Zhang, Di Wang, Guodong Li, Defeng Sun

A Formally Robust Time Series Distance Metric

Distance-based classification is among the most competitive classification methods for time series data. The most critical component of distance-based classification is the selected distance function. Past research has proposed various…

Machine Learning · Computer Science 2020-08-19 Maximilian Toller, Bernhard C. Geiger, Roman Kern

Feature selection using nearest attributes

Feature selection is an important problem in high-dimensional data analysis and classification. Conventional feature selection approaches focus on detecting the features based on a redundancy criterion using learning and feature searching…

Computer Vision and Pattern Recognition · Computer Science 2012-01-31 Alex Pappachen James, Sima Dimitrijev

Improving classifier decision boundaries using nearest neighbors

Neural networks are not learning optimal decision boundaries. We show that decision boundaries are situated in areas of low training data density. They are impacted by few training samples which can easily lead to overfitting. We provide a…

Machine Learning · Computer Science 2023-10-09 Johannes Schneider