Related papers: Distributed Adaptive Nearest Neighbor Classifier: …

Rates of Convergence for Large-scale Nearest Neighbor Classification

Nearest neighbor is a popular class of classification methods with many desirable properties. For a large data set which cannot be loaded into the memory of a single machine due to computation, communication, privacy, or ownership…

Machine Learning · Statistics 2019-11-01 Xingye Qiao, Jiexin Duan, Guang Cheng

Adaptive Nearest Neighbor: A General Framework for Distance Metric Learning

$K$-NN classifier is one of the most famous classification algorithms, whose performance is crucially dependent on the distance metric. When we consider the distance metric as a parameter of $K$-NN, learning an appropriate distance metric…

Machine Learning · Computer Science 2019-11-26 Kun Song

Distributed Nearest Neighbor Classification

Nearest neighbor is a popular nonparametric method for classification and regression with many appealing properties. In the big data era, the sheer volume and spatial/temporal disparity of big data may prohibit centrally processing and…

Statistics Theory · Mathematics 2018-12-13 Jiexin Duan, Xingye Qiao, Guang Cheng

Minimax Rate Optimal Adaptive Nearest Neighbor Classification and Regression

The k Nearest Neighbor (kNN) method is a simple and popular statistical method for classification and regression. For both classification and regression problems, existing works have shown that, if the distribution of the feature vector has…

Statistics Theory · Mathematics 2019-10-24 Puning Zhao, Lifeng Lai

Minimax Optimal Algorithms with Fixed-$k$-Nearest Neighbors

This paper presents how to perform minimax optimal classification, regression, and density estimation based on fixed-$k$ nearest neighbor (NN) searches. We consider a distributed learning scenario, in which a massive dataset is split into…

Statistics Theory · Mathematics 2024-09-11 J. Jon Ryu, Young-Han Kim

Adaptive $k$-nearest neighbor classifier based on the local estimation of the shape operator

The $k$-nearest neighbor ($k$-NN) algorithm is one of the most popular methods for nonparametric classification. However, a relevant limitation concerns the definition of the number of neighbors $k$. This parameter exerts a direct impact on…

Machine Learning · Computer Science 2024-09-10 Alexandre Luís Magalhães Levada, Frank Nielsen, Michel Ferreira Cardia Haddad

An adaptive nearest neighbor rule for classification

We introduce a variant of the $k$-nearest neighbor classifier in which $k$ is chosen adaptively for each query, rather than supplied as a parameter. The choice of $k$ depends on properties of each neighborhood, and therefore may…

Machine Learning · Computer Science 2019-05-31 Akshay Balsubramani, Sanjoy Dasgupta, Yoav Freund, Shay Moran

Distributionally Robust Weighted $k$-Nearest Neighbors

Learning a robust classifier from a few samples remains a key challenge in machine learning. A major thrust of research has been focused on developing $k$-nearest neighbor ($k$-NN) based algorithms combined with metric learning that…

Machine Learning · Statistics 2022-02-17 Shixiang Zhu, Liyan Xie, Minghe Zhang, Rui Gao, Yao Xie

Robust nearest-neighbor methods for classifying high-dimensional data

We suggest a robust nearest-neighbor approach to classifying high-dimensional data. The method enhances sensitivity by employing a threshold and truncates to a sequence of zeros and ones in order to reduce the deleterious impact of…

Statistics Theory · Mathematics 2009-09-02 Yao-ban Chan, Peter Hall

Feasibility Based Large Margin Nearest Neighbor Metric Learning

Large margin nearest neighbor (LMNN) is a metric learner which optimizes the performance of the popular $k$NN classifier. However, its resulting metric relies on pre-selected target neighbors. In this paper, we address the feasibility of…

Data Structures and Algorithms · Computer Science 2018-05-03 Babak Hosseini, Barbara Hammer

An Improved k-Nearest Neighbor Algorithm for Text Categorization

k is the most important parameter in a text categorization system based on the k-Nearest Neighbor (kNN) algorithm. In the classification process, the k nearest documents to the test one in the training set are determined first. Then, the…

Computation and Language · Computer Science 2007-05-23 Baoli Li, Shiwen Yu, Qin Lu

PL-kNN: A Parameterless Nearest Neighbors Classifier

Demands for minimum parameter setup in machine learning models are desirable to avoid time-consuming optimization processes. The $k$-Nearest Neighbors is one of the most effective and straightforward models employed in numerous problems.…

Machine Learning · Computer Science 2022-10-03 Danilo Samuel Jodas, Leandro Aparecido Passos, Ahsan Adeel, João Paulo Papa

Discriminative Learning of the Prototype Set for Nearest Neighbor Classification

The nearest neighbor rule is a classic yet essential classification model, particularly in problems where the supervising information is given by pairwise dissimilarities and the embedding function is not easily obtained. Prototype…

Machine Learning · Computer Science 2018-01-09 Shin Ando

Nearest Neighbor Classification based on Imbalanced Data: A Statistical Approach

When the competing classes in a classification problem are not of comparable size, many popular classifiers exhibit a bias towards larger classes, and the nearest neighbor classifier is no exception. To take care of this problem, we develop…

Methodology · Statistics 2023-11-02 Anvit Garg, Anil K. Ghosh, Soham Sarkar

An Extensive Experimental Study on the Cluster-based Reference Set Reduction for speeding-up the k-NN Classifier

The k-Nearest Neighbor (k-NN) classification algorithm is one of the most widely-used lazy classifiers because of its simplicity and ease of implementation. It is considered to be an effective classifier and has many applications. However,…

Machine Learning · Computer Science 2014-02-13 Stefanos Ougiaroglou, Georgios Evangelidis, Dimitris A. Dervos

Social Distancing is Good for Points too!

The nearest-neighbor rule is a well-known classification technique that, given a training set P of labeled points, classifies any unlabeled query point with the label of its closest point in P. The nearest-neighbor condensation problem aims…

Computational Geometry · Computer Science 2020-06-30 Alejandro Flores-Velazco

Local nearest neighbour classification with applications to semi-supervised learning

We derive a new asymptotic expansion for the global excess risk of a local-$k$-nearest neighbour classifier, where the choice of $k$ may depend upon the test point. This expansion elucidates conditions under which the dominant contribution…

Statistics Theory · Mathematics 2019-05-21 Timothy I. Cannings, Thomas B. Berrett, Richard J. Samworth

Estimation based on nearest neighbor matching: from density ratio to average treatment effect

Nearest neighbor (NN) matching as a tool to align data sampled from different groups is both conceptually natural and practically well-used. In a landmark paper, Abadie and Imbens (2006) provided the first large-sample analysis of NN…

Statistics Theory · Mathematics 2021-12-28 Zhexiao Lin, Peng Ding, Fang Han

Adaptive kNN using Expected Accuracy for Classification of Geo-Spatial Data

The k-Nearest Neighbor (kNN) classification approach is conceptually simple - yet widely applied since it often performs well in practical applications. However, using a global constant k does not always provide an optimal solution, e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2018-01-08 Mark Kibanov, Martin Becker, Juergen Mueller, Martin Atzmueller, Andreas Hotho, Gerd Stumme

Local Distribution in Neighborhood for Classification

The k-nearest-neighbor method performs classification tasks for a query sample based on the information contained in its neighborhood. Previous studies into the k-nearest-neighbor algorithm usually achieved the decision value for a class by…

Machine Learning · Computer Science 2018-12-10 Chengsheng Mao, Bin Hu, Lei Chen, Philip Moore, Xiaowei Zhang