Related papers: Distributed Nearest Neighbor Classification

Rates of Convergence for Large-scale Nearest Neighbor Classification

Nearest neighbor is a popular class of classification methods with many desirable properties. For a large data set which cannot be loaded into the memory of a single machine due to computation, communication, privacy, or ownership…

Machine Learning · Statistics 2019-11-01 Xingye Qiao, Jiexin Duan, Guang Cheng

Distributed Adaptive Nearest Neighbor Classifier: Algorithm and Theory

When data is of an extraordinarily large size or physically stored in different locations, the distributed nearest neighbor (NN) classifier is an attractive tool for classification. We propose a novel distributed adaptive NN classifier for…

Machine Learning · Statistics 2023-06-06 Ruiqi Liu, Ganggang Xu, Zuofeng Shang

Rates of Convergence for Nearest Neighbor Classification

Nearest neighbor methods are a popular class of nonparametric estimators with several desirable properties, such as adaptivity to different distance scales in different regions of space. Prior work on convergence rates for nearest neighbor…

Machine Learning · Computer Science 2014-07-03 Kamalika Chaudhuri, Sanjoy Dasgupta

Local Distribution in Neighborhood for Classification

The k-nearest-neighbor method performs classification tasks for a query sample based on the information contained in its neighborhood. Previous studies into the k-nearest-neighbor algorithm usually achieved the decision value for a class by…

Machine Learning · Computer Science 2018-12-10 Chengsheng Mao, Bin Hu, Lei Chen, Philip Moore, Xiaowei Zhang

Nearest Neighbor Classification based on Imbalanced Data: A Statistical Approach

When the competing classes in a classification problem are not of comparable size, many popular classifiers exhibit a bias towards larger classes, and the nearest neighbor classifier is no exception. To take care of this problem, we develop…

Methodology · Statistics 2023-11-02 Anvit Garg, Anil K. Ghosh, Soham Sarkar

Discriminative Learning of the Prototype Set for Nearest Neighbor Classification

The nearest neighbor rule is a classic yet essential classification model, particularly in problems where the supervising information is given by pairwise dissimilarities and the embedding function is not easily obtained. Prototype…

Machine Learning · Computer Science 2018-01-09 Shin Ando

A new hashing based nearest neighbors selection technique for big datasets

KNN has the reputation of being the world's simplest yet efficient supervised learning algorithm, used for either classification or regression. KNN prediction efficiency depends heavily on the size of its training data, but when this training data…

Machine Learning · Computer Science 2021-07-01 Jude Tchaye-Kondi, Yanlong Zhai, Liehuang Zhu

Distributed Parameter Estimation via Pseudo-likelihood

Estimating statistical models within sensor networks requires distributed algorithms, in which both data and computation are distributed across the nodes of the network. We propose a general approach for distributed learning based on…

Machine Learning · Computer Science 2012-07-03 Qiang Liu, Alexander Ihler

k*-Nearest Neighbors: From Global to Local

The weighted k-nearest neighbors algorithm is one of the most fundamental non-parametric methods in pattern recognition and machine learning. The question of setting the optimal number of neighbors as well as the optimal weights has…

Machine Learning · Statistics 2017-01-26 Oren Anava, Kfir Y. Levy

Enhanced Nearest Neighbor Classification for Crowdsourcing

In machine learning, crowdsourcing is an economical way to label a large amount of data. However, the noise in the produced labels may deteriorate the accuracy of any classification method applied to the labelled data. We propose an…

Human-Computer Interaction · Computer Science 2022-03-03 Jiexin Duan, Xingye Qiao, Guang Cheng

A Two-Stage Active Learning Algorithm for $k$-Nearest Neighbors

$k$-nearest neighbor classification is a popular non-parametric method because of desirable properties like automatic adaption to distributional scale changes. Unfortunately, it has thus far proved difficult to design active learning…

Machine Learning · Computer Science 2023-08-22 Nick Rittler, Kamalika Chaudhuri

Hyperdimensional Computing for Efficient Distributed Classification with Randomized Neural Networks

In the supervised learning domain, considering the recent prevalence of algorithms with high computational cost, the attention is steering towards simpler, lighter, and less computationally extensive training and inference approaches. In…

Machine Learning · Computer Science 2022-09-02 Antonello Rosato, Massimo Panella, Denis Kleyko

On high-dimensional modifications of the nearest neighbor classifier

The nearest neighbor classifier is arguably the simplest and most popular nonparametric classifier available in the literature. However, due to the concentration of pairwise distances and the violation of the neighborhood structure, this…

Machine Learning · Statistics 2024-10-25 Annesha Ghosh, Deep Ghoshal, Bilol Banerjee, Anil K. Ghosh

Explaining the Success of Nearest Neighbor Methods in Prediction

Many modern methods for prediction leverage nearest neighbor search to find past training examples most similar to a test example, an idea that dates back in text to at least the 11th century and has stood the test of time. This monograph…

Machine Learning · Computer Science 2025-02-25 George H. Chen, Devavrat Shah

A Weighted Mutual k-Nearest Neighbour for Classification Mining

kNN is a very effective instance-based learning method, and it is easy to implement. Due to the heterogeneous nature of data, noise from different possible sources is also widespread, especially in large-scale databases. For…

Machine Learning · Computer Science 2020-05-19 Joydip Dhar, Ashaya Shukla, Mukul Kumar, Prashant Gupta

Finding Nearest Neighbors in graphs locally

Many distributed learning techniques have been motivated by the increasing size of datasets and their inability to fit into main memory on a single machine. We propose an algorithm that finds the nearest neighbor in a graph locally without…

Data Structures and Algorithms · Computer Science 2019-02-18 Abhinav Mishra

Optimal Nonparametric Inference with Two-Scale Distributional Nearest Neighbors

The weighted nearest neighbors (WNN) estimator has been popularly used as a flexible and easy-to-implement nonparametric tool for mean regression estimation. The bagging technique is an elegant way to form WNN estimators with weights…

Machine Learning · Statistics 2022-07-19 Emre Demirkaya, Yingying Fan, Lan Gao, Jinchi Lv, Patrick Vossler, Jingbo Wang

Network Sampling Based on NN Representatives

The amount of large-scale real data around us increases very quickly, and so does the necessity to reduce its size by obtaining a representative sample. Such a sample allows us to use a great variety of analytical methods, whose direct…

Social and Information Networks · Computer Science 2014-02-10 Milos Kudelka, Sarka Zehnalova, Jan Platos

Classifying large-scale image data into object categories is an important problem that has received increasing research attention. Given the huge amount of data, non-parametric approaches such as nearest neighbor classifiers have shown…

Computer Vision and Pattern Recognition · Computer Science 2014-04-28 Zhaowen Wang, Jianchao Yang, Zhe Lin, Jonathan Brandt, Shiyu Chang, Thomas Huang

Communication-Efficient Sampling for Distributed Training of Graph Convolutional Networks

Training Graph Convolutional Networks (GCNs) is expensive as it needs to aggregate data recursively from neighboring nodes. To reduce the computation overhead, previous works have proposed various neighbor sampling methods that estimate the…

Machine Learning · Computer Science 2021-01-20 Peng Jiang, Masuma Akter Rumi