Related papers: Efficient Distributed Algorithms for the $K$-Neare…

Adaptive Estimation for Approximate k-Nearest-Neighbor Computations

Algorithms often carry out equally many computations for "easy" and "hard" problem instances. In particular, algorithms for finding nearest neighbors typically have the same running time regardless of the particular problem instance. In…

Data Structures and Algorithms · Computer Science 2020-03-25 Daniel LeJeune , Richard G. Baraniuk , Reinhard Heckel

An Efficient Algorithm for Bayesian Nearest Neighbours

K-Nearest Neighbours (k-NN) is a popular classification and regression algorithm, yet one of its main limitations is the difficulty in choosing the number of neighbours. We present a Bayesian algorithm to compute the posterior probability…

Machine Learning · Computer Science 2017-06-05 Giuseppe Nuti

Minimax Optimal Algorithms with Fixed-$k$-Nearest Neighbors

This paper presents how to perform minimax optimal classification, regression, and density estimation based on fixed-$k$ nearest neighbor (NN) searches. We consider a distributed learning scenario, in which a massive dataset is split into…

Statistics Theory · Mathematics 2024-09-11 J. Jon Ryu , Young-Han Kim

Distributed Algorithms for Large-Scale Graphs

Motivated by the increasing need for fast processing of large-scale graphs, we study a number of fundamental graph problems in a message-passing model for distributed computing, called $k$-machine model, where we have $k$ machines that…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-02-10 Khalid Hourani , Hartmut Klauck , William K. Moses , Danupon Nanongkai , Gopal Pandurangan , Peter Robinson , Michele Scquizzato

A Theory-Based Evaluation of Nearest Neighbor Models Put Into Practice

In the $k$-nearest neighborhood model ($k$-NN), we are given a set of points $P$, and we shall answer queries $q$ by returning the $k$ nearest neighbors of $q$ in $P$ according to some metric. This concept is crucial in many areas of data…

Machine Learning · Computer Science 2018-12-03 Hendrik Fichtenberger , Dennis Rohde

PL-kNN: A Parameterless Nearest Neighbors Classifier

Demands for minimum parameter setup in machine learning models are desirable to avoid time-consuming optimization processes. The $k$-Nearest Neighbors is one of the most effective and straightforward models employed in numerous problems.…

Machine Learning · Computer Science 2022-10-03 Danilo Samuel Jodas , Leandro Aparecido Passos , Ahsan Adeel , João Paulo Papa

Local Distribution in Neighborhood for Classification

The k-nearest-neighbor method performs classification tasks for a query sample based on the information contained in its neighborhood. Previous studies into the k-nearest-neighbor algorithm usually achieved the decision value for a class by…

Machine Learning · Computer Science 2018-12-10 Chengsheng Mao , Bin Hu , Lei Chen , Philip Moore , Xiaowei Zhang

On Distributed Algorithms for Minimum Dominating Set problem, from theory to application

In this paper, we propose a distributed algorithm for the minimum dominating set problem. For some especial networks, we prove theoretically that the achieved answer by our proposed algorithm is a constant approximation factor of the exact…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-01-05 Sharareh Alipour , Ehsan Futuhi , Shayan Karimi

Fast Distributed k-Means with a Small Number of Rounds

We propose a new algorithm for k-means clustering in a distributed setting, where the data is distributed across many machines, and a coordinator communicates with these machines to calculate the output clustering. Our algorithm guarantees…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-14 Tom Hess , Ron Visbord , Sivan Sabato

Near-Optimal Clustering in the $k$-machine model

The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in applications in bioinformatics, image processing, social network analysis, etc.). As sizes of data sets have grown…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-24 Sayan Bandyapadhyay , Tanmay Inamdar , Shreyas Pai , Sriram V. Pemmaraju

A Couple of Simple Algorithms for $k$-Dispersion

Given a set $P$ of $n$ points in $\mathbf{R}^d$, and a positive integer $k \leq n$, the $k$-dispersion problem is that of selecting $k$ of the given points so that the minimum inter-point distance among them is maximized (under Euclidean…

Computational Geometry · Computer Science 2025-11-04 Ke Chen , Adrian Dumitrescu

$k$-Center Clustering in Distributed Models

The $k$-center problem is a central optimization problem with numerous applications for machine learning, data mining, and communication networks. Despite extensive study in various scenarios, it surprisingly has not been thoroughly…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-26 Leyla Biabani , Ami Paz

K-Nearest Neighbor Approximation Via the Friend-of-a-Friend Principle

Suppose $V$ is an $n$-element set where for each $x \in V$, the elements of $V \setminus \{x\}$ are ranked by their similarity to $x$. The $K$-nearest neighbor graph is a directed graph including an arc from each $x$ to the $K$ points of $V…

Combinatorics · Mathematics 2020-12-29 Jacob D. Baron , R. W. R. Darling

An Improved k-Nearest Neighbor Algorithm for Text Categorization

k is the most important parameter in a text categorization system based on k-Nearest Neighbor algorithm (kNN).In the classification process, k nearest documents to the test one in the training set are determined firstly. Then, the…

Computation and Language · Computer Science 2007-05-23 Baoli Li , Shiwen Yu , Qin Lu

Distributionally Robust Weighted $k$-Nearest Neighbors

Learning a robust classifier from a few samples remains a key challenge in machine learning. A major thrust of research has been focused on developing $k$-nearest neighbor ($k$-NN) based algorithms combined with metric learning that…

Machine Learning · Statistics 2022-02-17 Shixiang Zhu , Liyan Xie , Minghe Zhang , Rui Gao , Yao Xie

Quantum K-nearest neighbor classification algorithm based on Hamming distance

K-nearest neighbor classification algorithm is one of the most basic algorithms in machine learning, which determines the sample's category by the similarity between samples. In this paper, we propose a quantum K-nearest neighbor…

Quantum Physics · Physics 2023-04-03 Jing Li , Song Lin , Yu Kai , Gongde Guo

Efficient Estimation of k for the Nearest Neighbors Class of Methods

The k Nearest Neighbors (kNN) method has received much attention in the past decades, where some theoretical bounds on its performance were identified and where practical optimizations were proposed for making it work fairly well in high…

Machine Learning · Computer Science 2016-06-14 Aleksander Lodwich , Faisal Shafait , Thomas Breuel

k*-Nearest Neighbors: From Global to Local

The weighted k-nearest neighbors algorithm is one of the most fundamental non-parametric methods in pattern recognition and machine learning. The question of setting the optimal number of neighbors as well as the optimal weights has…

Machine Learning · Statistics 2017-01-26 Oren Anava , Kfir Y. Levy

Comparison Based Nearest Neighbor Search

We consider machine learning in a comparison-based setting where we are given a set of points in a metric space, but we have no access to the actual distances between the points. Instead, we can only ask an oracle whether the distance…

Machine Learning · Statistics 2017-04-06 Siavash Haghiri , Debarghya Ghoshdastidar , Ulrike von Luxburg

Learned k-NN Distance Estimation

Big data mining is well known to be an important task for data science, because it can provide useful observations and new knowledge hidden in given large datasets. Proximity-based data analysis is particularly utilized in many real-life…

Databases · Computer Science 2022-11-29 Daichi Amagata , Yusuke Arai , Sumio Fujita , Takahiro Hara