Related papers: Efficient K-Nearest Neighbor Join Algorithms for H…

Efficient Processing of k Nearest Neighbor Joins using MapReduce

k nearest neighbor join (kNN join), designed to find k nearest neighbors from a dataset S for every object in another dataset R, is a primitive operation widely adopted by many data mining applications. As a combination of the k nearest…

Databases · Computer Science 2015-03-20 Wei Lu , Yanyan Shen , Su Chen , Beng Chin Ooi

Hybrid KNN-Join: Parallel Nearest Neighbor Searches Exploiting CPU and GPU Architectural Features

K Nearest Neighbor (KNN) joins are used in scientific domains for data analysis, and are building blocks of several well-known algorithms. KNN-joins find the KNN of all points in a dataset. This paper focuses on a hybrid CPU/GPU approach…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-11-19 Michael Gowanlock

On the Merge of k-NN Graph

k-nearest neighbor graph is a fundamental data structure in many disciplines such as information retrieval, data-mining, pattern recognition, and machine learning, etc. In the literature, considerable research has been focusing on how to…

Information Retrieval · Computer Science 2021-07-30 Wan-Lei Zhao , Hui Wang , Peng-Cheng Lin , Chong-Wah Ngo

Learned k-NN Distance Estimation

Big data mining is well known to be an important task for data science, because it can provide useful observations and new knowledge hidden in given large datasets. Proximity-based data analysis is particularly utilized in many real-life…

Databases · Computer Science 2022-11-29 Daichi Amagata , Yusuke Arai , Sumio Fujita , Takahiro Hara

Efficient Estimation of k for the Nearest Neighbors Class of Methods

The k Nearest Neighbors (kNN) method has received much attention in the past decades, where some theoretical bounds on its performance were identified and where practical optimizations were proposed for making it work fairly well in high…

Machine Learning · Computer Science 2016-06-14 Aleksander Lodwich , Faisal Shafait , Thomas Breuel

Scalable Nearest Neighbor Search based on kNN Graph

Nearest neighbor search is known as a challenging issue that has been studied for several decades. Recently, this issue becomes more and more imminent in viewing that the big data problem arises from various fields. In this paper, a…

Computer Vision and Pattern Recognition · Computer Science 2017-02-06 Wan-Lei Zhao , Jie Yang , Cheng-Hao Deng

Approximate Nearest Neighbour Search on Dynamic Datasets: An Investigation

Approximate k-Nearest Neighbour (ANN) methods are often used for mining information and aiding machine learning on large scale high-dimensional datasets. ANN methods typically differ in the index structure used for accelerating searches,…

Machine Learning · Computer Science 2025-02-04 Ben Harwood , Amir Dezfouli , Iadine Chades , Conrad Sanderson

An enhanced statistical feature fusion approach using an improved distance evaluation algorithm and weighted K-nearest neighbor for bearing fault diagnosis

Bearings are among the most failure-prone components in rotating machinery, and their condition directly impacts overall performance. Therefore, accurately diagnosing bearing faults is essential for ensuring system stability. However,…

Signal Processing · Electrical Eng. & Systems 2025-09-26 Amir Eshaghi Chaleshtori , Abdollah Aghaie

Neighborhood and Graph Constructions using Non-Negative Kernel Regression

Data-driven neighborhood definitions and graph constructions are often used in machine learning and signal processing applications. k-nearest neighbor~(kNN) and $\epsilon$-neighborhood methods are among the most common methods used for…

Machine Learning · Computer Science 2023-04-18 Sarath Shekkizhar , Antonio Ortega

Revisiting $k$-Nearest Neighbor Graph Construction on High-Dimensional Data : Experiments and Analyses

The $k$-nearest neighbor graph (KNNG) on high-dimensional data is a data structure widely used in many applications such as similarity search, dimension reduction and clustering. Due to its increasing popularity, several methods under the…

Data Structures and Algorithms · Computer Science 2021-12-07 Liu Yingfan , Cheng Hong , Cui Jiangtao

Leveraging Reinforcement Learning for evaluating Robustness of KNN Search Algorithms

The problem of finding K-nearest neighbors in the given dataset for a given query point has been worked upon since several years. In very high dimensional spaces the K-nearest neighbor search (KNNS) suffers in terms of complexity in…

Machine Learning · Computer Science 2021-02-15 Pramod Vadiraja , Christoph Peter Balada

High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations

Approximate K nearest neighbor (AKNN) search is a fundamental and challenging problem. We observe that in high-dimensional space, the time consumption of nearly all AKNN algorithms is dominated by that of the distance comparison operations…

Data Structures and Algorithms · Computer Science 2023-03-20 Jianyang Gao , Cheng Long

Accelerated Search for Non-Negative Greedy Sparse Decomposition via Dimensionality Reduction

Non-negative signals form an important class of sparse signals. Many algorithms have already beenproposed to recover such non-negative representations, where greedy and convex relaxed algorithms are among the most popular methods. One fast…

Signal Processing · Electrical Eng. & Systems 2020-06-09 Konstantinos Voulgaris , Mike E. Davies , Mehrdad Yaghoobi

A new hashing based nearest neighbors selection technique for big datasets

KNN has the reputation to be the word simplest but efficient supervised learning algorithm used for either classification or regression. KNN prediction efficiency highly depends on the size of its training data but when this training data…

Machine Learning · Computer Science 2021-07-01 Jude Tchaye-Kondi , Yanlong Zhai , Liehuang Zhu

Multiple Index Merge for Approximate Nearest Neighbor Search

Approximate $k$ nearest neighbor (AKNN) search in high-dimensional space is a foundational problem in vector databases with widespread applications. Among the numerous AKNN indexes, Proximity Graph-based indexes achieve state-of-the-art…

Databases · Computer Science 2026-02-20 Liuchang Jing , Mingyu Yang , Lei Li , Jianbin Qin , Wei Wang

Efficient Estimation of the number of neighbours in Probabilistic K Nearest Neighbour Classification

Probabilistic k-nearest neighbour (PKNN) classification has been introduced to improve the performance of original k-nearest neighbour (KNN) classification algorithm by explicitly modelling uncertainty in the classification of each feature…

Machine Learning · Computer Science 2013-05-07 Ji Won Yoon , Nial Friel

Rapid Near-Neighbor Interaction of High-dimensional Data via Hierarchical Clustering

Calculation of near-neighbor interactions among high dimensional, irregularly distributed data points is a fundamental task to many graph-based or kernel-based machine learning algorithms and applications. Such calculations, involving…

Machine Learning · Computer Science 2017-09-13 Nikos Pitsianis , Dimitris Floros , Alexandros-Stavros Iliopoulos , Kostas Mylonakis , Nikos Sismanis , Xiaobai Sun

Fast k-Nearest Neighbour Search via Dynamic Continuous Indexing

Existing methods for retrieving k-nearest neighbours suffer from the curse of dimensionality. We argue this is caused in part by inherent deficiencies of space partitioning, which is the underlying strategy used by most existing methods. We…

Data Structures and Algorithms · Computer Science 2017-04-07 Ke Li , Jitendra Malik

Fast $k$-NNG construction with GPU-based quick multi-select

In this paper we describe a new brute force algorithm for building the $k$-Nearest Neighbor Graph ($k$-NNG). The $k$-NNG algorithm has many applications in areas such as machine learning, bio-informatics, and clustering analysis. While…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-06-17 Ivan Komarov , Ali Dashti , Roshan D'Souza

Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search

High-dimensional approximate $K$ nearest neighbor search (AKNN) is a fundamental task for various applications, including information retrieval. Most existing algorithms for AKNN can be decomposed into two main components, i.e., candidate…

Databases · Computer Science 2024-12-03 Liwei Deng , Penghao Chen , Ximu Zeng , Tianfu Wang , Yan Zhao , Kai Zheng