Related papers: A Heterogeneous High Dimensional Approximate Neare…

Approximate Nearest Neighbor Search in High Dimensions

The nearest neighbor problem is defined as follows: Given a set $P$ of $n$ points in some metric space $(X,D)$, build a data structure that, given any point $q$, returns a point in $P$ that is closest to $q$ (its "nearest neighbor" in $P$).…

Data Structures and Algorithms · Computer Science 2018-06-27 Alexandr Andoni , Piotr Indyk , Ilya Razenshteyn

Approximate Nearest Neighbor Search for Low Dimensional Queries

We study the Approximate Nearest Neighbor problem for metric spaces where the query points are constrained to lie on a subspace of low doubling dimension, while the data is high-dimensional. We show that this problem can be solved…

Computational Geometry · Computer Science 2012-09-19 Sariel Har-Peled , Nirman Kumar

Proximity in the Age of Distraction: Robust Approximate Nearest Neighbor Search

We introduce a new variant of the nearest neighbor search problem, which allows for some coordinates of the dataset to be arbitrarily corrupted or unknown. Formally, given a dataset of $n$ points $P=\{ x_1,\ldots, x_n\}$ in high-dimensions,…

Computational Geometry · Computer Science 2015-11-24 Sariel Har-Peled , Sepideh Mahabadi

Bucketing Coding and Information Theory for the Statistical High Dimensional Nearest Neighbor Problem

Consider the problem of finding high dimensional approximate nearest neighbors, where the data is generated by some known probabilistic model. We will investigate a large natural class of algorithms which we call bucketing codes. We will…

Information Theory · Computer Science 2016-11-17 Moshe Dubiner

Sampling a Near Neighbor in High Dimensions -- Who is the Fairest of Them All?

Similarity search is a fundamental algorithmic primitive, widely used in many computer science disciplines. Given a set of points $S$ and a radius parameter $r>0$, the $r$-near neighbor ($r$-NN) problem asks for a data structure that, given…

Data Structures and Algorithms · Computer Science 2021-01-27 Martin Aumüller , Sariel Har-Peled , Sepideh Mahabadi , Rasmus Pagh , Francesco Silvestri

Fair Near Neighbor Search: Independent Range Sampling in High Dimensions

Similarity search is a fundamental algorithmic primitive, widely used in many computer science disciplines. There are several variants of the similarity search problem, and one of the most relevant is the $r$-near neighbor ($r$-NN) problem:…

Data Structures and Algorithms · Computer Science 2020-06-16 Martin Aumüller , Rasmus Pagh , Francesco Silvestri

Entropy based Nearest Neighbor Search in High Dimensions

In this paper we study the problem of finding the approximate nearest neighbor of a query point in the high dimensional space, focusing on the Euclidean space. The earlier approaches use locality-preserving hash functions (that tend to map…

Data Structures and Algorithms · Computer Science 2007-05-23 Rina Panigrahy

Approximate Nearest Neighbors in Limited Space

We consider the $(1+\epsilon)$-approximate nearest neighbor search problem: given a set $X$ of $n$ points in a $d$-dimensional space, build a data structure that, given any query point $y$, finds a point $x \in X$ whose distance to $y$ is…

Data Structures and Algorithms · Computer Science 2018-07-03 Piotr Indyk , Tal Wagner

Dynamic Feature Scaling for K-Nearest Neighbor Algorithm

Nearest Neighbors Algorithm is a Lazy Learning Algorithm, in which the algorithm tries to approximate the predictions with the help of similar existing vectors in the training dataset. The predictions made by the K-Nearest Neighbors…

Machine Learning · Computer Science 2018-11-14 Chandrasekaran Anirudh Bhardwaj , Megha Mishra , Kalyani Desikan

Quantum K-nearest neighbor classification algorithm based on Hamming distance

K-nearest neighbor classification algorithm is one of the most basic algorithms in machine learning, which determines the sample's category by the similarity between samples. In this paper, we propose a quantum K-nearest neighbor…

Quantum Physics · Physics 2023-04-03 Jing Li , Song Lin , Yu Kai , Gongde Guo

Randomized approximate nearest neighbor search with limited adaptivity

We study the fundamental problem of approximate nearest neighbor search in $d$-dimensional Hamming space $\{0,1\}^d$. We study the complexity of the problem in the famous cell-probe model, a classic model for data structures. We consider…

Data Structures and Algorithms · Computer Science 2016-02-16 Mingmou Liu , Xiaoyin Pan , Yitong Yin

Fast k-Nearest Neighbour Search via Dynamic Continuous Indexing

Existing methods for retrieving k-nearest neighbours suffer from the curse of dimensionality. We argue this is caused in part by inherent deficiencies of space partitioning, which is the underlying strategy used by most existing methods. We…

Data Structures and Algorithms · Computer Science 2017-04-07 Ke Li , Jitendra Malik

Comparison Based Nearest Neighbor Search

We consider machine learning in a comparison-based setting where we are given a set of points in a metric space, but we have no access to the actual distances between the points. Instead, we can only ask an oracle whether the distance…

Machine Learning · Statistics 2017-04-06 Siavash Haghiri , Debarghya Ghoshdastidar , Ulrike von Luxburg

Approximate Nearest Neighbor Searching with Non-Euclidean and Weighted Distances

We present a new approach to approximate nearest-neighbor queries in fixed dimension under a variety of non-Euclidean distances. We are given a set $S$ of $n$ points in $\mathbb{R}^d$, an approximation parameter $\varepsilon > 0$, and a…

Computational Geometry · Computer Science 2023-06-28 Ahmed Abdelkader , Sunil Arya , Guilherme D. da Fonseca , David M. Mount

K-Nearest Neighbor Approximation Via the Friend-of-a-Friend Principle

Suppose $V$ is an $n$-element set where for each $x \in V$, the elements of $V \setminus \{x\}$ are ranked by their similarity to $x$. The $K$-nearest neighbor graph is a directed graph including an arc from each $x$ to the $K$ points of $V…

Combinatorics · Mathematics 2020-12-29 Jacob D. Baron , R. W. R. Darling

Implicit Sparse Code Hashing

We address the problem of converting large-scale high-dimensional image data into binary codes so that approximate nearest-neighbor search over them can be efficiently performed. Different from most of the existing unsupervised approaches…

Computer Vision and Pattern Recognition · Computer Science 2015-12-02 Tsung-Yu Lin , Tsung-Wei Ke , Tyng-Luh Liu

Adaptive Estimation for Approximate k-Nearest-Neighbor Computations

Algorithms often carry out equally many computations for "easy" and "hard" problem instances. In particular, algorithms for finding nearest neighbors typically have the same running time regardless of the particular problem instance. In…

Data Structures and Algorithms · Computer Science 2020-03-25 Daniel LeJeune , Richard G. Baraniuk , Reinhard Heckel

Aren't we all nearest neighbors: Spatial trees, high dimensional reductions and batch nearest neighbor search

We start with a review of the pervasiveness of the nearest neighbor search problem and techniques used to solve it along with some experimental results. In the second chapter, we show reductions between two different classes of geo- metric…

Computational Geometry · Computer Science 2015-07-14 Mark Saroufim

Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction

Dimensionality reduction is crucial both for visualization and preprocessing high dimensional data for machine learning. We introduce a novel method based on a hierarchy built on 1-nearest neighbor graphs in the original space which is used…

Computer Vision and Pattern Recognition · Computer Science 2022-05-31 M. Saquib Sarfraz , Marios Koulakis , Constantin Seibold , Rainer Stiefelhagen

A Faster Algorithm for Finding Closest Pairs in Hamming Metric

We study the Closest Pair Problem in Hamming metric, which asks to find the pair with the smallest Hamming distance in a collection of binary vectors. We give a new randomized algorithm for the problem on uniformly random input…

Data Structures and Algorithms · Computer Science 2021-12-08 Andre Esser , Robert Kübler , Floyd Zweydinger