Related papers: Approximating Nearest Neighbor Distances

Approximating the Euclidean circle in the square grid using neighbourhood sequences

Distance measuring is a very important task in digital geometry and digital image processing. Due to our natural approach to geometry we think of the set of points that are equally far from a given point as a Euclidean circle. Using the…

Metric Geometry · Mathematics 2010-06-18 Janos Farkas, Szabolcs Bajak, Benedek Nagy

Approximate Nearest Neighbor Searching with Non-Euclidean and Weighted Distances

We present a new approach to approximate nearest-neighbor queries in fixed dimension under a variety of non-Euclidean distances. We are given a set $S$ of $n$ points in $\mathbb{R}^d$, an approximation parameter $\varepsilon > 0$, and a…

Computational Geometry · Computer Science 2023-06-28 Ahmed Abdelkader, Sunil Arya, Guilherme D. da Fonseca, David M. Mount

Learning Nearest Neighbor Graphs from Noisy Distance Samples

We consider the problem of learning the nearest neighbor graph of a dataset of n items. The metric is unknown, but we can query an oracle to obtain a noisy estimate of the distance between any pair of items. This framework applies to…

Machine Learning · Statistics 2019-06-03 Blake Mason, Ardhendu Tripathy, Robert Nowak

Metric Embedding for Nearest Neighbor Classification

The distance metric plays an important role in nearest neighbor (NN) classification. Usually the Euclidean distance metric is assumed or a Mahalanobis distance metric is optimized to improve the NN performance. In this paper, we study the…

Machine Learning · Statistics 2007-06-26 Bharath K. Sriperumbudur, Gert R. G. Lanckriet

A quantum k-nearest neighbors algorithm based on the Euclidean distance estimation

The k-nearest neighbors (k-NN) is a basic machine learning (ML) algorithm, and several quantum versions of it, employing different distance metrics, have been presented in the last few years. Although the Euclidean distance is one of the…

Emerging Technologies · Computer Science 2024-04-25 Enrico Zardini, Enrico Blanzieri, Davide Pastorello

Quantum Algorithms for Nearest-Neighbor Methods for Supervised and Unsupervised Learning

We present several quantum algorithms for performing nearest-neighbor learning. At the core of our algorithms are fast and coherent quantum methods for computing distance metrics such as the inner product and Euclidean distance. We prove…

Quantum Physics · Physics 2014-12-12 Nathan Wiebe, Ashish Kapoor, Krysta Svore

Euclidean TSP, Motorcycle Graphs, and Other New Applications of Nearest-Neighbor Chains

We show new applications of the nearest-neighbor chain algorithm, a technique that originated in agglomerative hierarchical clustering. We apply it to a diverse class of geometric problems: we construct the greedy multi-fragment tour for…

Computational Geometry · Computer Science 2019-12-04 Nil Mamano, Alon Efrat, David Eppstein, Daniel Frishberg, Michael Goodrich, Stephen Kobourov, Pedro Matias, Valentin Polishchuk

Exact Computation of a Manifold Metric, via Lipschitz Embeddings and Shortest Paths on a Graph

Data-sensitive metrics adapt distances locally based on the density of data points with the goal of aligning distances and some notion of similarity. In this paper, we give the first exact algorithm for computing a data-sensitive metric called…

Computational Geometry · Computer Science 2020-04-22 Timothy Chu, Gary Miller, Donald Sheehy

Geodesic Distance Estimation with Spherelets

Many statistical and machine learning approaches rely on pairwise distances between data points. The choice of distance metric has a fundamental impact on performance of these procedures, raising questions about how to appropriately…

Statistics Theory · Mathematics 2020-04-20 Didong Li, David B Dunson

Metric Spaces with Expensive Distances

In algorithms for finite metric spaces, it is common to assume that the distance between two points can be computed in constant time, and complexity bounds are expressed only in terms of the number of points of the metric space. We…

Computational Geometry · Computer Science 2019-01-28 Michael Kerber, Arnur Nigmetov

How to Design Robust Algorithms using Noisy Comparison Oracle

Metric-based comparison operations such as finding the maximum, nearest and farthest neighbor are fundamental to studying various clustering techniques such as $k$-center clustering and agglomerative hierarchical clustering. These techniques…

Data Structures and Algorithms · Computer Science 2021-05-13 Raghavendra Addanki, Sainyam Galhotra, Barna Saha

Approximate Weighted Farthest Neighbors and Minimum Dilation Stars

We provide an efficient reduction from the problem of querying approximate multiplicatively weighted farthest neighbors in a metric space to the unweighted problem. Combining our techniques with core-sets for approximate unweighted farthest…

Computational Geometry · Computer Science 2009-09-29 John Augustine, David Eppstein, Kevin A. Wortman

Metric Learning on Manifolds

Recent literature has shown that symbolic data, such as text and graphs, is often better represented by points on a curved manifold, rather than in Euclidean space. However, geometrical operations on manifolds are generally more complicated…

Machine Learning · Computer Science 2019-02-06 Max Aalto, Nakul Verma

Rates of Convergence for Nearest Neighbor Classification

Nearest neighbor methods are a popular class of nonparametric estimators with several desirable properties, such as adaptivity to different distance scales in different regions of space. Prior work on convergence rates for nearest neighbor…

Machine Learning · Computer Science 2014-07-03 Kamalika Chaudhuri, Sanjoy Dasgupta

Proximity Graphs for Similarity Search: Fast Construction, Lower Bounds, and Euclidean Separation

Proximity graph-based methods have emerged as a leading paradigm for approximate nearest neighbor (ANN) search in the system community. This paper presents fresh insights into the theoretical foundation of these methods. We describe an…

Data Structures and Algorithms · Computer Science 2025-09-10 Shangqi Lu, Yufei Tao

A Complex Networks Approach for Data Clustering

Many methods have been developed for data clustering, such as k-means, expectation maximization and algorithms based on graph theory. In this latter case, graphs are generally constructed by taking into account the Euclidean distance as a…

Data Analysis, Statistics and Probability · Physics 2011-01-27 Francisco A. Rodrigues, Guilherme Ferraz de Arruda, Luciano da Fontoura Costa

Products of Euclidean metrics and applications to proximity questions among curves

The problem of Approximate Nearest Neighbor (ANN) search is fundamental in computer science and has benefited from significant progress in the past couple of decades. However, most work has been devoted to pointsets whereas complex shapes…

Computational Geometry · Computer Science 2020-04-14 Ioannis Z. Emiris, Ioannis Psarros

On Coresets for Fair Clustering in Metric and Euclidean Spaces and Their Applications

Fair clustering is a constrained variant of clustering where the goal is to partition a set of colored points, such that the fraction of points of any color in every cluster is more or less equal to the fraction of points of this color in…

Data Structures and Algorithms · Computer Science 2020-07-21 Sayan Bandyapadhyay, Fedor V. Fomin, Kirill Simonov

Dynamic Feature Scaling for K-Nearest Neighbor Algorithm

The Nearest Neighbors algorithm is a lazy learning algorithm that tries to approximate predictions with the help of similar existing vectors in the training dataset. The predictions made by the K-Nearest Neighbors…

Machine Learning · Computer Science 2018-11-14 Chandrasekaran Anirudh Bhardwaj, Megha Mishra, Kalyani Desikan

Crowdsourced correlation clustering with relative distance comparisons

Crowdsourced, or human-computation-based, clustering algorithms usually rely on relative distance comparisons, as these are easier to elicit from human workers than absolute distance information. A relative distance comparison is a statement…

Data Structures and Algorithms · Computer Science 2017-09-26 Antti Ukkonen