English
Related papers

Related papers: Accelerated Distance Computation with Encoding Tre…

200 papers

Vectors of data are at the heart of machine learning and data mining. Recently, vector quantization methods have shown great promise in reducing both the time and space costs of operating on vectors. We introduce a vector quantization…

Performance · Computer Science 2017-07-03 Davis W Blalock , John V Guttag

Visual localization algorithms have achieved significant improvements in performance thanks to recent advances in camera technology and vision-based techniques. However, there remains one critical caveat: all current approaches that are…

Computer Vision and Pattern Recognition · Computer Science 2019-07-01 Huu Le , Tuan Hoang , Michael Milford

Recent indexing techniques inspired by source coding have been shown successful to index billions of high-dimensional vectors in memory. In this paper, we propose an approach that re-ranks the neighbor hypotheses obtained by these…

Information Retrieval · Computer Science 2011-02-21 Hervé Jégou , Romain Tavenard , Matthijs Douze , Laurent Amsaleg

A simple and computationally efficient scheme for tree-structured vector quantization is presented. Unlike previous methods, its quantization error depends only on the intrinsic dimension of the data distribution, rather than the apparent…

Machine Learning · Statistics 2008-05-12 Sanjoy Dasgupta , Yoav Freund

Kondo et al. (DS 2014) proposed methods for computing distances between unordered rooted trees by transforming an instance of the distance computing problem into an instance of the integer programming problem. They showed that the tree edit…

Data Structures and Algorithms · Computer Science 2017-06-13 Eunpyeong Hong , Yasuaki Kobayashi , Akihiro Yamamoto

We propose and study a multi-scale approach to vector quantization. We develop an algorithm, dubbed reconstruction trees, inspired by decision trees. Here the objective is parsimonious reconstruction of unsupervised data, rather than…

Machine Learning · Computer Science 2019-09-05 Enrico Cecini , Ernesto De Vito , Lorenzo Rosasco

Contour trees offer an abstract representation of the level set topology in scalar fields and are widely used in topological data analysis and visualization. However, applying contour trees to large-scale scientific datasets remains…

Computational Geometry · Computer Science 2025-08-13 Mingzhe Li , Hamish Carr , Oliver Rübel , Bei Wang , Gunther H. Weber

In many important applications -- such as search engines and relational database systems -- data is stored in the form of arrays of integers. Encoding and, most importantly, decoding of these arrays consumes considerable CPU time.…

Information Retrieval · Computer Science 2021-02-02 Daniel Lemire , Leonid Boytsov

Finding meaningful distances between high-dimensional data samples is an important scientific task. To this end, we propose a new tree-Wasserstein distance (TWD) for high-dimensional data with two key aspects. First, our TWD is specifically…

Machine Learning · Computer Science 2025-02-25 Ya-Wei Eileen Lin , Ronald R. Coifman , Gal Mishne , Ronen Talmon

We introduce a semiparametric approach to neighbor-based classification. We build off the recently proposed Boundary Trees algorithm by Mathy et al.(2015) which enables fast neighbor-based classification, regression and retrieval in large…

Machine Learning · Computer Science 2018-10-29 Tharindu Adikari , Stark C. Draper

Many modern search domains comprise high-dimensional vectors of floating point numbers derived from neural networks, in the form of embeddings. Typical embeddings range in size from hundreds to thousands of dimensions, making the size of…

Machine Learning · Computer Science 2025-06-03 Richard Connor , Alan Dearle , Ben Claydon

City modeling and generation have attracted an increased interest in various applications, including gaming, urban planning, and autonomous driving. Unlike previous works focused on the generation of single objects or indoor scenes, the…

Computer Vision and Pattern Recognition · Computer Science 2023-09-29 Wenyu Han , Congcong Wen , Lazarus Chok , Yan Liang Tan , Sheung Lung Chan , Hang Zhao , Chen Feng

Similarity search retrieves the nearest neighbors of a query vector from a dataset of high-dimensional vectors. As the size of the dataset grows, the cost of performing the distance computations needed to implement a query can become…

Machine Learning · Computer Science 2019-12-20 Soroosh Khoram , Stephen J Wright , Jing Li

In general, the clustering problem is NP-hard, and global optimality cannot be established for non-trivial instances. For high-dimensional data, distance-based methods for clustering or classification face an additional difficulty, the…

Statistics Theory · Mathematics 2016-04-26 Tsvetan Asamov , Adi Ben-Israel

Quantum network communication is challenging, as the No-cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable communication approach is teleportation of quantum…

Quantum Physics · Physics 2024-04-08 Mohammad Ghaderibaneh , Caitao Zhan , Himanshu Gupta , C. R. Ramakrishnan

Metric learning has the aim to improve classification accuracy by learning a distance measure which brings data points from the same class closer together and pushes data points from different classes further apart. Recent research has…

Machine Learning · Computer Science 2018-05-21 Benjamin Paaßen

This paper tackles the problem of finding optimal variable-height transport packaging. The goal is to reduce the empty space left in a box when shipping goods to customers, thereby saving on filler and reducing waste. We cast this problem…

Optimization and Control · Mathematics 2023-08-03 Alain Lehmann , Wilhelm Kleiminger , Hakim Invernizzi , Aurel Gautschi

Vector quantization-based approaches are successful to solve Approximate Nearest Neighbor (ANN) problems which are critical to many applications. The idea is to generate effective encodings to allow fast distance approximation. We propose…

Computer Vision and Pattern Recognition · Computer Science 2015-09-18 Shicong Liu , Junru Shao , Hongtao Lu

We propose an extension of tree-based space-partitioning indexing structures for data with low intrinsic dimensionality embedded in a high dimensional space. We call this extension an Angle Tree. Our extension can be applied to both…

Data Structures and Algorithms · Computer Science 2010-04-19 Ilia Zvedeniouk , Sanjay Chawla

In a distributed information application an encoder compresses an arbitrary vector while a similar reference vector is available to the decoder as side information. For the Hamming-distance similarity measure, and when guaranteed perfect…

Information Theory · Computer Science 2020-09-08 Yuval Cassuto , Jacob Ziv
‹ Prev 1 2 3 10 Next ›