Related papers: Bilinear Random Projections for Locality-Sensitive…

Bilinear Supervised Hashing Based on 2D Image Features

Hashing has been recognized as an efficient representation learning method to effectively handle big data due to its low computational complexity and memory cost. Most of the existing hashing methods focus on learning the low-dimensional…

Computer Vision and Pattern Recognition · Computer Science 2019-01-08 Yujuan Ding , Wai Kueng Wong , Zhihui Lai , Zheng Zhang

Improving Locality Sensitive Hashing by Efficiently Finding Projected Nearest Neighbors

Similarity search in high-dimensional spaces is an important task for many multimedia applications. Due to the notorious curse of dimensionality, approximate nearest neighbor techniques are preferred over exact searching techniques since…

Databases · Computer Science 2020-10-16 Omid Jafari , Parth Nagarkar , Jonathan Montaño

Range-efficient consistent sampling and locality-sensitive hashing for polygons

Locality-sensitive hashing (LSH) is a fundamental technique for similarity search and similarity estimation in high-dimensional spaces. The basic idea is that similar objects should produce hash collisions with probability significantly…

Computational Geometry · Computer Science 2017-09-25 Joachim Gudmundsson , Rasmus Pagh

Binary embeddings with structured hashed projections

We consider the hashing mechanism for constructing binary embeddings, that involves pseudo-random projections followed by nonlinear (sign function) mappings. The pseudo-random projection is described by a matrix, where not all entries are…

Machine Learning · Computer Science 2016-07-04 Anna Choromanska , Krzysztof Choromanski , Mariusz Bojarski , Tony Jebara , Sanjiv Kumar , Yann LeCun

Fast Locality Sensitive Hashing with Theoretical Guarantee

Locality-sensitive hashing (LSH) is an effective randomized technique widely used in many machine learning tasks. The cost of hashing is proportional to data dimensions, and thus often the performance bottleneck when dimensionality is high…

Machine Learning · Computer Science 2023-09-28 Zongyuan Tan , Hongya Wang , Bo Xu , Minjie Luo , Ming Du

Density Sensitive Hashing

Nearest neighbors search is a fundamental problem in various research fields like machine learning, data mining and pattern recognition. Recently, hashing-based approaches, e.g., Locality Sensitive Hashing (LSH), are proved to be effective…

Information Retrieval · Computer Science 2012-05-15 Yue Lin , Deng Cai , Cheng Li

Anti-sparse coding for approximate nearest neighbor search

This paper proposes a binarization scheme for vectors of high dimension based on the recent concept of anti-sparse coding, and shows its excellent performance for approximate nearest neighbor search. Unlike other binarization schemes, this…

Computer Vision and Pattern Recognition · Computer Science 2011-10-27 Hervé Jégou , Teddy Furon , Jean-Jacques Fuchs

Compact Hyperplane Hashing with Bilinear Functions

Hyperplane hashing aims at rapidly searching nearest points to a hyperplane, and has shown practical impact in scaling up active learning with SVMs. Unfortunately, the existing randomized methods need long hash codes to achieve reasonable…

Machine Learning · Computer Science 2012-06-22 Wei Liu , Jun Wang , Yadong Mu , Sanjiv Kumar , Shih-Fu Chang

Projected Hamming Dissimilarity for Bit-Level Importance Coding in Collaborative Filtering

When reasoning about tasks that involve large amounts of data, a common approach is to represent data items as objects in the Hamming space where operations can be done efficiently and effectively. Object similarity can then be computed by…

Information Retrieval · Computer Science 2021-03-29 Christian Hansen , Casper Hansen , Jakob Grue Simonsen , Christina Lioma

2-Bit Random Projections, NonLinear Estimators, and Approximate Near Neighbor Search

The method of random projections has become a standard tool for machine learning, data mining, and search with massive data at Web scale. The effective use of random projections requires efficient coding schemes for quantizing (real-valued)…

Machine Learning · Statistics 2016-02-23 Ping Li , Michael Mitzenmacher , Anshumali Shrivastava

Improving LSH via Tensorized Random Projection

Locality sensitive hashing (LSH) is a fundamental algorithmic toolkit used by data scientists for approximate nearest neighbour search problems that have been used extensively in many large scale data processing applications such as near…

Machine Learning · Statistics 2025-03-04 Bhisham Dev Verma , Rameshwar Pratap

Efficient Bitmap-based Indexing and Retrieval of Similarity Search Image Queries

Finding similar images is a necessary operation in many multimedia applications. Images are often represented and stored as a set of high-dimensional features, which are extracted using localized feature extraction algorithms. Locality…

Multimedia · Computer Science 2020-10-16 Omid Jafari , Parth Nagarkar , Jonathan Montaño

Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and Beyond

Computing approximate nearest neighbors in high dimensional spaces is a central problem in large-scale data mining with a wide range of applications in machine learning and data science. A popular and effective technique in computing…

Machine Learning · Computer Science 2019-10-29 Lin Chen , Hossein Esfandiari , Thomas Fu , Vahab S. Mirrokni

Learning to hash with semantic similarity metrics and empirical KL divergence

Learning to hash is an efficient paradigm for exact and approximate nearest neighbor search from massive databases. Binary hash codes are typically extracted from an image by rounding output features from a CNN, which is trained on a…

Machine Learning · Computer Science 2020-05-12 Heikki Arponen , Tom E. Bishop

Experimental Analysis of Machine Learning Techniques for Finding Search Radius in Locality Sensitive Hashing

Finding similar data in high-dimensional spaces is one of the important tasks in multimedia applications. Approaches introduced to find exact searching techniques often use tree-based index structures which are known to suffer from the…

Databases · Computer Science 2022-11-17 Omid Jafari , Parth Nagarkar

Experimental Analysis of Locality Sensitive Hashing Techniques for High-Dimensional Approximate Nearest Neighbor Searches

Finding nearest neighbors in high-dimensional spaces is a fundamental operation in many multimedia retrieval applications. Exact tree-based indexing approaches are known to suffer from the notorious curse of dimensionality for…

Databases · Computer Science 2021-02-16 Omid Jafari , Parth Nagarkar

Distance-Sensitive hashing

Locality-sensitive hashing (LSH) is an important tool for managing high-dimensional noisy or uncertain data, for example in connection with data cleaning (similarity join) and noise-robust search (similarity search). However, for a number…

Data Structures and Algorithms · Computer Science 2018-04-18 Martin Aumüller , Tobias Christiani , Rasmus Pagh , Francesco Silvestri

LSH on the Hypercube Revisited

LSH (locality sensitive hashing) had emerged as a powerful technique in nearest-neighbor search in high dimensions [IM98, HIM12]. Given a point set $P$ in a metric space, and given parameters $r$ and $\varepsilon > 0$, the task is to…

Computational Geometry · Computer Science 2017-04-11 Sariel Har-Peled , Sepideh Mahabadi

Locality Sensitive Hashing for Efficient Similar Polygon Retrieval

Locality Sensitive Hashing (LSH) is an effective method of indexing a set of items to support efficient nearest neighbors queries in high-dimensional spaces. The basic idea of LSH is that similar items should produce hash collisions with…

Data Structures and Algorithms · Computer Science 2021-02-22 Haim Kaplan , Jay Tenenbaum

Locality Sensitive Hashing for Set-Queries, Motivated by Group Recommendations

Locality Sensitive Hashing (LSH) is an effective method to index a set of points such that we can efficiently find the nearest neighbors of a query point. We extend this method to our novel Set-query LSH (SLSH), such that it can find the…

Data Structures and Algorithms · Computer Science 2020-04-23 Haim Kaplan , Jay Tenenbaum