Related papers: Supermodular Locality Sensitive Hashes

200 papers

Data similarity (or distance) computation is a fundamental research topic which fosters a variety of similarity-based machine learning and data mining applications. In big data analytics, it is impractical to compute the exact similarity of…

Data Structures and Algorithms · Computer Science 2025-03-12 Wei Wu, Bin Li

Locality-sensitive hashing (LSH) is a fundamental algorithmic technique widely employed in large-scale data processing applications, such as nearest-neighbor search, entity resolution, and clustering. However, its applicability in some…

Information Retrieval · Computer Science 2024-02-01 Runhui Wang, Luyang Kong, Yefan Tao, Andrew Borthwick, Davor Golac, Henrik Johnson, Shadie Hijazi, Dong Deng, Yongfeng Zhang

Locality Sensitive Hashing (LSH) is an effective method to index a set of points such that we can efficiently find the nearest neighbors of a query point. We extend this method to our novel Set-query LSH (SLSH), such that it can find the…

Data Structures and Algorithms · Computer Science 2020-04-23 Haim Kaplan, Jay Tenenbaum

Scalar field comparison is a fundamental task in scientific visualization. In topological data analysis, we compare topological descriptors of scalar fields -- such as persistence diagrams and merge trees -- because they provide succinct…

Computational Geometry · Computer Science 2024-09-18 Weiran Lyu, Raghavendra Sridharamurthy, Jeff M. Phillips, Bei Wang

Locality-sensitive hashing [Indyk, Motwani'98] is a classical data structure for approximate nearest neighbor search. It allows, after a close to linear time preprocessing of the input dataset, to find an approximately nearest neighbor of…

Data Structures and Algorithms · Computer Science 2024-06-18 Michael Kapralov, Mikhail Makarov, Christian Sohler

MinHash and SimHash are the two widely adopted Locality Sensitive Hashing (LSH) algorithms for large-scale data processing applications. Deciding which LSH to use for a particular problem at hand is an important question, which has no clear…

Computation · Statistics 2014-07-17 Anshumali Shrivastava, Ping Li

The facility location problem is widely used for summarizing large datasets and has additional applications in sensor placement, image retrieval, and clustering. One difficulty of this problem is that submodular optimization algorithms…

Machine Learning · Statistics 2017-03-09 Erik M. Lindgren, Shanshan Wu, Alexandros G. Dimakis

Locality-sensitive hashing (LSH) is a fundamental technique for similarity search and similarity estimation in high-dimensional spaces. The basic idea is that similar objects should produce hash collisions with probability significantly…

Computational Geometry · Computer Science 2017-09-25 Joachim Gudmundsson, Rasmus Pagh

With the advantage of low storage cost and high retrieval efficiency, hashing techniques have recently been an emerging topic in cross-modal similarity search. As multiple modal data reflect similar semantic content, many researches aim at…

Machine Learning · Computer Science 2019-04-19 Jun Yu, Xiao-Jun Wu, Josef Kittler

Locality sensitive hashing (LSH) is a powerful tool for sublinear-time approximate nearest neighbor search, and a variety of hashing schemes have been proposed for different dissimilarity measures. However, hash codes significantly depend…

Similarity search is critical for many database applications, including the increasingly popular online services for Content-Based Multimedia Retrieval (CBMR). These services, which include image search engines, must handle an overwhelming…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-10-16 Thiago S. F. X. Teixeira, George Teodoro, Eduardo Valle, Joel H. Saltz

Finding nearest neighbors in high-dimensional spaces is a fundamental operation in many diverse application domains. Locality Sensitive Hashing (LSH) is one of the most popular techniques for finding approximate nearest neighbor searches in…

Databases · Computer Science 2021-02-18 Omid Jafari, Preeti Maurya, Parth Nagarkar, Khandker Mushfiqul Islam, Chidambaram Crushev

Cross-modal hashing is an important approach for multimodal data management and application. Existing unsupervised cross-modal hashing algorithms mainly rely on data features in pre-trained models to mine their similarity relationships.…

Information Retrieval · Computer Science 2022-07-12 Liang Li, Baihua Zheng, Weiwei Sun

We introduce an efficient computational framework for hashing data belonging to multiple modalities into a single representation space where they become mutually comparable. The proposed approach is based on a novel coupled siamese neural…

Computer Vision and Pattern Recognition · Computer Science 2012-07-09 Jonathan Masci, Michael M. Bronstein, Alexander A. Bronstein, Jürgen Schmidhuber

Locality-sensitive hashing (LSH) is an effective randomized technique widely used in many machine learning tasks. The cost of hashing is proportional to data dimensions, and thus often the performance bottleneck when dimensionality is high…

Machine Learning · Computer Science 2023-09-28 Zongyuan Tan, Hongya Wang, Bo Xu, Minjie Luo, Ming Du

We show that approximate similarity (near neighbour) search can be solved in high dimensions with performance matching state of the art (data independent) Locality Sensitive Hashing, but with a guarantee of no false negatives. Specifically,…

Data Structures and Algorithms · Computer Science 2018-06-28 Thomas Dybdahl Ahle

Locality Sensitive Hashing (LSH) is an effective method of indexing a set of items to support efficient nearest neighbors queries in high-dimensional spaces. The basic idea of LSH is that similar items should produce hash collisions with…

Data Structures and Algorithms · Computer Science 2021-02-22 Haim Kaplan, Jay Tenenbaum

Hashing methods have been widely used for efficient similarity retrieval on large scale image database. Traditional hashing methods learn hash functions to generate binary codes from hand-crafted features, which achieve limited accuracy…

Computer Vision and Pattern Recognition · Computer Science 2017-11-08 Jian Zhang, Yuxin Peng

Online hashing has attracted extensive research attention when facing streaming data. Most online hashing methods, learning binary codes based on pairwise similarities of training instances, fail to capture the semantic relationship, and…

Computer Vision and Pattern Recognition · Computer Science 2019-06-03 Mingbao Lin, Rongrong Ji, Shen Chen, Feng Zheng, Xiaoshuai Sun, Baochang Zhang, Liujuan Cao, Guodong Guo, Feiyue Huang

LSH (locality sensitive hashing) had emerged as a powerful technique in nearest-neighbor search in high dimensions [IM98, HIM12]. Given a point set $P$ in a metric space, and given parameters $r$ and $\varepsilon > 0$, the task is to…

Computational Geometry · Computer Science 2017-04-11 Sariel Har-Peled, Sepideh Mahabadi