English
Related papers

Related papers: Locality-Sensitive Hashing for Efficient Web Appli…

200 papers

Similarity search (nearest neighbor search) is a problem of pursuing the data items whose distances to a query item are the smallest from a large database. Various methods have been developed to address this problem, and recently a lot of…

Data Structures and Algorithms · Computer Science 2014-08-14 Jingdong Wang , Heng Tao Shen , Jingkuan Song , Jianqiu Ji

Web services are becoming business-critical components, often deployed with critical software bugs that can be maliciously explored. Web vulnerability scanners allow the detection of security vulnerabilities in web services by stressing the…

Cryptography and Security · Computer Science 2022-12-26 Osejobe Ehichoya , Chinwuba Christian Nnaemeka

Given a binary executable without source code, it is difficult to determine what each function in the binary does by reverse engineering it, and even harder without prior experience and context. In this paper, we performed a comparison of…

Cryptography and Security · Computer Science 2021-12-13 Haoxi Tan

With the increasing concern for security in the network, many approaches are laid out that try to protect the network from unauthorised access. New methods have been adopted in order to find the potential discrepancies that may damage the…

Cryptography and Security · Computer Science 2014-03-28 Sheetal Bairwa , Bhawna Mewara , Jyoti Gajrani

Content scanning systems employ perceptual hashing algorithms to scan user content for illegal material, such as child pornography or terrorist recruitment flyers. Perceptual hashing algorithms help determine whether two images are visually…

Cryptography and Security · Computer Science 2022-12-09 Ashish Hooda , Andrey Labunets , Tadayoshi Kohno , Earlence Fernandes

Similarity search in high-dimensional spaces is an important task for many multimedia applications. Due to the notorious curse of dimensionality, approximate nearest neighbor techniques are preferred over exact searching techniques since…

Databases · Computer Science 2020-10-16 Omid Jafari , Parth Nagarkar , Jonathan Montaño

The probability Jaccard similarity was recently proposed as a natural generalization of the Jaccard similarity to measure the proximity of sets whose elements are associated with relative frequencies or probabilities. In combination with a…

Data Structures and Algorithms · Computer Science 2020-10-27 Otmar Ertl

Locality-sensitive hashing~[Indyk,Motwani'98] is a classical data structure for approximate nearest neighbor search. It allows, after a close to linear time preprocessing of the input dataset, to find an approximately nearest neighbor of…

Data Structures and Algorithms · Computer Science 2024-06-18 Michael Kapralov , Mikhail Makarov , Christian Sohler

Linux containers are gaining increasing traction in both individual and industrial use, and as these containers get integrated into mission-critical systems, real-time detection of malicious cyber attacks becomes a critical operational…

Cryptography and Security · Computer Science 2017-01-05 Amr S. Abed , Charles Clancy , David S. Levy

Similarity search is critical for many database applications, including the increasingly popular online services for Content-Based Multimedia Retrieval (CBMR). These services, which include image search engines, must handle an overwhelming…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-10-16 Thiago S. F. X. Teixeira , George Teodoro , Eduardo Valle , Joel H. Saltz

Many tools and libraries are readily available to build and operate distributed Web applications. While the setup of operational environments is comparatively easy, practice shows that their continuous secure operation is more difficult to…

Cryptography and Security · Computer Science 2012-07-13 Matteo Maria Casalino , Michele Mangili , Henrik Plate , Serena Elisa Ponta

Perceptual hashes map images with identical semantic content to the same $n$-bit hash value, while mapping semantically-different images to different hashes. These algorithms carry important applications in cybersecurity such as copyright…

Cryptography and Security · Computer Science 2022-07-29 Jagdeep Singh Bhatia , Kevin Meng

We present ElasticHash, a novel approach for high-quality, efficient, and large-scale semantic image similarity search. It is based on a deep hashing model to learn hash codes for fine-grained image similarity search in natural images and a…

Computer Vision and Pattern Recognition · Computer Science 2023-05-10 Nikolaus Korfhage , Markus Mühling , Bernd Freisleben

The importance of an efficient and scalable document similarity detection system is undeniable nowadays. Search engines need batch text similarity measures to detect duplicated and near-duplicated web pages in their indexes in order to…

Information Retrieval · Computer Science 2018-10-09 Hamid Mohammadi , Amin Nikoukaran

Automatic crash bucketing is a crucial phase in the software development process for efficiently triaging bug reports. It generally consists in grouping similar reports through clustering techniques. However, with real-time streaming bug…

Software Engineering · Computer Science 2023-10-11 Youcef Remil , Anes Bendimerad , Romain Mathonat , Chedy Raissi , Mehdi Kaytoue

Learning hash functions/codes for similarity search over multi-view data is attracting increasing attention, where similar hash codes are assigned to the data objects characterizing consistently neighborhood relationship across views.…

Machine Learning · Computer Science 2016-11-18 Lin Wu , Yang Wang

End-to-end encryption (E2EE) by messaging platforms enable people to securely and privately communicate with one another. Its widespread adoption however raised concerns that illegal content might now be shared undetected. Following the…

Cryptography and Security · Computer Science 2022-08-03 Shubham Jain , Ana-Maria Cretu , Yves-Alexandre de Montjoye

To automatically test web applications, crawling-based techniques are usually adopted to mine the behavior models, explore the state spaces or detect the violated invariants of the applications. However, in existing crawlers, rules for…

Software Engineering · Computer Science 2016-08-24 Jun-Wei Lin , Farn Wang

Background. The recent surge in phishing attacks keeps undermining the effectiveness of the traditional anti-phishing blacklist approaches. On-device anti-phishing solutions are gaining popularity as they offer faster phishing detection…

Cryptography and Security · Computer Science 2024-07-08 Ivan Petrukha , Nataliia Stulova , Sergii Kryvoblotskyi

Semantic caching has emerged as a pivotal technique for scaling LLM applications, widely adopted by major providers including AWS and Microsoft. By utilizing semantic embedding vectors as cache keys, this mechanism effectively minimizes…

Cryptography and Security · Computer Science 2026-02-02 Zhixiang Zhang , Zesen Liu , Yuchong Xie , Quanfeng Huang , Dongdong She
‹ Prev 1 2 3 10 Next ›