Related papers: Quantization for Vector Search under Streaming Upd…

Online Product Quantization

Approximate nearest neighbor (ANN) search has achieved great success in many tasks. However, existing popular methods for ANN search, such as hashing and quantization methods, are designed for static databases only. They cannot handle well…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Donna Xu , Ivor W. Tsang , Ying Zhang

Locally-Adaptive Quantization for Streaming Vector Search

Retrieving the most similar vector embeddings to a given query among a massive collection of vectors has long been a key component of countless real-world applications. The recently introduced Retrieval-Augmented Generation is one of the…

Machine Learning · Computer Science 2024-02-06 Cecilia Aguerrebere , Mark Hildebrand , Ishwar Singh Bhati , Theodore Willke , Mariano Tepper

FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search

Approximate nearest neighbor search (ANNS) is a fundamental building block in information retrieval with graph-based indices being the current state-of-the-art and widely used in the industry. Recent advances in graph-based indices have…

Information Retrieval · Computer Science 2021-05-21 Aditi Singh , Suhas Jayaram Subramanya , Ravishankar Krishnaswamy , Harsha Vardhan Simhadri

Accelerating Large-Scale Inference with Anisotropic Vector Quantization

Quantization based techniques are the current state-of-the-art for scaling maximum inner product search to massive databases. Traditional approaches to quantization aim to minimize the reconstruction error of the database points. Based on…

Machine Learning · Computer Science 2020-12-08 Ruiqi Guo , Philip Sun , Erik Lindgren , Quan Geng , David Simcha , Felix Chern , Sanjiv Kumar

Approximate Nearest Neighbour Search on Dynamic Datasets: An Investigation

Approximate k-Nearest Neighbour (ANN) methods are often used for mining information and aiding machine learning on large scale high-dimensional datasets. ANN methods typically differ in the index structure used for accelerating searches,…

Machine Learning · Computer Science 2025-02-04 Ben Harwood , Amir Dezfouli , Iadine Chades , Conrad Sanderson

Quake: Adaptive Indexing for Vector Search

Vector search, the task of finding the k-nearest neighbors of a query vector against a database of high-dimensional vectors, underpins many machine learning applications, including retrieval-augmented generation, recommendation systems, and…

Information Retrieval · Computer Science 2025-06-10 Jason Mohoney , Devesh Sarda , Mengze Tang , Shihabur Rahman Chowdhury , Anil Pacaci , Ihab F. Ilyas , Theodoros Rekatsinas , Shivaram Venkataraman

Learning Better Encoding for Approximate Nearest Neighbor Search with Dictionary Annealing

We introduce a novel dictionary optimization method for high-dimensional vector quantization employed in approximate nearest neighbor (ANN) search. Vector quantization methods first seek a series of dictionaries, then approximate each…

Computer Vision and Pattern Recognition · Computer Science 2015-07-07 Shicong Liu , Hongtao Lu

On Storage Neural Network Augmented Approximate Nearest Neighbor Search

Large-scale approximate nearest neighbor search (ANN) has been gaining attention along with the latest machine learning researches employing ANNs. If the data is too large to fit in memory, it is necessary to search for the most similar…

Machine Learning · Computer Science 2025-01-29 Taiga Ikeda , Daisuke Miyashita , Jun Deguchi

Passing the Baton: High Throughput Distributed Disk-Based Vector Search with BatANN

Vector search underpins modern information-retrieval systems, including retrieval-augmented generation (RAG) pipelines and search engines over unstructured text and images. As datasets scale to billions of vectors, disk-based vector search…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-07 Nam Anh Dang , Ben Landrum , Ken Birman

A Topology-Aware Localized Update Strategy for Graph-Based ANN Index

The graph-based index has been widely adopted to meet the demand for approximate nearest neighbor search (ANNS) for high-dimensional vectors. However, in dynamic scenarios involving frequent vector insertions and deletions, existing systems…

Databases · Computer Science 2025-03-19 Song Yu , Shengyuan Lin , Shufeng Gong , Yongqing Xie , Ruicheng Liu , Yijie Zhou , Ji Sun , Yanfeng Zhang , Guoliang Li , Ge Yu

Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search

Approximate nearest neighbor (ANN) query in high-dimensional Euclidean space is a key operator in database systems. For this query, quantization is a popular family of methods developed for compressing vectors and reducing memory…

Databases · Computer Science 2024-09-17 Jianyang Gao , Yutong Gou , Yuexuan Xu , Yongyi Yang , Cheng Long , Raymond Chi-Wing Wong

BBC: Improving Large-k Approximate Nearest Neighbor Search with a Bucket-based Result Collector

Although Approximate Nearest Neighbor (ANN) search has been extensively studied, large-k ANN queries that aim to retrieve a large number of nearest neighbors remain underexplored, despite their numerous real-world applications. Existing ANN…

Databases · Computer Science 2026-05-05 Ziqi Yin , Gao Cong , Kai Zeng , Jinwei Zhu , Bin Cui

In-Place Updates of a Graph Index for Streaming Approximate Nearest Neighbor Search

Indices for approximate nearest neighbor search (ANNS) are a basic component for information retrieval and widely used in database, search, recommendation and RAG systems. In these scenarios, documents or other objects are inserted into and…

Information Retrieval · Computer Science 2025-02-20 Haike Xu , Magdalen Dobson Manohar , Philip A. Bernstein , Badrish Chandramouli , Richard Wen , Harsha Vardhan Simhadri

DiverGet: A Search-Based Software Testing Approach for Deep Neural Network Quantization Assessment

Quantization is one of the most applied Deep Neural Network (DNN) compression strategies, when deploying a trained DNN model on an embedded system or a cell phone. This is owing to its simplicity and adaptability to a wide range of…

Machine Learning · Computer Science 2022-10-10 Ahmed Haj Yahmed , Houssem Ben Braiek , Foutse Khomh , Sonia Bouzidi , Rania Zaatour

Vector and Line Quantization for Billion-scale Similarity Search on GPUs

Billion-scale high-dimensional approximate nearest neighbour (ANN) search has become an important problem for searching similar objects among the vast amount of images and videos available online. The existing ANN methods are usually…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Wei Chen , Jincai Chen , Fuhao Zou , Yuan-Fang Li , Ping Lu , Qiang Wang , Wei Zhao

Low-Precision Quantization for Efficient Nearest Neighbor Search

Fast k-Nearest Neighbor search over real-valued vector spaces (KNN) is an important algorithmic task for information retrieval and recommendation systems. We present a method for using reduced precision to represent vectors through…

Information Retrieval · Computer Science 2021-10-19 Anthony Ko , Iman Keivanloo , Vihan Lakshman , Eric Schkufza

SAQ: Pushing the Limits of Vector Quantization through Code Adjustment and Dimension Segmentation

Approximate Nearest Neighbor Search (ANNS) plays a critical role in applications such as search engines, recommender systems, and RAG for LLMs. Vector quantization (VQ), a crucial technique for ANNS, is commonly used to reduce space…

Databases · Computer Science 2026-01-22 Hui Li , Shiyuan Deng , Xiao Yan , Xiangyu Zhi , James Cheng

Relative NN-Descent: A Fast Index Construction for Graph-Based Approximate Nearest Neighbor Search

Approximate Nearest Neighbor Search (ANNS) is the task of finding the database vector that is closest to a given query vector. Graph-based ANNS is the family of methods with the best balance of accuracy and speed for million-scale datasets.…

Information Retrieval · Computer Science 2023-11-01 Naoki Ono , Yusuke Matsui

Updatable Balanced Index for Stable Streaming Similarity Search over Large-Scale Fresh Vectors

As artificial intelligence gains more and more popularity, vectors are one of the most widely used data structures for services such as information retrieval and recommendation. Approximate Nearest Neighbor Search (ANNS), which generally…

Databases · Computer Science 2026-02-03 Yuhui Lai , Shixun Huang , Sheng Wang

I/O Optimizations for Graph-Based Disk-Resident Approximate Nearest Neighbor Search: A Design Space Exploration

Approximate nearest neighbor (ANN) search on SSD-backed indexes is increasingly I/O-bound (I/O accounts for 70--90\% of query latency). We present an I/O-first framework for disk-based ANN that organizes techniques along three dimensions:…

Databases · Computer Science 2026-03-24 Liang Li , Shufeng Gong , Yanan Yang , Yiduo Wang , Jie Wu