Related papers: Quantixar: High-performance Vector Data Management…

Vector database management systems: Fundamental concepts, use-cases, and current challenges

Vector database management systems have emerged as an important component in modern data management, driven by the growing importance for the need to computationally describe rich data such as texts, images and video in various domains such…

Databases · Computer Science 2025-01-15 Toni Taipalus

HAKES: Scalable Vector Database for Embedding Search Service

Modern deep learning models capture the semantics of complex data by transforming them into high-dimensional embedding vectors. Emerging applications, such as retrieval-augmented generation, use approximate nearest neighbor (ANN) search in…

Databases · Computer Science 2025-10-01 Guoyu Hu, Shaofeng Cai, Tien Tuan Anh Dinh, Zhongle Xie, Cong Yue, Gang Chen, Beng Chin Ooi

Survey of Vector Database Management Systems

There are now over 20 commercial vector database management systems (VDBMSs), all produced within the past five years. But embedding-based retrieval has been studied for over ten years, and similarity search a staggering half century and…

Databases · Computer Science 2023-10-24 James Jie Pan, Jianguo Wang, Guoliang Li

Memory vectors for similarity search in high-dimensional spaces

We study an indexing architecture to store and search in a database of high-dimensional vectors from the perspective of statistical signal processing and decision theory. This architecture is composed of several memory units, each of which…

Computer Vision and Pattern Recognition · Computer Science 2017-03-03 Ahmet Iscen, Teddy Furon, Vincent Gripon, Michael Rabbat, Hervé Jégou

A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge

As high-dimensional vector data increasingly surpasses the processing capabilities of traditional database management systems, Vector Databases (VDBs) have emerged and become tightly integrated with large language models, being widely…

Databases · Computer Science 2026-03-27 Le Ma, Ran Zhang, Yikun Han, Shirui Yu, Zaitian Wang, Zhiyuan Ning, Jinghan Zhang, Ping Xu, Pengjiang Li, Ziyue Qiao, Wei Ju, Chong Chen, Dongjie Wang, Kunpeng Liu, Pengyang Wang, Pengfei Wang, Yanjie Fu, Chunjiang Liu, Yuanchun Zhou, Chang-Tien Lu

Cost-Effective, Low Latency Vector Search with Azure Cosmos DB

Vector indexing enables semantic search over diverse corpora and has become an important interface to databases for both users and AI agents. Efficient vector search requires deep optimizations in database systems. This has motivated a new…

Databases · Computer Science 2025-08-04 Nitish Upreti, Harsha Vardhan Simhadri, Hari Sudan Sundar, Krishnan Sundaram, Samer Boshra, Balachandar Perumalswamy, Shivam Atri, Martin Chisholm, Revti Raman Singh, Greg Yang, Tamara Hass, Nitesh Dudhey, Subramanyam Pattipaka, Mark Hildebrand, Magdalen Manohar, Jack Moffitt, Haiyang Xu, Naren Datha, Suryansh Gupta, Ravishankar Krishnaswamy, Prashant Gupta, Abhishek Sahu, Hemeswari Varada, Sudhanshu Barthwal, Ritika Mor, James Codella, Shaun Cooper, Kevin Pilch, Simon Moreno, Aayush Kataria, Santosh Kulkarni, Neil Deshpande, Amar Sagare, Dinesh Billa, Zishan Fu, Vipul Vishal

Dimensionality Reduction in Sentence Transformer Vector Databases with Fast Fourier Transform

Dimensionality reduction in vector databases is pivotal for streamlining AI data management, enabling efficient storage, faster computation, and improved model performance. This paper explores the benefits of reducing vector database…

Databases · Computer Science 2024-04-10 Vitaly Bulgakov, Alec Segal

Exploiting Modern Hardware for High-Dimensional Nearest Neighbor Search

Many multimedia information retrieval or machine learning problems require efficient high-dimensional nearest neighbor search techniques. For instance, multimedia objects (images, music or videos) can be represented by high-dimensional…

Computer Vision and Pattern Recognition · Computer Science 2017-12-11 Fabien André

NaviX: A Native Vector Index Design for Graph DBMSs With Robust Predicate-Agnostic Search Performance

There is an increasing demand for extending existing DBMSs with vector indices so that they become unified systems capable of supporting modern predictive applications, which require joint querying of vector embeddings together with the…

Information Retrieval · Computer Science 2025-07-01 Gaurav Sehgal, Semih Salihoglu

Accelerating Large-Scale Inference with Anisotropic Vector Quantization

Quantization based techniques are the current state-of-the-art for scaling maximum inner product search to massive databases. Traditional approaches to quantization aim to minimize the reconstruction error of the database points. Based on…

Machine Learning · Computer Science 2020-12-08 Ruiqi Guo, Philip Sun, Erik Lindgren, Quan Geng, David Simcha, Felix Chern, Sanjiv Kumar

Exploring Distributed Vector Databases Performance on HPC Platforms: A Study with Qdrant

Vector databases have rapidly grown in popularity, enabling efficient similarity search over data such as text, images, and video. They now play a central role in modern AI workflows, aiding large language models by grounding model outputs…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-17 Seth Ockerman, Amal Gueroudji, Song Young Oh, Robert Underwood, Nicholas Chia, Kyle Chard, Robert Ross, Shivaram Venkataraman

HDIdx: High-Dimensional Indexing for Efficient Approximate Nearest Neighbor Search

Fast Nearest Neighbor (NN) search is a fundamental challenge in large-scale data processing and analytics, particularly for analyzing multimedia contents which are often of high dimensionality. Instead of using exact NN search, extensive…

Information Retrieval · Computer Science 2016-05-19 Ji Wan, Sheng Tang, Yongdong Zhang, Jintao Li, Pengcheng Wu, Steven C. H. Hoi

Towards a Hybrid Quantum-Classical Computing Framework for Database Optimization Problems in Real Time Setup

Quantum computing has shown promise for solving complex optimization problems in databases, such as join ordering and index selection. Prior work often submits formulated problems directly to black-box quantum or quantum-inspired solvers…

Databases · Computer Science 2026-02-17 Hanwen Liu, Ibrahim Sabek

The Impacts of Data, Ordering, and Intrinsic Dimensionality on Recall in Hierarchical Navigable Small Worlds

Vector search systems, pivotal in AI applications, often rely on the Hierarchical Navigable Small Worlds (HNSW) algorithm. However, the behaviour of HNSW under real-world scenarios using vectors generated with deep learning models remains…

Information Retrieval · Computer Science 2025-06-10 Owen Pendrigh Elliott, Jesse Clark

MINT: Multi-Vector Search Index Tuning

Vector search plays a crucial role in many real-world applications. In addition to single-vector search, multi-vector search becomes important for multi-modal and multi-feature scenarios today. In a multi-vector database, each row is an…

Databases · Computer Science 2026-05-05 Jiongli Zhu, Yue Wang, Bailu Ding, Philip A. Bernstein, Vivek Narasayya, Surajit Chaudhuri

Quantum-Inspired Keyword Search on Multi-Model Databases

With the rising applications implemented in different domains, it is inevitable to require databases to adopt corresponding appropriate data models to store and exchange data derived from various sources. To handle these data models in a…

Databases · Computer Science 2021-09-02 Gongsheng Yuan, Jiaheng Lu, Peifeng Su

Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines

Vector representations and vector space modeling (VSM) play a central role in modern machine learning. We propose a novel approach to 'vector similarity searching' over dense semantic representations of words and documents that can be…

Information Retrieval · Computer Science 2017-06-06 Jan Rygl, Jan Pomikálek, Radim Řehůřek, Michal Růžička, Vít Novotný, Petr Sojka

Bolt: Accelerated Data Mining with Fast Vector Compression

Vectors of data are at the heart of machine learning and data mining. Recently, vector quantization methods have shown great promise in reducing both the time and space costs of operating on vectors. We introduce a vector quantization…

Performance · Computer Science 2017-07-03 Davis W Blalock, John V Guttag

Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search

Approximate nearest neighbor (ANN) query in high-dimensional Euclidean space is a key operator in database systems. For this query, quantization is a popular family of methods developed for compressing vectors and reducing memory…

Databases · Computer Science 2024-09-17 Jianyang Gao, Yutong Gou, Yuexuan Xu, Yongyi Yang, Cheng Long, Raymond Chi-Wing Wong

LeanVec: Searching vectors faster by making them fit

Modern deep learning models have the ability to generate high-dimensional vectors whose similarity reflects semantic resemblance. Thus, similarity search, i.e., the operation of retrieving those vectors in a large collection that are…

Machine Learning · Computer Science 2024-04-04 Mariano Tepper, Ishwar Singh Bhati, Cecilia Aguerrebere, Mark Hildebrand, Ted Willke