English
Related papers

Related papers: Quantixar: High-performance Vector Data Management…

200 papers

Vector database management systems have emerged as an important component in modern data management, driven by the growing importance for the need to computationally describe rich data such as texts, images and video in various domains such…

Databases · Computer Science 2025-01-15 Toni Taipalus

Modern deep learning models capture the semantics of complex data by transforming them into high-dimensional embedding vectors. Emerging applications, such as retrieval-augmented generation, use approximate nearest neighbor (ANN) search in…

Databases · Computer Science 2025-10-01 Guoyu Hu , Shaofeng Cai , Tien Tuan Anh Dinh , Zhongle Xie , Cong Yue , Gang Chen , Beng Chin Ooi

There are now over 20 commercial vector database management systems (VDBMSs), all produced within the past five years. But embedding-based retrieval has been studied for over ten years, and similarity search a staggering half century and…

Databases · Computer Science 2023-10-24 James Jie Pan , Jianguo Wang , Guoliang Li

We study an indexing architecture to store and search in a database of high-dimensional vectors from the perspective of statistical signal processing and decision theory. This architecture is composed of several memory units, each of which…

Computer Vision and Pattern Recognition · Computer Science 2017-03-03 Ahmet Iscen , Teddy Furon , Vincent Gripon , Michael Rabbat , Hervé Jégou

As high-dimensional vector data increasingly surpasses the processing capabilities of traditional database management systems, Vector Databases (VDBs) have emerged and become tightly integrated with large language models, being widely…

Vector indexing enables semantic search over diverse corpora and has become an important interface to databases for both users and AI agents. Efficient vector search requires deep optimizations in database systems. This has motivated a new…

Dimensionality reduction in vector databases is pivotal for streamlining AI data management, enabling efficient storage, faster computation, and improved model performance. This paper explores the benefits of reducing vector database…

Databases · Computer Science 2024-04-10 Vitaly Bulgakov , Alec Segal

Many multimedia information retrieval or machine learning problems require efficient high-dimensional nearest neighbor search techniques. For instance, multimedia objects (images, music or videos) can be represented by high-dimensional…

Computer Vision and Pattern Recognition · Computer Science 2017-12-11 Fabien André

There is an increasing demand for extending existing DBMSs with vector indices so that they become unified systems capable of supporting modern predictive applications, which require joint querying of vector embeddings together with the…

Information Retrieval · Computer Science 2025-07-01 Gaurav Sehgal , Semih Salihoglu

Quantization based techniques are the current state-of-the-art for scaling maximum inner product search to massive databases. Traditional approaches to quantization aim to minimize the reconstruction error of the database points. Based on…

Machine Learning · Computer Science 2020-12-08 Ruiqi Guo , Philip Sun , Erik Lindgren , Quan Geng , David Simcha , Felix Chern , Sanjiv Kumar

Vector databases have rapidly grown in popularity, enabling efficient similarity search over data such as text, images, and video. They now play a central role in modern AI workflows, aiding large language models by grounding model outputs…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-17 Seth Ockerman , Amal Gueroudji , Song Young Oh , Robert Underwood , Nicholas Chia , Kyle Chard , Robert Ross , Shivaram Venkataraman

Fast Nearest Neighbor (NN) search is a fundamental challenge in large-scale data processing and analytics, particularly for analyzing multimedia contents which are often of high dimensionality. Instead of using exact NN search, extensive…

Information Retrieval · Computer Science 2016-05-19 Ji Wan , Sheng Tang , Yongdong Zhang , Jintao Li , Pengcheng Wu , Steven C. H. Hoi

Quantum computing has shown promise for solving complex optimization problems in databases, such as join ordering and index selection. Prior work often submits formulated problems directly to black-box quantum or quantum-inspired solvers…

Databases · Computer Science 2026-02-17 Hanwen Liu , Ibrahim Sabek

Vector search systems, pivotal in AI applications, often rely on the Hierarchical Navigable Small Worlds (HNSW) algorithm. However, the behaviour of HNSW under real-world scenarios using vectors generated with deep learning models remains…

Information Retrieval · Computer Science 2025-06-10 Owen Pendrigh Elliott , Jesse Clark

Vector search plays a crucial role in many real-world applications. In addition to single-vector search, multi-vector search becomes important for multi-modal and multi-feature scenarios today. In a multi-vector database, each row is an…

Databases · Computer Science 2026-05-05 Jiongli Zhu , Yue Wang , Bailu Ding , Philip A. Bernstein , Vivek Narasayya , Surajit Chaudhuri

With the rising applications implemented in different domains, it is inevitable to require databases to adopt corresponding appropriate data models to store and exchange data derived from various sources. To handle these data models in a…

Databases · Computer Science 2021-09-02 Gongsheng Yuan , Jiaheng Lu , Peifeng Su

Vector representations and vector space modeling (VSM) play a central role in modern machine learning. We propose a novel approach to `vector similarity searching' over dense semantic representations of words and documents that can be…

Information Retrieval · Computer Science 2017-06-06 Jan Rygl , Jan Pomikálek , Radim Řehůřek , Michal Růžička , Vít Novotný , Petr Sojka

Vectors of data are at the heart of machine learning and data mining. Recently, vector quantization methods have shown great promise in reducing both the time and space costs of operating on vectors. We introduce a vector quantization…

Performance · Computer Science 2017-07-03 Davis W Blalock , John V Guttag

Approximate nearest neighbor (ANN) query in high-dimensional Euclidean space is a key operator in database systems. For this query, quantization is a popular family of methods developed for compressing vectors and reducing memory…

Databases · Computer Science 2024-09-17 Jianyang Gao , Yutong Gou , Yuexuan Xu , Yongyi Yang , Cheng Long , Raymond Chi-Wing Wong

Modern deep learning models have the ability to generate high-dimensional vectors whose similarity reflects semantic resemblance. Thus, similarity search, i.e., the operation of retrieving those vectors in a large collection that are…

Machine Learning · Computer Science 2024-04-04 Mariano Tepper , Ishwar Singh Bhati , Cecilia Aguerrebere , Mark Hildebrand , Ted Willke
‹ Prev 1 2 3 10 Next ›