English
Related papers

Related papers: Cloud-Native Vector Search: A Comprehensive Perfor…

200 papers

Vector search (VS) has become a fundamental component in multimodal data management, enabling core functionalities such as image, video, and code retrieval. As vector data scales rapidly, VS faces growing challenges in balancing search,…

Databases · Computer Science 2026-01-06 Yitong Song , Xuanhe Zhou , Christian S. Jensen , Jianliang Xu

The rapid growth of machine learning capabilities and the adoption of data processing methods using vector embeddings sparked a great interest in creating systems for vector data management. While the predominant approach of vector data…

Databases · Computer Science 2024-03-26 Viktor Sanca , Anastasia Ailamaki

Vector searches on large-scale datasets are critical to modern online services like web search and RAG, which necessity storing the datasets and their index on the secondary storage like SSD. In this paper, we are the first to characterize…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-08 Rongxin Cheng , Yifan Peng , Xingda Wei , Hongrui Xie , Rong Chen , Sijie Shen , Haibo Chen

In multi-vector retrieval, both queries and data are represented as sets of high-dimensional vectors, enabling finer-grained semantic matching and improving retrieval quality over single-vector approaches. However, its practical adoption is…

Information Retrieval · Computer Science 2026-03-24 Yao Tian , Zhoujin Tian , Xi Zhao , Ruiyuan Zhang , Xiaofang Zhou

Graph-based high-dimensional vector indices have become a mainstream solution for large-scale approximate nearest neighbor search (ANNS). However, their substantial memory footprint often requires storage on secondary devices, where…

Databases · Computer Science 2025-08-22 Yijie Zhou , Shengyuan Lin , Shufeng Gong , Song Yu , Shuhao Fan , Yanfeng Zhang , Ge Yu

Vector search (VS) is now available in most database engines. However, while vector search is a common feature in AI/ML/LLMs where the dominant computing platforms are GPUs, existing database engines operate on CPUs even when implementing…

Databases · Computer Science 2026-05-18 Vasilis Mageirakos , Joel André , Marko Kabić , Bowen Wu , Yannis Chronis , Gustavo Alonso

Vector indexing enables semantic search over diverse corpora and has become an important interface to databases for both users and AI agents. Efficient vector search requires deep optimizations in database systems. This has motivated a new…

Vector data is prevalent across business and scientific applications, and its popularity is growing with the proliferation of learned embeddings. Vector data collections often reach billions of vectors with thousands of dimensions, thus,…

Information Retrieval · Computer Science 2025-09-08 Ilias Azizi , Karima Echihabi , Themis Palpanas

Vector data is prevalent across business and scientific applications, and its popularity is growing with the proliferation of learned embeddings. Vector data collections often reach billions of vectors with thousands of dimensions, thus,…

Information Retrieval · Computer Science 2025-09-09 Ilias Azizi , Karima Echihab , Themis Palpanas , Vassilis Christophides

Vector databases have emerged as a new type of systems that support efficient querying of high-dimensional vectors. Many of these offer their database as a service in the cloud. However, the variety of available CPUs and the lack of vector…

Databases · Computer Science 2025-05-13 Leonardo Kuffo , Peter Boncz

As high-dimensional vector data increasingly surpasses the processing capabilities of traditional database management systems, Vector Databases (VDBs) have emerged and become tightly integrated with large language models, being widely…

There is an increasing demand for extending existing DBMSs with vector indices so that they become unified systems capable of supporting modern predictive applications, which require joint querying of vector embeddings together with the…

Information Retrieval · Computer Science 2025-07-01 Gaurav Sehgal , Semih Salihoglu

Similarity-based vector search underpins many important applications, but a key challenge is processing massive vector datasets (e.g., in TBs). To reduce costs, some systems utilize SSDs as the primary data storage. They employ a proximity…

Databases · Computer Science 2025-08-22 Peiqi Yin , Xiao Yan , Qihui Zhou , Hui Li , Xiaolu Li , Lin Zhang , Meiling Wang , Xin Yao , James Cheng

Graph-based indexing is the dominant approach for approximate nearest neighbor search in vector databases, offering high recall with low latency across billions of vectors. However, in such indices, the edge set of the proximity graph is…

Databases · Computer Science 2026-03-03 Sami Abuzakuk , Anne-Marie Kermarrec , Rafael Pires , Mathis Randl , Martijn de Vos

Embedding-based vector search underpins many important applications, such as recommendation and retrieval-augmented generation (RAG). It relies on vector indices to enable efficient search. However, these indices require storing…

The rapid adoption of AI-powered applications demands high-performance, scalable, and efficient cloud database solutions, as traditional architectures often struggle with AI-driven workloads requiring real-time data access, vector search,…

Databases · Computer Science 2025-05-06 Santosh Bhupathi

Vector search underpins modern AI applications by supporting approximate nearest neighbor (ANN) queries over high-dimensional embeddings in tasks like retrieval-augmented generation (RAG), recommendation systems, and multimodal search.…

Databases · Computer Science 2026-05-19 Shurui Zhong , Dingheng Mo , Siqiang Luo

Vector databases have emerged as key enablers for bridging intelligent applications with unstructured data, providing generic search and management support for embedding vectors extracted from the raw unstructured data. As multiple data…

Databases · Computer Science 2024-01-17 Yicheng Jin , Yongji Wu , Wenjun Hu , Bruce M. Maggs , Xiao Zhang , Danyang Zhuo

Businesses have made increasing adoption and incorporation of cloud technology into internal processes in the last decade. The cloud-based deployment provides on-demand availability without active management. More recently, the concept of…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-01 Ying Mao , Yuqi Fu , Suwen Gu , Wenrui Mu , Long Cheng , Qingzhi Liu

Classification is a common AI problem, and vector search is a typical solution. This transforms a given body of text into a numerical representation, known as an embedding, and modern improvements to vector search focus on optimising speed…

Information Retrieval · Computer Science 2025-06-04 Lee Harris
‹ Prev 1 2 3 10 Next ›