Related papers: Efficient Multi-Vector Dense Retrieval Using Bit V…

Query Embedding Pruning for Dense Retrieval

Recent advances in dense retrieval techniques have offered the promise of being able not just to re-rank documents using contextualised language models such as BERT, but also to use such models to identify documents from the collection in…

Information Retrieval · Computer Science 2021-08-25 Nicola Tonellotto , Craig Macdonald

Model-enhanced Vector Index

Embedding-based retrieval methods construct vector indices to search for document representations that are most similar to the query representations. They are widely used in document retrieval due to low latency and decent recall…

Information Retrieval · Computer Science 2023-11-10 Hailin Zhang , Yujing Wang , Qi Chen , Ruiheng Chang , Ting Zhang , Ziming Miao , Yingyan Hou , Yang Ding , Xupeng Miao , Haonan Wang , Bochen Pang , Yuefeng Zhan , Hao Sun , Weiwei Deng , Qi Zhang , Fan Yang , Xing Xie , Mao Yang , Bin Cui

Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval

Recently, the retrieval models based on dense representations have been gradually applied in the first stage of the document retrieval tasks, showing better performance than traditional sparse vector space models. To obtain high efficiency,…

Information Retrieval · Computer Science 2021-08-20 Hongyin Tang , Xingwu Sun , Beihong Jin , Jingang Wang , Fuzheng Zhang , Wei Wu

On the Value of Behavioral Representations for Dense Retrieval

We consider text retrieval within dense representational space in real-world settings such as e-commerce search where (a) document popularity and (b) diversity of queries associated with a document have a skewed distribution. Most of the…

Information Retrieval · Computer Science 2022-08-12 Nan Jiang , Dhivya Eswaran , Choon Hui Teo , Yexiang Xue , Yesh Dattatreya , Sujay Sanghavi , Vishy Vishwanathan

Investigating Multi-layer Representations for Dense Passage Retrieval

Dense retrieval models usually adopt vectors from the last hidden layer of the document encoder to represent a document, which is in contrast to the fact that representations in different layers of a pre-trained language model usually…

Information Retrieval · Computer Science 2025-09-30 Zhongbin Xie , Thomas Lukasiewicz

Multi-View Document Representation Learning for Open-Domain Dense Retrieval

Dense retrieval has achieved impressive advances in first-stage retrieval from a large-scale document collection, which is built on bi-encoder architecture to produce single vector representation of query and document. However, a document…

Computation and Language · Computer Science 2022-03-17 Shunyu Zhang , Yaobo Liang , Ming Gong , Daxin Jiang , Nan Duan

Pseudo-Relevance Feedback for Multiple Representation Dense Retrieval

Pseudo-relevance feedback mechanisms, from Rocchio to the relevance models, have shown the usefulness of expanding and reweighting the users' initial queries using information occurring in an initial set of retrieved documents, known as the…

Information Retrieval · Computer Science 2021-07-02 Xiao Wang , Craig Macdonald , Nicola Tonellotto , Iadh Ounis

MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction

Universal multimodal embedding models have achieved great success in capturing semantic relevance between queries and candidates. However, current methods either condense queries and candidates into a single vector, potentially limiting the…

Information Retrieval · Computer Science 2026-04-08 Zilin Xiao , Qi Ma , Mengting Gu , Chun-cheng Jason Chen , Xintao Chen , Vicente Ordonez , Vijai Mohan

Binary Embedding-based Retrieval at Tencent

Large-scale embedding-based retrieval (EBR) is the cornerstone of search-related industrial applications. Given a user query, the system of EBR aims to identify relevant information from a large corpus of documents that may be tens or…

Information Retrieval · Computer Science 2023-02-20 Yukang Gan , Yixiao Ge , Chang Zhou , Shupeng Su , Zhouchuan Xu , Xuyuan Xu , Quanchao Hui , Xiang Chen , Yexin Wang , Ying Shan

Dense Passage Retrieval in Conversational Search

Information retrieval systems have traditionally relied on exact term match methods such as BM25 for first-stage retrieval. However, recent advancements in neural network-based techniques have introduced a new method called dense retrieval.…

Information Retrieval · Computer Science 2025-03-25 Ahmed H. Salamah , Pierre McWhannel , Nicole Yan

MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings

Neural embedding models have become a fundamental component of modern information retrieval (IR) pipelines. These models produce a single embedding $x \in \mathbb{R}^d$ per data-point, allowing for fast retrieval via highly optimized…

Data Structures and Algorithms · Computer Science 2024-05-31 Laxman Dhulipala , Majid Hadian , Rajesh Jayaram , Jason Lee , Vahab Mirrokni

Span-Aggregatable, Contextualized Word Embeddings for Effective Phrase Mining

Dense vector representations for sentences made significant progress in recent years as can be seen on sentence similarity tasks. Real-world phrase retrieval applications, on the other hand, still encounter challenges for effective use of…

Computation and Language · Computer Science 2024-05-14 Eyal Orbach , Lev Haikin , Nelly David , Avi Faizakof

Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback

Dense retrieval systems conduct first-stage retrieval using embedded representations and simple similarity metrics to match a query to documents. Its effectiveness depends on encoded embeddings to capture the semantics of queries and…

Information Retrieval · Computer Science 2021-09-01 HongChien Yu , Chenyan Xiong , Jamie Callan

A Multi-Resolution Word Embedding for Document Retrieval from Large Unstructured Knowledge Bases

Deep language models learning a hierarchical representation proved to be a powerful tool for natural language processing, text mining and information retrieval. However, representations that perform well for retrieval must capture semantic…

Information Retrieval · Computer Science 2019-05-24 Tolgahan Cakaloglu , Xiaowei Xu

Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval

Dense Retrieval (DR) has achieved state-of-the-art first-stage ranking effectiveness. However, the efficiency of most existing DR models is limited by the large memory cost of storing dense vectors and the time-consuming nearest neighbor…

Information Retrieval · Computer Science 2021-10-13 Jingtao Zhan , Jiaxin Mao , Yiqun Liu , Jiafeng Guo , Min Zhang , Shaoping Ma

Divide and Conquer: Towards Better Embedding-based Retrieval for Recommender Systems From a Multi-task Perspective

Embedding-based retrieval (EBR) methods are widely used in modern recommender systems thanks to its simplicity and effectiveness. However, along the journey of deploying and iterating on EBR in production, we still identify some fundamental…

Information Retrieval · Computer Science 2023-02-07 Yuan Zhang , Xue Dong , Weijie Ding , Biao Li , Peng Jiang , Kun Gai

LEMUR: Learned Multi-Vector Retrieval

Multi-vector representations generated by late interaction models, such as ColBERT, enable superior retrieval quality compared to single-vector representations in information retrieval applications. In multi-vector retrieval systems, both…

Information Retrieval · Computer Science 2026-05-22 Elias Jääsaari , Ville Hyvönen , Teemu Roos

An Analysis on Matching Mechanisms and Token Pruning for Late-interaction Models

With the development of pre-trained language models, the dense retrieval models have become promising alternatives to the traditional retrieval models that rely on exact match and sparse bag-of-words representations. Different from most…

Information Retrieval · Computer Science 2024-03-21 Qi Liu , Gang Guo , Jiaxin Mao , Zhicheng Dou , Ji-Rong Wen , Hao Jiang , Xinyu Zhang , Zhao Cao

Llama2Vec: Unsupervised Adaptation of Large Language Models for Dense Retrieval

Dense retrieval calls for discriminative embeddings to represent the semantic relationship between query and document. It may benefit from the using of large language models (LLMs), given LLMs' strong capability on semantic understanding.…

Computation and Language · Computer Science 2025-11-25 Zheng Liu , Chaofan Li , Shitao Xiao , Yingxia Shao , Defu Lian

Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval

Ad-hoc search calls for the selection of appropriate answers from a massive-scale corpus. Nowadays, the embedding-based retrieval (EBR) becomes a promising solution, where deep learning based document representation and ANN search…

Information Retrieval · Computer Science 2022-03-03 Shitao Xiao , Zheng Liu , Weihao Han , Jianjin Zhang , Yingxia Shao , Defu Lian , Chaozhuo Li , Hao Sun , Denvy Deng , Liangjie Zhang , Qi Zhang , Xing Xie