Related papers: Constrained Auto-Regressive Decoding Constrains Ge…

Generative Retrieval Overcomes Limitations of Dense Retrieval but Struggles with Identifier Ambiguity

While dense retrieval models, which embed queries and documents into a shared low-dimensional space, have gained widespread popularity, they were shown to exhibit important theoretical limitations and considerably lag behind traditional…

Information Retrieval · Computer Science 2026-04-09 Adrian Bracher, Svitlana Vakulenko

How Does Generative Retrieval Scale to Millions of Passages?

Popularized by the Differentiable Search Index, the emerging paradigm of generative retrieval re-frames the classic information retrieval problem into a sequence-to-sequence modeling task, forgoing external indices and encoding an entire…

Information Retrieval · Computer Science 2023-05-22 Ronak Pradeep, Kai Hui, Jai Gupta, Adam D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran

Does Generative Retrieval Overcome the Limitations of Dense Retrieval?

Generative retrieval (GR) has emerged as a new paradigm in neural information retrieval, offering an alternative to dense retrieval (DR) by directly generating identifiers of relevant documents. In this paper, we theoretically and…

Information Retrieval · Computer Science 2025-11-12 Yingchen Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng

Generative Retrieval for Book search

In book search, relevant book information should be returned in response to a query. Books contain complex, multi-faceted information such as metadata, outlines, and main text, where the outline provides hierarchical information between…

Information Retrieval · Computer Science 2025-01-22 Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Shihao Liu, Shuaiqing Wang, Dawei Yin, Xueqi Cheng

Generative Retrieval Meets Multi-Graded Relevance

Generative retrieval represents a novel approach to information retrieval. It uses an encoder-decoder architecture to directly produce relevant document identifiers (docids) for queries. While this method offers benefits, current approaches…

Information Retrieval · Computer Science 2024-09-30 Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Wei Chen, Xueqi Cheng

Generative Dense Retrieval: Memory Can Be a Burden

Generative Retrieval (GR), autoregressively decoding relevant document identifiers given a query, has been shown to perform well under the setting of small-scale corpora. By memorizing the document corpus with model parameters, GR…

Information Retrieval · Computer Science 2024-01-22 Peiwen Yuan, Xinglin Wang, Shaoxiong Feng, Boyuan Pan, Yiwei Li, Heda Wang, Xupeng Miao, Kan Li

Controlled LLM Decoding via Discrete Auto-regressive Biasing

Controlled text generation allows for enforcing user-defined constraints on large language model outputs, an increasingly important field as LLMs become more prevalent in everyday life. One common approach uses energy-based decoding, which…

Computation and Language · Computer Science 2025-02-07 Patrick Pynadath, Ruqi Zhang

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Large language models (LLMs) exhibit remarkable generative capabilities but often suffer from hallucinations. Retrieval-augmented generation (RAG) offers an effective solution by incorporating external knowledge, but existing methods still…

Computation and Language · Computer Science 2024-12-17 Xiaoxi Li, Jiajie Jin, Yujia Zhou, Yongkang Wu, Zhonghua Li, Qi Ye, Zhicheng Dou

Generative Retrieval with Preference Optimization for E-commerce Search

Generative retrieval introduces a groundbreaking paradigm to document retrieval by directly generating the identifier of a pertinent document in response to a specific query. This paradigm has demonstrated considerable benefits and…

Information Retrieval · Computer Science 2024-10-28 Mingming Li, Huimu Wang, Zuxu Chen, Guangtao Nie, Yiming Qiu, Guoyu Tang, Lin Liu, Jingwei Zhuo

Learning to Rank in Generative Retrieval

Generative retrieval stands out as a promising new paradigm in text retrieval that aims to generate identifier strings of relevant passages as the retrieval target. This generative paradigm taps into powerful generative language models,…

Computation and Language · Computer Science 2023-12-19 Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

Generative Retrieval as Multi-Vector Dense Retrieval

Generative retrieval generates identifiers of relevant documents in an end-to-end manner using a sequence-to-sequence architecture for a given query. The relation between generative retrieval and other retrieval methods, especially those…

Information Retrieval · Computer Science 2024-04-02 Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren

Generative Retrieval as Dense Retrieval

Generative retrieval is a promising new neural retrieval paradigm that aims to optimize the retrieval pipeline by performing both indexing and retrieval with a single transformer model. However, this new paradigm faces challenges with…

Information Retrieval · Computer Science 2023-06-21 Thong Nguyen, Andrew Yates

Autoregressive Search Engines: Generating Substrings as Document Identifiers

Knowledge-intensive language tasks require NLP systems to both provide the correct answer and retrieve supporting evidence for it in a given corpus. Autoregressive language models are emerging as the de-facto standard for generating…

Computation and Language · Computer Science 2022-04-25 Michele Bevilacqua, Giuseppe Ottaviano, Patrick Lewis, Wen-tau Yih, Sebastian Riedel, Fabio Petroni

Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search

We present Grid Beam Search (GBS), an algorithm which extends beam search to allow the inclusion of pre-specified lexical constraints. The algorithm can be used with any model that generates a sequence $ \mathbf{\hat{y}} = \{y_{0}\ldots…

Computation and Language · Computer Science 2017-05-03 Chris Hokamp, Qun Liu

Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation

Decoding from the output distributions of large language models to produce high-quality text is a complex challenge in language modeling. Various approaches, such as beam search, sampling with temperature, $k$-sampling, nucleus…

Computation and Language · Computer Science 2024-10-22 Esteban Garces Arias, Julian Rodemann, Meimingwei Li, Christian Heumann, Matthias Aßenmacher

DOGR: Leveraging Document-Oriented Contrastive Learning in Generative Retrieval

Generative retrieval constitutes an innovative approach in information retrieval, leveraging generative language models (LM) to generate a ranked list of document identifiers (docid) for a given query. It simplifies the retrieval pipeline…

Information Retrieval · Computer Science 2025-02-13 Penghao Lu, Xin Dong, Yuansheng Zhou, Lei Cheng, Chuan Yuan, Linjian Mo

Improved Constrained Generation by Bridging Pretrained Generative Models

Constrained generative modeling is fundamental to applications such as robotic control and autonomous driving, where models must respect physical laws and safety-critical constraints. In real-world settings, these constraints rarely take…

Machine Learning · Computer Science 2026-03-10 Xiaoxuan Liang, Saeid Naderiparizi, Yunpeng Liu, Berend Zwartsenberg, Frank Wood

Hierarchical corpus encoder: Fusing generative retrieval and dense indices

Generative retrieval employs sequence models for conditional generation of document IDs based on a query (DSI (Tay et al., 2022); NCI (Wang et al., 2022); inter alia). While this has led to improved performance in zero-shot retrieval, it is…

Information Retrieval · Computer Science 2025-02-27 Tongfei Chen, Ankita Sharma, Adam Pauls, Benjamin Van Durme

Coordinated Semantic Alignment and Evidence Constraints for Retrieval-Augmented Generation with Large Language Models

Retrieval augmented generation mitigates limitations of large language models in factual consistency and knowledge updating by introducing external knowledge. However, practical applications still suffer from semantic misalignment between…

Computation and Language · Computer Science 2026-03-06 Xin Chen, Saili Uday Gadgil, Jiarong Qiu

Speculative Decoding and Beyond: An In-Depth Survey of Techniques

Sequential dependencies present a fundamental bottleneck in deploying large-scale autoregressive models, particularly for real-time applications. While traditional optimization approaches like pruning and quantization often compromise model…

Computation and Language · Computer Science 2025-10-09 Yunhai Hu, Zining Liu, Zhenyuan Dong, Tianfan Peng, Bradley McDanel, Sai Qian Zhang