Related papers: Nonparametric Decoding for Generative Retrieval

Generative Multi-hop Retrieval

A common practice for text retrieval is to use an encoder to map the documents and the query to a common vector space and perform a nearest neighbor search (NNS); multi-hop retrieval also often adopts the same paradigm, usually with a…

Information Retrieval · Computer Science 2022-10-18 Hyunji Lee, Sohee Yang, Hanseok Oh, Minjoon Seo

Generative Retrieval Overcomes Limitations of Dense Retrieval but Struggles with Identifier Ambiguity

While dense retrieval models, which embed queries and documents into a shared low-dimensional space, have gained widespread popularity, they were shown to exhibit important theoretical limitations and considerably lag behind traditional…

Information Retrieval · Computer Science 2026-04-09 Adrian Bracher, Svitlana Vakulenko

On Debiasing Text Embeddings Through Context Injection

Current advances in Natural Language Processing (NLP) have made it increasingly feasible to build applications leveraging textual data. Generally, the core of these applications relies on having a good semantic representation of text into…

Computation and Language · Computer Science 2024-10-21 Thomas Uriot

A Generative Word Embedding Model and its Low Rank Positive Semidefinite Solution

Most existing word embedding methods can be categorized into Neural Embedding Models and Matrix Factorization (MF)-based methods. However, some models are opaque to probabilistic interpretation, and MF-based methods, typically solved using…

Computation and Language · Computer Science 2015-08-18 Shaohua Li, Jun Zhu, Chunyan Miao

Constrained Auto-Regressive Decoding Constrains Generative Retrieval

Generative retrieval seeks to replace traditional search index data structures with a single large-scale neural network, offering the potential for improved efficiency and seamless integration with generative large language models. As an…

Information Retrieval · Computer Science 2025-04-15 Shiguang Wu, Zhaochun Ren, Xin Xin, Jiyuan Yang, Mengqi Zhang, Zhumin Chen, Maarten de Rijke, Pengjie Ren

Semi-Parametric Neural Image Synthesis

Novel architectures have recently improved generative image synthesis leading to excellent visual quality in various tasks. Much of this success is due to the scalability of these architectures and hence caused by a dramatic increase in…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Andreas Blattmann, Robin Rombach, Kaan Oktay, Jonas Müller, Björn Ommer

Non-Parametric Adaptation for Neural Machine Translation

Neural Networks trained with gradient descent are known to be susceptible to catastrophic forgetting caused by parameter shift during the training process. In the context of Neural Machine Translation (NMT) this results in poor performance…

Computation and Language · Computer Science 2019-06-20 Ankur Bapna, Orhan Firat

Improving Neural Parsing by Disentangling Model Combination and Reranking Effects

Recent work has proposed several generative neural models for constituency parsing that achieve state-of-the-art results. Since direct search in these generative models is difficult, they have primarily been used to rescore candidate…

Computation and Language · Computer Science 2017-07-12 Daniel Fried, Mitchell Stern, Dan Klein

ELITE: Embedding-Less retrieval with Iterative Text Exploration

Large Language Models (LLMs) have achieved impressive progress in natural language processing, but their limited ability to retain long-term context constrains performance on document-level or multi-turn tasks. Retrieval-Augmented…

Computation and Language · Computer Science 2025-05-20 Zhangyu Wang, Siyuan Gao, Rong Zhou, Hao Wang, Li Ning

Re3val: Reinforced and Reranked Generative Retrieval

Generative retrieval models encode pointers to information in a corpus as an index within the model's parameters. These models serve as part of a larger pipeline, where retrieved information conditions generation for knowledge-intensive NLP…

Information Retrieval · Computer Science 2024-02-26 EuiYul Song, Sangryul Kim, Haeju Lee, Joonkee Kim, James Thorne

Extracting Text Representations for Terms and Phrases in Technical Domains

Extracting dense representations for terms and phrases is a task of great importance for knowledge discovery platforms targeting highly-technical fields. Dense representations are used as features for downstream components and have multiple…

Computation and Language · Computer Science 2023-05-26 Francesco Fusco, Diego Antognini

Evolutionary Feature-wise Thresholding for Binary Representation of NLP Embeddings

Efficient text embedding is crucial for large-scale natural language processing (NLP) applications, where storage and computational efficiency are key concerns. In this paper, we explore how using binary representations (barcodes) instead…

Computation and Language · Computer Science 2025-07-24 Soumen Sinha, Shahryar Rahnamayan, Azam Asilian Bidgoli

Contextual Document Embeddings

Dense document embeddings are central to neural retrieval. The dominant paradigm is to train and construct embeddings by running encoders directly on individual documents. In this work, we argue that these embeddings, while effective, are…

Computation and Language · Computer Science 2024-11-11 John X. Morris, Alexander M. Rush

Decoding a Neural Retriever's Latent Space for Query Suggestion

Neural retrieval models have superseded classic bag-of-words methods such as BM25 as the retrieval framework of choice. However, neural systems lack the interpretability of bag-of-words models; it is not trivial to connect a query change to…

Computation and Language · Computer Science 2022-10-24 Leonard Adolphs, Michelle Chen Huebscher, Christian Buck, Sertan Girgin, Olivier Bachem, Massimiliano Ciaramita, Thomas Hofmann

Hamming Sentence Embeddings for Information Retrieval

In retrieval applications, binary hashes are known to offer significant improvements in terms of both memory and speed. We investigate the compression of sentence embeddings using a neural encoder-decoder architecture, which is trained by…

Information Retrieval · Computer Science 2019-08-16 Felix Hamann, Nadja Kurz, Adrian Ulges

No-Rank Tensor Decomposition Using Metric Learning

Tensor decomposition of high-dimensional data often struggles to capture semantically or physically meaningful structures, particularly when relying on reconstruction objectives and fixed-rank constraints. We introduce a no-rank tensor…

Machine Learning · Computer Science 2026-03-03 Maryam Bagherian

Effective Inference for Generative Neural Parsing

Generative neural models have recently achieved state-of-the-art results for constituency parsing. However, without a feasible search procedure, their use has so far been limited to reranking the output of external parsers in which decoding…

Computation and Language · Computer Science 2017-07-31 Mitchell Stern, Daniel Fried, Dan Klein

Does Generative Retrieval Overcome the Limitations of Dense Retrieval?

Generative retrieval (GR) has emerged as a new paradigm in neural information retrieval, offering an alternative to dense retrieval (DR) by directly generating identifiers of relevant documents. In this paper, we theoretically and…

Information Retrieval · Computer Science 2025-11-12 Yingchen Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng

LongEmbed: Extending Embedding Models for Long Context Retrieval

Embedding models play a pivotal role in modern NLP applications such as IR and RAG. While the context limit of LLMs has been pushed beyond 1 million tokens, embedding models are still confined to a narrow context window not exceeding 8k…

Computation and Language · Computer Science 2024-11-08 Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li

An end-to-end Generative Retrieval Method for Sponsored Search Engine --Decoding Efficiently into a Closed Target Domain

In this paper, we present a generative retrieval method for sponsored search engines, which uses neural machine translation (NMT) to generate keywords directly from the query. This method is completely end-to-end, which skips query rewriting and…

Information Retrieval · Computer Science 2019-03-19 Yijiang Lian, Zhijie Chen, Jinlong Hu, Kefeng Zhang, Chunwei Yan, Muchenxuan Tong, Wenying Han, Hanju Guan, Ying Li, Ying Cao, Yang Yu, Zhigang Li, Xiaochun Liu, Yue Wang