English
Related papers

Related papers: Nonparametric Decoding for Generative Retrieval

200 papers

A common practice for text retrieval is to use an encoder to map the documents and the query to a common vector space and perform a nearest neighbor search (NNS); multi-hop retrieval also often adopts the same paradigm, usually with a…

Information Retrieval · Computer Science 2022-10-18 Hyunji Lee , Sohee Yang , Hanseok Oh , Minjoon Seo

While dense retrieval models, which embed queries and documents into a shared low-dimensional space, have gained widespread popularity, they were shown to exhibit important theoretical limitations and considerably lag behind traditional…

Information Retrieval · Computer Science 2026-04-09 Adrian Bracher , Svitlana Vakulenko

Current advances in Natural Language Processing (NLP) have made it increasingly feasible to build applications leveraging textual data. Generally, the core of these applications rely on having a good semantic representation of text into…

Computation and Language · Computer Science 2024-10-21 Thomas Uriot

Most existing word embedding methods can be categorized into Neural Embedding Models and Matrix Factorization (MF)-based methods. However some models are opaque to probabilistic interpretation, and MF-based methods, typically solved using…

Computation and Language · Computer Science 2015-08-18 Shaohua Li , Jun Zhu , Chunyan Miao

Generative retrieval seeks to replace traditional search index data structures with a single large-scale neural network, offering the potential for improved efficiency and seamless integration with generative large language models. As an…

Information Retrieval · Computer Science 2025-04-15 Shiguang Wu , Zhaochun Ren , Xin Xin , Jiyuan Yang , Mengqi Zhang , Zhumin Chen , Maarten de Rijke , Pengjie Ren

Novel architectures have recently improved generative image synthesis leading to excellent visual quality in various tasks. Much of this success is due to the scalability of these architectures and hence caused by a dramatic increase in…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Andreas Blattmann , Robin Rombach , Kaan Oktay , Jonas Müller , Björn Ommer

Neural Networks trained with gradient descent are known to be susceptible to catastrophic forgetting caused by parameter shift during the training process. In the context of Neural Machine Translation (NMT) this results in poor performance…

Computation and Language · Computer Science 2019-06-20 Ankur Bapna , Orhan Firat

Recent work has proposed several generative neural models for constituency parsing that achieve state-of-the-art results. Since direct search in these generative models is difficult, they have primarily been used to rescore candidate…

Computation and Language · Computer Science 2017-07-12 Daniel Fried , Mitchell Stern , Dan Klein

Large Language Models (LLMs) have achieved impressive progress in natural language processing, but their limited ability to retain long-term context constrains performance on document-level or multi-turn tasks. Retrieval-Augmented…

Computation and Language · Computer Science 2025-05-20 Zhangyu Wang , Siyuan Gao , Rong Zhou , Hao Wang , Li Ning

Generative retrieval models encode pointers to information in a corpus as an index within the model's parameters. These models serve as part of a larger pipeline, where retrieved information conditions generation for knowledge-intensive NLP…

Information Retrieval · Computer Science 2024-02-26 EuiYul Song , Sangryul Kim , Haeju Lee , Joonkee Kim , James Thorne

Extracting dense representations for terms and phrases is a task of great importance for knowledge discovery platforms targeting highly-technical fields. Dense representations are used as features for downstream components and have multiple…

Computation and Language · Computer Science 2023-05-26 Francesco Fusco , Diego Antognini

Efficient text embedding is crucial for large-scale natural language processing (NLP) applications, where storage and computational efficiency are key concerns. In this paper, we explore how using binary representations (barcodes) instead…

Computation and Language · Computer Science 2025-07-24 Soumen Sinha , Shahryar Rahnamayan , Azam Asilian Bidgoli

Dense document embeddings are central to neural retrieval. The dominant paradigm is to train and construct embeddings by running encoders directly on individual documents. In this work, we argue that these embeddings, while effective, are…

Computation and Language · Computer Science 2024-11-11 John X. Morris , Alexander M. Rush

Neural retrieval models have superseded classic bag-of-words methods such as BM25 as the retrieval framework of choice. However, neural systems lack the interpretability of bag-of-words models; it is not trivial to connect a query change to…

In retrieval applications, binary hashes are known to offer significant improvements in terms of both memory and speed. We investigate the compression of sentence embeddings using a neural encoder-decoder architecture, which is trained by…

Information Retrieval · Computer Science 2019-08-16 Felix Hamann , Nadja Kurz , Adrian Ulges

Tensor decomposition of high-dimensional data often struggles to capture semantically or physically meaningful structures, particularly when relying on reconstruction objectives and fixed-rank constraints. We introduce a no-rank tensor…

Machine Learning · Computer Science 2026-03-03 Maryam Bagherian

Generative neural models have recently achieved state-of-the-art results for constituency parsing. However, without a feasible search procedure, their use has so far been limited to reranking the output of external parsers in which decoding…

Computation and Language · Computer Science 2017-07-31 Mitchell Stern , Daniel Fried , Dan Klein

Generative retrieval (GR) has emerged as a new paradigm in neural information retrieval, offering an alternative to dense retrieval (DR) by directly generating identifiers of relevant documents. In this paper, we theoretically and…

Information Retrieval · Computer Science 2025-11-12 Yingchen Zhang , Ruqing Zhang , Jiafeng Guo , Maarten de Rijke , Yixing Fan , Xueqi Cheng

Embedding models play a pivot role in modern NLP applications such as IR and RAG. While the context limit of LLMs has been pushed beyond 1 million tokens, embedding models are still confined to a narrow context window not exceeding 8k…

Computation and Language · Computer Science 2024-11-08 Dawei Zhu , Liang Wang , Nan Yang , Yifan Song , Wenhao Wu , Furu Wei , Sujian Li

In this paper, we present a generative retrieval method for sponsored search engine, which uses neural machine translation (NMT) to generate keywords directly from query. This method is completely end-to-end, which skips query rewriting and…

‹ Prev 1 2 3 10 Next ›