Related papers: Multi-Task Retrieval-Augmented Text Generation wit…

FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation

Retrieval-augmented generation models offer many benefits over standalone language models: besides a textual answer to a given query they provide provenance items retrieved from an updateable knowledge base. However, they are also more…

Computation and Language · Computer Science 2022-09-29 Sebastian Hofstätter , Jiecao Chen , Karthik Raman , Hamed Zamani

Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks

Retrieval-augmented generation models have shown state-of-the-art performance across many knowledge-intensive NLP tasks such as open question answering and fact verification. These models are trained to generate the final output given the…

Computation and Language · Computer Science 2022-05-17 Akari Asai , Matt Gardner , Hannaneh Hajishirzi

Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System

Developing an efficient retriever to retrieve knowledge from a large-scale knowledge base (KB) is critical for task-oriented dialogue systems to effectively handle localized and specialized tasks. However, widely used generative models such…

Computation and Language · Computer Science 2023-10-23 Weizhou Shen , Yingqi Gao , Canbin Huang , Fanqi Wan , Xiaojun Quan , Wei Bi

RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning

Retrieval-augmented generation (RAG) systems rely on retrieval models for identifying relevant contexts and answer generation models for utilizing those contexts. However, retrievers exhibit imperfect recall and precision, limiting…

Computation and Language · Computer Science 2026-04-29 Jerry Huang , Siddarth Madala , Risham Sidhu , Cheng Niu , Hao Peng , Julia Hockenmaier , Tong Zhang

Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems

Efficient knowledge retrieval plays a pivotal role in ensuring the success of end-to-end task-oriented dialogue systems by facilitating the selection of relevant information necessary to fulfill user requests. However, current approaches…

Computation and Language · Computer Science 2023-10-24 Tianyuan Shi , Liangzhi Li , Zijian Lin , Tao Yang , Xiaojun Quan , Qifan Wang

A Unified Generative Retriever for Knowledge-Intensive Language Tasks via Prompt Learning

Knowledge-intensive language tasks (KILTs) benefit from retrieving high-quality relevant contexts from large external knowledge corpora. Learning task-specific retrievers that return relevant contexts at an appropriate level of semantic…

Information Retrieval · Computer Science 2023-05-01 Jiangui Chen , Ruqing Zhang , Jiafeng Guo , Maarten de Rijke , Yiqun Liu , Yixing Fan , Xueqi Cheng

Efficient Retrieval Augmented Generation from Unstructured Knowledge for Task-Oriented Dialog

This paper summarizes our work on the first track of the ninth Dialog System Technology Challenge (DSTC 9), "Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access". The goal of the task is to generate…

Computation and Language · Computer Science 2021-02-10 David Thulke , Nico Daheim , Christian Dugast , Hermann Ney

Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration

In this paper, we tackle the task of distractor generation (DG) for multiple-choice questions. Our study introduces two key designs. First, we propose \textit{retrieval augmented pretraining}, which involves refining the language model…

Computation and Language · Computer Science 2024-06-21 Han-Cheng Yu , Yu-An Shih , Kin-Man Law , Kai-Yu Hsieh , Yu-Chen Cheng , Hsin-Chih Ho , Zih-An Lin , Wen-Chuan Hsu , Yao-Chung Fan

Retrieval-Generation Synergy Augmented Large Language Models

Large language models augmented with task-relevant documents have demonstrated impressive performance on knowledge-intensive tasks. However, regarding how to obtain effective documents, the existing methods are mainly divided into two…

Computation and Language · Computer Science 2023-10-10 Zhangyin Feng , Xiaocheng Feng , Dezhi Zhao , Maojin Yang , Bing Qin

TARGET: Benchmarking Table Retrieval for Generative Tasks

The data landscape is rich with structured data, often of high value to organizations, driving important applications in data analysis and machine learning. Recent progress in representation learning and generative models for such data has…

Information Retrieval · Computer Science 2025-05-20 Xingyu Ji , Parker Glenn , Aditya G. Parameswaran , Madelon Hulsebos

KILT: a Benchmark for Knowledge Intensive Language Tasks

Challenging problems such as open-domain question answering, fact checking, slot filling and entity linking require access to large, external knowledge sources. While some models do well on individual tasks, developing general models is…

Computation and Language · Computer Science 2021-05-28 Fabio Petroni , Aleksandra Piktus , Angela Fan , Patrick Lewis , Majid Yazdani , Nicola De Cao , James Thorne , Yacine Jernite , Vladimir Karpukhin , Jean Maillard , Vassilis Plachouras , Tim Rocktäschel , Sebastian Riedel

A Survey on Retrieval-Augmented Text Generation

Recently, retrieval-augmented text generation attracted increasing attention of the computational linguistics community. Compared with conventional generation models, retrieval-augmented text generation has remarkable advantages and…

Computation and Language · Computer Science 2022-02-15 Huayang Li , Yixuan Su , Deng Cai , Yan Wang , Lemao Liu

Chunk Knowledge Generation Model for Enhanced Information Retrieval: A Multi-task Learning Approach

Traditional query expansion techniques for addressing vocabulary mismatch problems in information retrieval are context-sensitive and may lead to performance degradation. As an alternative, document expansion research has gained attention,…

Information Retrieval · Computer Science 2025-09-22 Jisu Kim , Jinhee Park , Changhyun Jeon , Jungwoo Choi , Keonwoo Kim , Minji Hong , Sehyun Kim

RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling

Recent advances in large-scale pre-training such as GPT-3 allow seemingly high quality text to be generated from a given prompt. However, such generation systems often suffer from problems of hallucinated facts, and are not inherently…

Computation and Language · Computer Science 2022-02-25 Yizhe Zhang , Siqi Sun , Xiang Gao , Yuwei Fang , Chris Brockett , Michel Galley , Jianfeng Gao , Bill Dolan

Task Prototype-Based Knowledge Retrieval for Multi-Task Learning from Partially Annotated Data

Multi-task learning (MTL) is critical in real-world applications such as autonomous driving and robotics, enabling simultaneous handling of diverse tasks. However, obtaining fully annotated data for all tasks is impractical due to labeling…

Machine Learning · Computer Science 2026-01-13 Youngmin Oh , Hyung-Il Kim , Jung Uk Kim

A Multi-Task Embedder For Retrieval Augmented LLMs

LLMs confront inherent limitations in terms of its knowledge, memory, and action. The retrieval augmentation stands as a vital mechanism to address these limitations, which brings in useful information from external sources to augment the…

Information Retrieval · Computer Science 2026-01-06 Peitian Zhang , Shitao Xiao , Zheng Liu , Zhicheng Dou , Jian-Yun Nie

Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering

Retrieval-augmented generation models augment knowledge encoded in a language model by providing additional relevant external knowledge (context) during generation. Although it has been shown that the quantity and quality of context impact…

Computation and Language · Computer Science 2024-03-22 Kosuke Akimoto , Kunihiro Takeoka , Masafumi Oyamada

Retrieval-augmented Multi-label Text Classification

Multi-label text classification (MLC) is a challenging task in settings of large label sets, where label support follows a Zipfian distribution. In this paper, we address this problem through retrieval augmentation, aiming to improve the…

Computation and Language · Computer Science 2023-05-23 Ilias Chalkidis , Yova Kementchedjhieva

Is Relevance Propagated from Retriever to Generator in RAG?

Retrieval Augmented Generation (RAG) is a framework for incorporating external knowledge, usually in the form of a set of documents retrieved from a collection, as a part of a prompt to a large language model (LLM) to potentially improve…

Information Retrieval · Computer Science 2025-02-24 Fangzheng Tian , Debasis Ganguly , Craig Macdonald

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Large language models are powerful text processors and reasoners, but are still subject to limitations including outdated knowledge and hallucinations, which necessitates connecting them to the world. Retrieval-augmented large language…

Computation and Language · Computer Science 2023-10-24 Zhihong Shao , Yeyun Gong , Yelong Shen , Minlie Huang , Nan Duan , Weizhu Chen