Related papers: Paraphrase Augmented Task-Oriented Dialog Generati…

Contextual Data Augmentation for Task-Oriented Dialog Systems

Collection of annotated dialogs for training task-oriented dialog systems has been one of the key bottlenecks in improving current models. While dialog response generation has been widely studied on the agent side, it is not evident if…

Computation and Language · Computer Science 2023-10-17 Dustin Axman, Avik Ray, Shubham Garg, Jing Huang

Quick Starting Dialog Systems with Paraphrase Generation

Acquiring training data to improve the robustness of dialog systems can be a painstakingly long process. In this work, we propose a method to reduce the cost and effort of creating new conversational agents by artificially generating more…

Computation and Language · Computer Science 2022-05-05 Louis Marceau, Raouf Belbahar, Marc Queudot, Nada Naji, Eric Charton, Marie-Jean Meurs

ParetoRAG: Leveraging Sentence-Context Attention for Robust and Efficient Retrieval-Augmented Generation

While Retrieval-Augmented Generation (RAG) systems enhance Large Language Models (LLMs) by incorporating external knowledge, they still face persistent challenges in retrieval inefficiency and the inability of LLMs to filter out irrelevant…

Computation and Language · Computer Science 2025-02-13 Ruobing Yao, Yifei Zhang, Shuang Song, Yuhua Liu, Neng Gao, Chenyang Tu

Unsupervised Syntactically Controlled Paraphrase Generation with Abstract Meaning Representations

Syntactically controlled paraphrase generation has become an emerging research direction in recent years. Most existing approaches require annotated paraphrase pairs for training and are thus costly to extend to new domains. Unsupervised…

Computation and Language · Computer Science 2022-11-03 Kuan-Hao Huang, Varun Iyer, Anoop Kumar, Sriram Venkatapathy, Kai-Wei Chang, Aram Galstyan

Diversifying Task-oriented Dialogue Response Generation with Prototype Guided Paraphrasing

Existing methods for Dialogue Response Generation (DRG) in Task-oriented Dialogue Systems (TDSs) can be grouped into two categories: template-based and corpus-based. The former prepare a collection of response templates in advance and fill…

Computation and Language · Computer Science 2020-08-11 Phillip Lippe, Pengjie Ren, Hinda Haned, Bart Voorn, Maarten de Rijke

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks. However, their ability to access and precisely manipulate…

Computation and Language · Computer Science 2021-04-13 Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

AARGH! End-to-end Retrieval-Generation for Task-Oriented Dialog

We introduce AARGH, an end-to-end task-oriented dialog system combining retrieval and generative approaches in a single model, aiming at improving dialog management and lexical diversity of outputs. The model features a new response…

Computation and Language · Computer Science 2022-09-27 Tomáš Nekvinda, Ondřej Dušek

Paraphrase Generation with Deep Reinforcement Learning

Automatic generation of paraphrases from a given sentence is an important yet challenging task in natural language processing (NLP), and plays a key role in a number of applications such as question answering, search, and dialogue. In this…

Computation and Language · Computer Science 2018-08-24 Zichao Li, Xin Jiang, Lifeng Shang, Hang Li

A Tailored Pre-Training Model for Task-Oriented Dialog Generation

The recent success of large pre-trained language models such as BERT and GPT-2 has suggested the effectiveness of incorporating language priors in downstream dialog generation tasks. However, the performance of pre-trained models on the…

Computation and Language · Computer Science 2020-04-30 Jing Gu, Qingyang Wu, Chongruo Wu, Weiyan Shi, Zhou Yu

Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering

Paraphrase generation is an important task in natural language processing. Previous works focus on sentence-level paraphrase generation, while ignoring document-level paraphrase generation, which is a more challenging and valuable task. In…

Computation and Language · Computer Science 2021-09-16 Zhe Lin, Yitao Cai, Xiaojun Wan

Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by retrieving relevant documents from external sources and incorporating them into the context. While it improves reliability by providing factual texts, it…

Computation and Language · Computer Science 2025-05-07 Yuqiao Tan, Shizhu He, Huanxuan Liao, Jun Zhao, Kang Liu

Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models

Existing dialog system models require extensive human annotations and are difficult to generalize to different tasks. The recent success of large pre-trained language models such as BERT and GPT-2 (Devlin et al., 2019; Radford et al., 2019)…

Computation and Language · Computer Science 2021-04-28 Qingyang Wu, Yichi Zhang, Yu Li, Zhou Yu

Persona-Aware Alignment Framework for Personalized Dialogue Generation

Personalized dialogue generation aims to leverage persona profiles and dialogue history to generate persona-relevant and consistent responses. Mainstream models typically rely on token-level language model training with persona dialogue…

Computation and Language · Computer Science 2025-11-14 Guanrong Li, Xinyu Liu, Zhen Wu, Xinyu Dai

ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation

Paraphrase generation is a long-standing task in natural language processing (NLP). Supervised paraphrase generation models, which rely on human-annotated paraphrase pairs, are cost-inefficient and hard to scale up. On the other hand,…

Computation and Language · Computer Science 2023-05-29 Kuan-Hao Huang, Varun Iyer, I-Hung Hsu, Anoop Kumar, Kai-Wei Chang, Aram Galstyan

TAdaRAG: Task Adaptive Retrieval-Augmented Generation via On-the-Fly Knowledge Graph Construction

Retrieval-Augmented Generation (RAG) improves large language models by retrieving external knowledge, often truncated into smaller chunks due to the input context window, which leads to information loss, resulting in response hallucinations…

Computation and Language · Computer Science 2025-11-18 Jie Zhang, Bo Tang, Wanzi Shao, Wenqiang Wei, Jihao Zhao, Jianqing Zhu, Zhiyu Li, Wen Xi, Zehao Lin, Feiyu Xiong, Yanchao Tan

Learning towards Selective Data Augmentation for Dialogue Generation

As it is cumbersome and expensive to acquire a huge amount of data for training neural dialog models, data augmentation is proposed to effectively utilize existing training samples. However, current data augmentation techniques on the…

Computation and Language · Computer Science 2023-03-20 Xiuying Chen, Mingzhe Li, Jiayi Zhang, Xiaoqiang Xia, Chen Wei, Jianwei Cui, Xin Gao, Xiangliang Zhang, Rui Yan

Context Matters in Semantically Controlled Language Generation for Task-oriented Dialogue Systems

This work combines information about the dialogue history encoded by a pre-trained model with a meaning representation of the current system utterance to realize contextual language generation in task-oriented dialogues. We utilize the…

Computation and Language · Computer Science 2021-11-30 Ye Liu, Wolfgang Maier, Wolfgang Minker, Stefan Ultes

ConRPG: Paraphrase Generation using Contexts as Regularizer

A long-standing issue with paraphrase generation is how to obtain reliable supervision signals. In this paper, we propose an unsupervised paradigm for paraphrase generation based on the assumption that the probabilities of generating two…

Computation and Language · Computer Science 2021-09-02 Yuxian Meng, Xiang Ao, Qing He, Xiaofei Sun, Qinghong Han, Fei Wu, Chun Fan, Jiwei Li

TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework

Retrieval-Augmented Generation (RAG) utilizes external knowledge to augment Large Language Models' (LLMs) reliability. For flexibility, agentic RAG employs autonomous, multi-round retrieval and reasoning to resolve queries. Although recent…

Information Retrieval · Computer Science 2025-11-10 Chao Zhang, Yuhao Wang, Derong Xu, Haoxin Zhang, Yuanjie Lyu, Yuhao Chen, Shuochen Liu, Tong Xu, Xiangyu Zhao, Yan Gao, Yao Hu, Enhong Chen

A Survey on Retrieval-Augmented Text Generation for Large Language Models

Retrieval-Augmented Generation (RAG) merges retrieval methods with deep learning advancements to address the static limitations of large language models (LLMs) by enabling the dynamic integration of up-to-date external information. This…

Information Retrieval · Computer Science 2026-05-19 Yizheng Huang, Jimmy Huang