Related papers: Dialogue-oriented Pre-training

Advances in Multi-turn Dialogue Comprehension: A Survey

Training machines to understand natural language and interact with humans is an elusive and essential task of artificial intelligence. A diversity of dialogue systems has been designed with the rapid development of deep learning techniques,…

Computation and Language · Computer Science 2021-10-13 Zhuosheng Zhang, Hai Zhao

Dialogue-adaptive Language Model Pre-training From Quality Estimation

Pre-trained language models (PrLMs) have achieved great success on a wide range of natural language processing tasks by virtue of the universal language representation ability obtained by self-supervised learning on a large corpus. These…

Computation and Language · Computer Science 2022-10-21 Junlong Li, Zhuosheng Zhang, Hai Zhao

Post-Training Dialogue Summarization using Pseudo-Paraphrasing

Previous dialogue summarization techniques adapt large language models pretrained on the narrative text by injecting dialogue-specific features into the models. These features either require additional knowledge to recognize or make the…

Computation and Language · Computer Science 2022-04-29 Qi Jia, Yizhu Liu, Haifeng Tang, Kenny Q. Zhu

LERT: A Linguistically-motivated Pre-trained Language Model

Pre-trained Language Model (PLM) has become a representative foundation model in the natural language processing field. Most PLMs are trained with linguistic-agnostic pre-training tasks on the surface form of the text, such as the masked…

Computation and Language · Computer Science 2022-11-11 Yiming Cui, Wanxiang Che, Shijin Wang, Ting Liu

DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations

Language models pre-trained on general text have achieved impressive results in diverse fields. Yet, the distinct linguistic characteristics of task-oriented dialogues (TOD) compared to general text limit the practical utility of existing…

Computation and Language · Computer Science 2024-04-02 Weihao Zeng, Dayuan Fu, Keqing He, Yejie Wang, Yukai Xu, Weiran Xu

Structural Pre-training for Dialogue Comprehension

Pre-trained language models (PrLMs) have demonstrated superior performance due to their strong ability to learn universal language representations from self-supervised pre-training. However, even with the help of the powerful PrLMs, it is…

Computation and Language · Computer Science 2021-05-25 Zhuosheng Zhang, Hai Zhao

DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization

Dialogue is an essential part of human communication and cooperation. Existing research mainly focuses on short dialogue scenarios in a one-on-one fashion. However, multi-person interactions in the real world, such as meetings or…

Computation and Language · Computer Science 2022-01-07 Ming Zhong, Yang Liu, Yichong Xu, Chenguang Zhu, Michael Zeng

Response Generation with Context-Aware Prompt Learning

Pre-trained language models (PLM) have marked a huge leap in neural dialogue modeling. While PLMs are pre-trained on large-scale text corpora, they are usually fine-tuned on scarce dialogue data with specific domain knowledge and dialogue…

Computation and Language · Computer Science 2021-12-14 Xiaodong Gu, Kang Min Yoo, Sang-Woo Lee

Investigating the Impact of Pre-trained Language Models on Dialog Evaluation

Recently, there is a surge of interest in applying pre-trained language models (Pr-LM) in automatic open-domain dialog evaluation. Pr-LMs offer a promising direction for addressing the multi-domain evaluation challenge. Yet, the impact of…

Computation and Language · Computer Science 2021-11-03 Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Thomas Friedrichs, Haizhou Li

BEYOND DIALOGUE: A Profile-Dialogue Alignment Framework Towards General Role-Playing Language Model

The rapid advancement of large language models (LLMs) has revolutionized role-playing, enabling the development of general role-playing models. However, current role-playing training has two significant issues: (I) Using a predefined role…

Computation and Language · Computer Science 2025-06-10 Yeyong Yu, Runsheng Yu, Haojie Wei, Zhanqiu Zhang, Quan Qian

Channel-aware Decoupling Network for Multi-turn Dialogue Comprehension

Training machines to understand natural language and interact with humans is one of the major goals of artificial intelligence. Recent years have witnessed an evolution from matching networks to pre-trained language models (PrLMs). In…

Computation and Language · Computer Science 2023-01-12 Zhuosheng Zhang, Hai Zhao, Longxiang Liu

How Does Pretraining Improve Discourse-Aware Translation?

Pretrained language models (PLMs) have produced substantial improvements in discourse-aware neural machine translation (NMT), for example, improved coherence in spoken language translation. However, the underlying reasons for their strong…

Computation and Language · Computer Science 2023-06-01 Zhihong Huang, Longyue Wang, Siyou Liu, Derek F. Wong

Large Language Model based Situational Dialogues for Second Language Learning

In second language learning, scenario-based conversation practice is important for language learners to achieve fluency in speaking, but students often lack sufficient opportunities to practice their conversational skills with qualified…

Computation and Language · Computer Science 2024-04-01 Shuyao Xu, Long Qin, Tianyang Chen, Zhenzhou Zha, Bingxue Qiu, Weizhi Wang

Dialogue Is Not Enough to Make a Communicative BabyLM (But Neither Is Developmentally Inspired Reinforcement Learning)

We investigate whether pre-training exclusively on dialogue data results in formally and functionally apt small language models. Based on this pre-trained llamalogue model, we employ a variety of fine-tuning strategies to enforce "more…

Computation and Language · Computer Science 2025-12-02 Francesca Padovani, Bastian Bunzeck, Manar Ali, Omar Momen, Arianna Bisazza, Hendrik Buschmeier, Sina Zarrieß

Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models

Existing dialog system models require extensive human annotations and are difficult to generalize to different tasks. The recent success of large pre-trained language models such as BERT and GPT-2 (Devlin et al., 2019; Radford et al., 2019)…

Computation and Language · Computer Science 2021-04-28 Qingyang Wu, Yichi Zhang, Yu Li, Zhou Yu

Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk

Large language models (LLMs) are powerful dialogue agents, but specializing them towards fulfilling a specific function can be challenging. Instruction tuning, i.e. tuning models on instruction and sample responses generated by humans…

Computation and Language · Computer Science 2024-01-11 Dennis Ulmer, Elman Mansimov, Kaixiang Lin, Justin Sun, Xibin Gao, Yi Zhang

Pre-Trained Language Models for Interactive Decision-Making

Language model (LM) pre-training is useful in many language processing tasks. But can pre-trained LMs be further leveraged for more general machine learning problems? We propose an approach for using LMs to scaffold learning and…

Machine Learning · Computer Science 2022-11-01 Shuang Li, Xavier Puig, Chris Paxton, Yilun Du, Clinton Wang, Linxi Fan, Tao Chen, De-An Huang, Ekin Akyürek, Anima Anandkumar, Jacob Andreas, Igor Mordatch, Antonio Torralba, Yuke Zhu

An Overview on Language Models: Recent Developments and Outlook

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei, Yun-Cheng Wang, Bin Wang, C.-C. Jay Kuo

Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations

Large language models (LLMs) have emerged as powerful and general solutions to many natural language tasks. However, many of the most important applications of language generation are interactive, where an agent has to talk to a person to…

Machine Learning · Computer Science 2023-11-10 Joey Hong, Sergey Levine, Anca Dragan