English
Related papers

Related papers: Dialogue-oriented Pre-training

200 papers

Training machines to understand natural language and interact with humans is an elusive and essential task of artificial intelligence. A diversity of dialogue systems has been designed with the rapid development of deep learning techniques,…

Computation and Language · Computer Science 2021-10-13 Zhuosheng Zhang , Hai Zhao

Training machines to understand natural language and interact with humans is an elusive and essential task of artificial intelligence. A diversity of dialogue systems has been designed with the rapid development of deep learning techniques,…

Computation and Language · Computer Science 2021-10-14 Zhuosheng Zhang , Hai Zhao

Pre-trained language models (PrLMs) have achieved great success on a wide range of natural language processing tasks by virtue of the universal language representation ability obtained by self-supervised learning on a large corpus. These…

Computation and Language · Computer Science 2022-10-21 Junlong Li , Zhuosheng Zhang , Hai Zhao

Previous dialogue summarization techniques adapt large language models pretrained on the narrative text by injecting dialogue-specific features into the models. These features either require additional knowledge to recognize or make the…

Computation and Language · Computer Science 2022-04-29 Qi Jia , Yizhu Liu , Haifeng Tang , Kenny Q. Zhu

Pre-trained Language Model (PLM) has become a representative foundation model in the natural language processing field. Most PLMs are trained with linguistic-agnostic pre-training tasks on the surface form of the text, such as the masked…

Computation and Language · Computer Science 2022-11-11 Yiming Cui , Wanxiang Che , Shijin Wang , Ting Liu

Language models pre-trained on general text have achieved impressive results in diverse fields. Yet, the distinct linguistic characteristics of task-oriented dialogues (TOD) compared to general text limit the practical utility of existing…

Computation and Language · Computer Science 2024-04-02 Weihao Zeng , Dayuan Fu , Keqing He , Yejie Wang , Yukai Xu , Weiran Xu

Pre-trained language models (PrLMs) have demonstrated superior performance due to their strong ability to learn universal language representations from self-supervised pre-training. However, even with the help of the powerful PrLMs, it is…

Computation and Language · Computer Science 2021-05-25 Zhuosheng Zhang , Hai Zhao

Dialogue is an essential part of human communication and cooperation. Existing research mainly focuses on short dialogue scenarios in a one-on-one fashion. However, multi-person interactions in the real world, such as meetings or…

Computation and Language · Computer Science 2022-01-07 Ming Zhong , Yang Liu , Yichong Xu , Chenguang Zhu , Michael Zeng

Pre-trained language models (PLM) have marked a huge leap in neural dialogue modeling. While PLMs are pre-trained on large-scale text corpora, they are usually fine-tuned on scarce dialogue data with specific domain knowledge and dialogue…

Computation and Language · Computer Science 2021-12-14 Xiaodong Gu , Kang Min Yoo , Sang-Woo Lee

Recently, there is a surge of interest in applying pre-trained language models (Pr-LM) in automatic open-domain dialog evaluation. Pr-LMs offer a promising direction for addressing the multi-domain evaluation challenge. Yet, the impact of…

Computation and Language · Computer Science 2021-11-03 Chen Zhang , Luis Fernando D'Haro , Yiming Chen , Thomas Friedrichs , Haizhou Li

The rapid advancement of large language models (LLMs) has revolutionized role-playing, enabling the development of general role-playing models. However, current role-playing training has two significant issues: (I) Using a predefined role…

Computation and Language · Computer Science 2025-06-10 Yeyong Yu , Runsheng Yu , Haojie Wei , Zhanqiu Zhang , Quan Qian

Training machines to understand natural language and interact with humans is one of the major goals of artificial intelligence. Recent years have witnessed an evolution from matching networks to pre-trained language models (PrLMs). In…

Computation and Language · Computer Science 2023-01-12 Zhuosheng Zhang , Hai Zhao , Longxiang Liu

Pretrained language models (PLMs) have produced substantial improvements in discourse-aware neural machine translation (NMT), for example, improved coherence in spoken language translation. However, the underlying reasons for their strong…

Computation and Language · Computer Science 2023-06-01 Zhihong Huang , Longyue Wang , Siyou Liu , Derek F. Wong

In second language learning, scenario-based conversation practice is important for language learners to achieve fluency in speaking, but students often lack sufficient opportunities to practice their conversational skills with qualified…

Computation and Language · Computer Science 2024-04-01 Shuyao Xu , Long Qin , Tianyang Chen , Zhenzhou Zha , Bingxue Qiu , Weizhi Wang

We investigate whether pre-training exclusively on dialogue data results in formally and functionally apt small language models. Based on this pre-trained llamalogue model, we employ a variety of fine-tuning strategies to enforce "more…

Computation and Language · Computer Science 2025-12-02 Francesca Padovani , Bastian Bunzeck , Manar Ali , Omar Momen , Arianna Bisazza , Hendrik Buschmeier , Sina Zarrieß

Existing dialog system models require extensive human annotations and are difficult to generalize to different tasks. The recent success of large pre-trained language models such as BERT and GPT-2 (Devlin et al., 2019; Radford et al., 2019)…

Computation and Language · Computer Science 2021-04-28 Qingyang Wu , Yichi Zhang , Yu Li , Zhou Yu

Large language models (LLMs) are powerful dialogue agents, but specializing them towards fulfilling a specific function can be challenging. Instructing tuning, i.e. tuning models on instruction and sample responses generated by humans…

Computation and Language · Computer Science 2024-01-11 Dennis Ulmer , Elman Mansimov , Kaixiang Lin , Justin Sun , Xibin Gao , Yi Zhang

Language model (LM) pre-training is useful in many language processing tasks. But can pre-trained LMs be further leveraged for more general machine learning problems? We propose an approach for using LMs to scaffold learning and…

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei , Yun-Cheng Wang , Bin Wang , C. -C. Jay Kuo

Large language models (LLMs) have emerged as powerful and general solutions to many natural language tasks. However, many of the most important applications of language generation are interactive, where an agent has to talk to a person to…

Machine Learning · Computer Science 2023-11-10 Joey Hong , Sergey Levine , Anca Dragan
‹ Prev 1 2 3 10 Next ›