Related papers: Post-processing Networks: Method for Optimizing Pi…

Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems

Post-processing networks (PPNs) are components that modify the outputs of arbitrary modules in task-oriented dialogue systems and are optimized using reinforcement learning (RL) to improve the overall task completion capability of the…

Computation and Language · Computer Science 2025-02-04 Atsumoto Ohashi , Ryuichiro Higashinaka

Continuously Learning Neural Dialogue Management

We describe a two-step approach for dialogue management in task-oriented spoken dialogue systems. A unified neural network framework is proposed to enable the system to first learn by supervision from a set of dialogue data and then…

Computation and Language · Computer Science 2016-06-09 Pei-Hao Su , Milica Gasic , Nikola Mrksic , Lina Rojas-Barahona , Stefan Ultes , David Vandyke , Tsung-Hsien Wen , Steve Young

Joint System-Wise Optimization for Pipeline Goal-Oriented Dialog System

Recent work (Takanobu et al., 2020) proposed the system-wise evaluation on dialog systems and found that improvement on individual components (e.g., NLU, policy) in prior work may not necessarily bring benefit to pipeline systems in…

Computation and Language · Computer Science 2021-06-10 Zichuan Lin , Jing Huang , Bowen Zhou , Xiaodong He , Tengyu Ma

Is Your Goal-Oriented Dialog Model Performing Really Well? Empirical Analysis of System-wise Evaluation

There is a growing interest in developing goal-oriented dialog systems which serve users in accomplishing complex tasks through multi-turn conversations. Although many methods are devised to evaluate and improve the performance of…

Computation and Language · Computer Science 2020-05-18 Ryuichi Takanobu , Qi Zhu , Jinchao Li , Baolin Peng , Jianfeng Gao , Minlie Huang

Enhancing End-to-End Multi-Task Dialogue Systems: A Study on Intrinsic Motivation Reinforcement Learning Algorithms for Improved Training and Adaptability

End-to-end multi-task dialogue systems are usually designed with separate modules for the dialogue pipeline. Among these, the policy module is essential for deciding what to do in response to user input. This policy is trained by…

Computation and Language · Computer Science 2024-03-27 Navin Kamuni , Hardik Shah , Sathishkumar Chintala , Naveen Kunchakuri , Sujatha Alla Old Dominion

End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning

In this paper, we present a neural network based task-oriented dialogue system that can be optimized end-to-end with deep reinforcement learning (RL). The system is able to track dialogue state, interface with knowledge bases, and…

Computation and Language · Computer Science 2017-12-04 Bing Liu , Gokhan Tur , Dilek Hakkani-Tur , Pararth Shah , Larry Heck

Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

In this work, we present a hybrid learning method for training task-oriented dialogue systems through online user interactions. Popular methods for learning task-oriented dialogues include applying reinforcement learning with user feedback…

Computation and Language · Computer Science 2018-04-19 Bing Liu , Gokhan Tur , Dilek Hakkani-Tur , Pararth Shah , Larry Heck

End-to-End Task-Completion Neural Dialogue Systems

One of the major drawbacks of modularized task-completion dialogue systems is that each module is trained individually, which presents several challenges. For example, downstream modules are affected by earlier modules, and the performance…

Computation and Language · Computer Science 2018-02-13 Xiujun Li , Yun-Nung Chen , Lihong Li , Jianfeng Gao , Asli Celikyilmaz

Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System

Designing the dialogue policy of a spoken dialogue system involves many nontrivial choices. This paper presents a reinforcement learning approach for automatically optimizing a dialogue policy, which addresses the technical challenges in…

Machine Learning · Computer Science 2011-06-06 M. Kearns , D. Litman , S. Singh , M. Walker

Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems

Dialogue policy learning for task-oriented dialogue systems has enjoyed great progress recently mostly through employing reinforcement learning methods. However, these approaches have become very sophisticated. It is time to re-evaluate it.…

Computation and Language · Computer Science 2020-09-22 Ziming Li , Julia Kiseleva , Maarten de Rijke

Improving Multi-Domain Task-Oriented Dialogue System with Offline Reinforcement Learning

Task-oriented dialogue (TOD) system is designed to accomplish user-defined tasks through dialogues. The TOD system has progressed towards end-to-end modeling by leveraging pre-trained large language models. Fine-tuning the pre-trained…

Computation and Language · Computer Science 2024-11-11 Dharmendra Prajapat , Durga Toshniwal

Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning

Reinforcement learning is widely used for dialogue policy optimization where the reward function often consists of more than one component, e.g., the dialogue success and the dialogue length. In this work, we propose a structured method for…

Computation and Language · Computer Science 2017-07-21 Stefan Ultes , Paweł Budzianowski , Iñigo Casanueva , Nikola Mrkšić , Lina Rojas-Barahona , Pei-Hao Su , Tsung-Hsien Wen , Milica Gašić , Steve Young

Dynamic Policy Induction for Adaptive Prompt Optimization: Bridging the Efficiency-Accuracy Gap via Lightweight Reinforcement Learning

The performance of Large Language Models (LLMs) depends heavily on the chosen prompting strategy, yet static approaches such as Zero-Shot, Few-Shot, or Chain-of-Thought (CoT) impose a rigid efficiency-accuracy trade-off. Highly accurate…

Machine Learning · Computer Science 2025-10-01 Jiexi Xu

Retrieve & Memorize: Dialog Policy Learning with Multi-Action Memory

Dialogue policy learning, a subtask that determines the content of system response generation and then the degree of task completion, is essential for task-oriented dialogue systems. However, the unbalanced distribution of system actions in…

Computation and Language · Computer Science 2021-06-29 Yunhao Li , Yunyi Yang , Xiaojun Quan , Jianxing Yu

Structured Fusion Networks for Dialog

Neural dialog models have exhibited strong performance, however their end-to-end nature lacks a representation of the explicit structure of dialog. This results in a loss of generalizability, controllability and a data-hungry nature.…

Computation and Language · Computer Science 2019-07-24 Shikib Mehri , Tejas Srinivasan , Maxine Eskenazi

Optimizing Conversational Product Recommendation via Reinforcement Learning

We propose a reinforcement learning-based approach to optimize conversational strategies for product recommendation across diverse industries. As organizations increasingly adopt intelligent agents to support sales and service operations,…

Information Retrieval · Computer Science 2025-07-03 Kang Liu

A Network-based End-to-End Trainable Task-oriented Dialogue System

Teaching machines to accomplish tasks by conversing naturally with humans is challenging. Currently, developing task-oriented dialogue systems requires creating multiple components and typically this involves either a large amount of…

Computation and Language · Computer Science 2017-04-25 Tsung-Hsien Wen , David Vandyke , Nikola Mrksic , Milica Gasic , Lina M. Rojas-Barahona , Pei-Hao Su , Stefan Ultes , Steve Young

Continual Learning for Task-oriented Dialogue System with Iterative Network Pruning, Expanding and Masking

This ability to learn consecutive tasks without forgetting how to perform previously trained problems is essential for developing an online dialogue system. This paper proposes an effective continual learning for the task-oriented dialogue…

Computation and Language · Computer Science 2021-07-20 Binzong Geng , Fajie Yuan , Qiancheng Xu , Ying Shen , Ruifeng Xu , Min Yang

Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration

Persuasion dialogue systems reflect the machine's ability to make strategic moves beyond verbal communication, and therefore differentiate themselves from task-oriented or open-domain dialogue systems and have their own unique values.…

Computation and Language · Computer Science 2022-10-25 Weiyan Shi , Yu Li , Saurav Sahay , Zhou Yu

DORA: Toward Policy Optimization for Task-oriented Dialogue System with Efficient Context

Recently, reinforcement learning (RL) has been applied to task-oriented dialogue systems by using latent actions to solve shortcomings of supervised learning (SL). In this paper, we propose a multi-domain task-oriented dialogue system,…

Computation and Language · Computer Science 2021-07-08 Hyunmin Jeon , Gary Geunbae Lee