Related papers: Ensemble-Based Deep Reinforcement Learning for Cha…

Deep Reinforcement Learning for Chatbots Using Clustered Actions and Human-Likeness Rewards

Training chatbots using the reinforcement learning paradigm is challenging due to high-dimensional states, infinite action spaces and the difficulty in specifying the reward function. We address such problems using clustered actions instead…

Artificial Intelligence · Computer Science 2019-08-28 Heriberto Cuayáhuitl , Donghyeon Lee , Seonghan Ryu , Sungja Choi , Inchul Hwang , Jihie Kim

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management

Reinforcement learning (RL) has shown great promise for developing dialogue management (DM) agents that are non-myopic, conduct rich conversations, and maximize overall user satisfaction. Despite recent developments in RL and language…

Machine Learning · Computer Science 2023-10-31 Dhawal Gupta , Yinlam Chow , Aza Tulepbergenov , Mohammad Ghavamzadeh , Craig Boutilier

Strategic Dialogue Management via Deep Reinforcement Learning

Artificially intelligent agents equipped with strategic skills that can negotiate during their interactions with other natural or artificial agents are still underdeveloped. This paper describes a successful application of Deep…

Artificial Intelligence · Computer Science 2015-11-28 Heriberto Cuayáhuitl , Simon Keizer , Oliver Lemon

Deep Reinforcement Learning for Dialogue Generation

Recent neural models of dialogue generation offer great promise for generating responses for conversational agents, but tend to be shortsighted, predicting utterances one at a time while ignoring their influence on future outcomes. Modeling…

Computation and Language · Computer Science 2016-09-30 Jiwei Li , Will Monroe , Alan Ritter , Michel Galley , Jianfeng Gao , Dan Jurafsky

Improving a sequence-to-sequence nlp model using a reinforcement learning policy algorithm

Nowadays, the current neural network models of dialogue generation(chatbots) show great promise for generating answers for chatty agents. But they are short-sighted in that they predict utterances one at a time while disregarding their…

Computation and Language · Computer Science 2023-01-19 Jabri Ismail , Aboulbichr Ahmed , El ouaazizi Aziza

Deep Reinforcement Learning For Modeling Chit-Chat Dialog With Discrete Attributes

Open domain dialog systems face the challenge of being repetitive and producing generic responses. In this paper, we demonstrate that by conditioning the response generation on interpretable discrete dialog attributes and composed…

Machine Learning · Computer Science 2019-09-17 Chinnadhurai Sankar , Sujith Ravi

An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals

Deep Reinforcement Learning (DRL) is widely used in task-oriented dialogue systems to optimize dialogue policy, but it struggles to balance exploration and exploitation due to the high dimensionality of state and action spaces. This…

Computation and Language · Computer Science 2025-06-06 Yangyang Zhao , Ben Niu , Libo Qin , Shihan Wang

Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training

Actor-critic deep reinforcement learning (DRL) algorithms have recently achieved prominent success in tackling various challenging reinforcement learning (RL) problems, particularly complex control tasks with high-dimensional continuous…

Machine Learning · Computer Science 2023-05-04 Gang Chen , Victoria Huang

Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models

In this paper, we present a deep reinforcement learning (RL) framework for iterative dialog policy optimization in end-to-end task-oriented dialog systems. Popular approaches in learning dialog policy with RL include letting a dialog agent…

Computation and Language · Computer Science 2017-09-20 Bing Liu , Ian Lane

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Conventionally, generation of natural language for dialogue agents may be viewed as a statistical learning problem: determine the patterns in human-provided data and generate appropriate responses with similar statistical properties.…

Computation and Language · Computer Science 2022-04-19 Siddharth Verma , Justin Fu , Mengjiao Yang , Sergey Levine

Integrating Pretrained Language Model for Dialogue Policy Learning

Reinforcement Learning (RL) has been witnessed its potential for training a dialogue policy agent towards maximizing the accumulated rewards given from users. However, the reward can be very sparse for it is usually only provided at the end…

Computation and Language · Computer Science 2021-11-03 Hongru Wang , Huimin Wang , Zezhong Wang , Kam-Fai Wong

Efficient entity-based reinforcement learning

Recent deep reinforcement learning (DRL) successes rely on end-to-end learning from fixed-size observational inputs (e.g. image, state-variables). However, many challenging and interesting problems in decision making involve observations or…

Machine Learning · Computer Science 2022-06-08 Vince Jankovics , Michael Garcia Ortiz , Eduardo Alonso

GoChat: Goal-oriented Chatbots with Hierarchical Reinforcement Learning

A chatbot that converses like a human should be goal-oriented (i.e., be purposeful in conversation), which is beyond language generation. However, existing dialogue systems often heavily rely on cumbersome hand-crafted rules or costly…

Computation and Language · Computer Science 2020-05-27 Jianfeng Liu , Feiyang Pan , Ling Luo

End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning

In this paper, we present a neural network based task-oriented dialogue system that can be optimized end-to-end with deep reinforcement learning (RL). The system is able to track dialogue state, interface with knowledge bases, and…

Computation and Language · Computer Science 2017-12-04 Bing Liu , Gokhan Tur , Dilek Hakkani-Tur , Pararth Shah , Larry Heck

An Introduction to Deep Reinforcement and Imitation Learning

Embodied agents, such as robots and virtual characters, must continuously select actions to execute tasks effectively, solving complex sequential decision-making problems. Given the difficulty of designing such controllers manually,…

Robotics · Computer Science 2026-05-18 Pedro Santana

A Survey on Reinforcement Learning Methods in Character Animation

Reinforcement Learning is an area of Machine Learning focused on how agents can be trained to make sequential decisions, and achieve a particular goal within an arbitrary environment. While learning, they repeatedly take actions based on…

Graphics · Computer Science 2022-05-26 Ariel Kwiatkowski , Eduardo Alvarado , Vicky Kalogeiton , C. Karen Liu , Julien Pettré , Michiel van de Panne , Marie-Paule Cani

Hierarchical Reinforcement Learning for Open-Domain Dialog

Open-domain dialog generation is a challenging problem; maximum likelihood training can lead to repetitive outputs, models have difficulty tracking long-term conversational goals, and training on standard movie or online datasets may lead…

Machine Learning · Computer Science 2020-01-03 Abdelrhman Saleh , Natasha Jaques , Asma Ghandeharioun , Judy Hanwen Shen , Rosalind Picard

ChatShopBuddy: Towards Reliable Conversational Shopping Agents via Reinforcement Learning

Conversational shopping agents represent a critical consumer-facing application of Large Language Model (LLM)-powered agents, yet how to effectively apply post-training Reinforcement Learning (RL) to optimize such agents remains…

Information Retrieval · Computer Science 2026-03-09 Yiruo Cheng , Kelong Mao , Tianhao Li , Jiejun Tan , Ji-Rong Wen , Zhicheng Dou

A Mixture-of-Expert Approach to RL-based Dialogue Management

Despite recent advancements in language models (LMs), their application to dialogue management (DM) problems and ability to carry on rich conversations remain a challenge. We use reinforcement learning (RL) to develop a dialogue agent that…

Computation and Language · Computer Science 2022-06-02 Yinlam Chow , Aza Tulepbergenov , Ofir Nachum , MoonKyung Ryu , Mohammad Ghavamzadeh , Craig Boutilier

A Hierarchical Approach to Population Training for Human-AI Collaboration

A major challenge for deep reinforcement learning (DRL) agents is to collaborate with novel partners that were not encountered by them during the training phase. This is specifically worsened by an increased variance in action responses…

Artificial Intelligence · Computer Science 2023-05-29 Yi Loo , Chen Gong , Malika Meghjani