Related papers: Unsupervised Dialog Structure Learning

Structured Attention for Unsupervised Dialogue Structure Induction

Inducing a meaningful structural representation from one or a set of dialogues is a crucial but challenging task in computational linguistics. Advancement made in this area is critical for dialogue system design and discourse analysis. It…

Computation and Language · Computer Science 2021-03-15 Liang Qiu , Yizhou Zhao , Weiyan Shi , Yuan Liang , Feng Shi , Tao Yuan , Zhou Yu , Song-Chun Zhu

Learning Interpretable Latent Dialogue Actions With Less Supervision

We present a novel architecture for explainable modeling of task-oriented dialogues with discrete latent variables to represent dialogue actions. Our model is based on variational recurrent neural networks (VRNN) and requires no explicit…

Computation and Language · Computer Science 2022-10-14 Vojtěch Hudeček , Ondřej Dušek

Discovering Dialog Structure Graph for Open-Domain Dialog Generation

Learning interpretable dialog structure from human-human dialogs yields basic insights into the structure of conversation, and also provides background knowledge to facilitate dialog generation. In this paper, we conduct unsupervised…

Artificial Intelligence · Computer Science 2021-01-01 Jun Xu , Zeyang Lei , Haifeng Wang , Zheng-Yu Niu , Hua Wu , Wanxiang Che , Ting Liu

Unsupervised Discrete Sentence Representation Learning for Interpretable Neural Dialog Generation

The encoder-decoder dialog model is one of the most prominent methods used to build dialog systems in complex domains. Yet it is limited because it cannot output interpretable actions as in traditional systems, which hinders humans from…

Computation and Language · Computer Science 2018-04-24 Tiancheng Zhao , Kyusong Lee , Maxine Eskenazi

Unsupervised Learning of Interpretable Dialog Models

Recently several deep learning based models have been proposed for end-to-end learning of dialogs. While these models can be trained from data without the need for any additional annotations, it is hard to interpret them. On the other hand,…

Artificial Intelligence · Computer Science 2018-11-05 Dhiraj Madan , Dinesh Raghu , Gaurav Pandey , Sachindra Joshi

DSBERT:Unsupervised Dialogue Structure learning with BERT

Unsupervised dialogue structure learning is an important and meaningful task in natural language processing. The extracted dialogue structure and process can help analyze human dialogue, and play a vital role in the design and evaluation of…

Computation and Language · Computer Science 2021-11-10 Bingkun Chen , Shaobing Dai , Shenghua Zheng , Lei Liao , Yang Li

Variational Transformers for Diverse Response Generation

Despite the great promise of Transformers in many sequence modeling tasks (e.g., machine translation), their deterministic nature hinders them from generalizing to high entropy tasks such as dialogue response generation. Previous work…

Computation and Language · Computer Science 2020-03-31 Zhaojiang Lin , Genta Indra Winata , Peng Xu , Zihan Liu , Pascale Fung

DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder

Variational autoencoders~(VAEs) have shown a promise in data-driven conversation modeling. However, most VAE conversation models match the approximate posterior distribution over the latent variables to a simple prior such as standard…

Computation and Language · Computer Science 2019-02-27 Xiaodong Gu , Kyunghyun Cho , Jung-Woo Ha , Sunghun Kim

DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation

Dialog response generation in open domain is an important research topic where the main challenge is to generate relevant and diverse responses. In this paper, we propose a new dialog pre-training framework called DialogVED, which…

Computation and Language · Computer Science 2022-11-01 Wei Chen , Yeyun Gong , Song Wang , Bolun Yao , Weizhen Qi , Zhongyu Wei , Xiaowu Hu , Bartuer Zhou , Yi Mao , Weizhu Chen , Biao Cheng , Nan Duan

Advanced Conditional Variational Autoencoders (A-CVAE): Towards interpreting open-domain conversation generation via disentangling latent feature representation

Currently end-to-end deep learning based open-domain dialogue systems remain black box models, making it easy to generate irrelevant contents with data-driven models. Specifically, latent variables are highly entangled with different…

Computation and Language · Computer Science 2022-07-27 Ye Wang , Jingbo Liao , Hong Yu , Guoyin Wang , Xiaoxia Zhang , Li Liu

Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

Reinforcement learning has been widely adopted to model dialogue managers in task-oriented dialogues. However, the user simulator provided by state-of-the-art dialogue frameworks are only rough approximations of human behaviour. The ability…

Computation and Language · Computer Science 2023-02-23 Thibault Cordier , Tanguy Urvoy , Fabrice Lefevre , Lina M. Rojas-Barahona

V-VAE: A Variational Auto Encoding Framework Towards Fine-Grained Control over Human-Like Chat

With the continued proliferation of Large Language Model (LLM) based chatbots, there is a growing demand for generating responses that are not only linguistically fluent but also consistently aligned with persona-specific traits in…

Computation and Language · Computer Science 2025-12-12 Qi Lin , Weikai Xu , Lisi Chen , Bin Dai

A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings

We propose a new unsupervised model for mapping a variable-duration speech segment to a fixed-dimensional representation. The resulting acoustic word embeddings can form the basis of search, discovery, and indexing systems for low- and…

Audio and Speech Processing · Electrical Eng. & Systems 2020-12-07 Puyuan Peng , Herman Kamper , Karen Livescu

Variational Reward Estimator Bottleneck: Learning Robust Reward Estimator for Multi-Domain Task-Oriented Dialog

Despite its notable success in adversarial learning approaches to multi-domain task-oriented dialog system, training the dialog policy via adversarial inverse reinforcement learning often fails to balance the performance of the policy…

Artificial Intelligence · Computer Science 2020-06-02 Jeiyoon Park , Chanhee Lee , Kuekyeng Kim , Heuiseok Lim

Reasoning Visual Dialogs with Structural and Partial Observations

We propose a novel model to address the task of Visual Dialog which exhibits complex dialog structures. To obtain a reasonable answer based on the current question and the dialog history, the underlying semantic dependencies between dialog…

Computer Vision and Pattern Recognition · Computer Science 2019-05-30 Zilong Zheng , Wenguan Wang , Siyuan Qi , Song-Chun Zhu

Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System

Designing task-oriented dialogue systems is a challenging research topic, since it needs not only to generate utterances fulfilling user requests but also to guarantee the comprehensibility. Many previous works trained end-to-end (E2E)…

Computation and Language · Computer Science 2021-02-22 Jianhong Wang , Yuan Zhang , Tae-Kyun Kim , Yunjie Gu

Improving Variational Encoder-Decoders in Dialogue Generation

Variational encoder-decoders (VEDs) have shown promising results in dialogue generation. However, the latent variable distributions are usually approximated by a much simpler model than the powerful RNN structure used for encoding and…

Computation and Language · Computer Science 2018-02-07 Xiaoyu Shen , Hui Su , Shuzi Niu , Vera Demberg

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

The Variational Autoencoder (VAE) is a powerful deep generative model that is now extensively used to represent high-dimensional complex data via a low-dimensional latent space learned in an unsupervised manner. In the original VAE model,…

Sound · Computer Science 2021-06-15 Xiaoyu Bie , Laurent Girin , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda

Transformer-Based Conditioned Variational Autoencoder for Dialogue Generation

In human dialogue, a single query may elicit numerous appropriate responses. The Transformer-based dialogue model produces frequently occurring sentences in the corpus since it is a one-to-one mapping function. CVAE is a technique for…

Computation and Language · Computer Science 2022-10-25 Huihui Yang

Unsupervised speech representation learning using WaveNet autoencoders

We consider the task of unsupervised extraction of meaningful latent representations of speech by applying autoencoding neural networks to speech waveforms. The goal is to learn a representation able to capture high level semantic content…

Machine Learning · Computer Science 2019-09-12 Jan Chorowski , Ron J. Weiss , Samy Bengio , Aäron van den Oord