Related papers: Variational Memory Encoder-Decoder

Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders

While recent neural encoder-decoder models have shown great promise in modeling open-domain conversations, they often generate dull and generic responses. Unlike past work that has focused on diversifying the output of the decoder at…

Computation and Language · Computer Science 2017-10-24 Tiancheng Zhao, Ran Zhao, Maxine Eskenazi

Variational Attention for Sequence-to-Sequence Models

The variational encoder-decoder (VED) encodes source information as a set of random variables using a neural network, which in turn is decoded into target data using another neural network. In natural language processing,…

Computation and Language · Computer Science 2018-06-25 Hareesh Bahuleyan, Lili Mou, Olga Vechtomova, Pascal Poupart

Towards General Continuous Memory for Vision-Language Models

Language models (LMs) and their extension, vision-language models (VLMs), have achieved remarkable performance across various tasks. However, they still struggle with complex reasoning tasks that require multimodal or multilingual…

Machine Learning · Computer Science 2025-07-09 Wenyi Wu, Zixuan Song, Kun Zhou, Yifei Shao, Zhiting Hu, Biwei Huang

Improving Variational Encoder-Decoders in Dialogue Generation

Variational encoder-decoders (VEDs) have shown promising results in dialogue generation. However, the latent variable distributions are usually approximated by a much simpler model than the powerful RNN structure used for encoding and…

Computation and Language · Computer Science 2018-02-07 Xiaoyu Shen, Hui Su, Shuzi Niu, Vera Demberg

Variational Transformers for Diverse Response Generation

Despite the great promise of Transformers in many sequence modeling tasks (e.g., machine translation), their deterministic nature hinders them from generalizing to high entropy tasks such as dialogue response generation. Previous work…

Computation and Language · Computer Science 2020-03-31 Zhaojiang Lin, Genta Indra Winata, Peng Xu, Zihan Liu, Pascale Fung

Variational Memory Addressing in Generative Models

Aiming to augment generative models with external memory, we interpret the output of a memory module with stochastic addressing as a conditional mixture distribution, where a read operation corresponds to sampling a discrete memory address…

Machine Learning · Computer Science 2017-09-22 Jörg Bornschein, Andriy Mnih, Daniel Zoran, Danilo J. Rezende

Conditional Variational Autoencoder for Neural Machine Translation

We explore the performance of latent variable models for conditional text generation in the context of neural machine translation (NMT). Similar to Zhang et al., we augment the encoder-decoder NMT paradigm by introducing a continuous latent…

Computation and Language · Computer Science 2018-12-12 Artidoro Pagnoni, Kevin Liu, Shangyan Li

Language Model Memory and Memory Models for Language

The ability of machine learning models to store input information in hidden layer vector embeddings, analogous to the concept of 'memory', is widely employed but not well characterized. We find that language model embeddings typically…

Computation and Language · Computer Science 2026-05-20 Benjamin L. Badger

Variational Neural Machine Translation

Models of neural machine translation are often from a discriminative family of encoder-decoders that learn a conditional distribution of a target sentence given a source sentence. In this paper, we propose a variational model to learn this…

Computation and Language · Computer Science 2016-09-27 Biao Zhang, Deyi Xiong, Jinsong Su, Hong Duan, Min Zhang

A Discrete CVAE for Response Generation on Short-Text Conversation

Neural conversation models such as encoder-decoder models are easy to generate bland and generic responses. Some researchers propose to use the conditional variational autoencoder (CVAE) which maximizes the lower bound on the conditional…

Computation and Language · Computer Science 2019-11-25 Jun Gao, Wei Bi, Xiaojiang Liu, Junhui Li, Guodong Zhou, Shuming Shi

VCDM: Leveraging Variational Bi-encoding and Deep Contextualized Word Representations for Improved Definition Modeling

In this paper, we tackle the task of definition modeling, where the goal is to learn to generate definitions of words and phrases. Existing approaches for this task are discriminative, combining distributional and lexical semantics in an…

Computation and Language · Computer Science 2020-10-08 Machel Reid, Edison Marrese-Taylor, Yutaka Matsuo

DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation

Dialog response generation in open domain is an important research topic where the main challenge is to generate relevant and diverse responses. In this paper, we propose a new dialog pre-training framework called DialogVED, which…

Computation and Language · Computer Science 2022-11-01 Wei Chen, Yeyun Gong, Song Wang, Bolun Yao, Weizhen Qi, Zhongyu Wei, Xiaowu Hu, Bartuer Zhou, Yi Mao, Weizhu Chen, Biao Cheng, Nan Duan

Variational Neurons in Transformers for Language Modeling

Transformers for language modeling usually rely on deterministic internal computation, with uncertainty expressed mainly at the output layer. We introduce variational neurons into Transformer feed-forward computation so that uncertainty…

Machine Learning · Computer Science 2026-03-31 Yves Ruffenach

VMI-VAE: Variational Mutual Information Maximization Framework for VAE With Discrete and Continuous Priors

Variational Autoencoder is a scalable method for learning latent variable models of complex data. It employs a clear objective that can be easily optimized. However, it does not explicitly measure the quality of learned representations. We…

Machine Learning · Computer Science 2020-05-29 Andriy Serdega, Dae-Shik Kim

A variance modeling framework based on variational autoencoders for speech enhancement

In this paper we address the problem of enhancing speech signals in noisy mixtures using a source separation approach. We explore the use of neural networks as an alternative to a popular speech variance model based on supervised…

Sound · Computer Science 2019-02-06 Simon Leglaive, Laurent Girin, Radu Horaud

Variational Recurrent Neural Machine Translation

Partially inspired by successful applications of variational recurrent neural networks, we propose a novel variational recurrent neural machine translation (VRNMT) model in this paper. Different from the variational NMT, VRNMT introduces a…

Computation and Language · Computer Science 2018-01-17 Jinsong Su, Shan Wu, Deyi Xiong, Yaojie Lu, Xianpei Han, Biao Zhang

Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion

Variational autoencoder-based voice conversion (VAE-VC) has the advantage of requiring only pairs of speeches and speaker labels for training. Unlike the majority of the research in VAE-VC which focuses on utilizing auxiliary losses or…

Sound · Computer Science 2021-12-07 Kei Akuzawa, Kotaro Onishi, Keisuke Takiguchi, Kohki Mametani, Koichiro Mori

Improve Variational Autoencoder for Text Generation with Discrete Latent Bottleneck

Variational autoencoders (VAEs) are essential tools in end-to-end representation learning. However, the sequential text generation common pitfall with VAEs is that the model tends to ignore latent variables with a strong auto-regressive…

Machine Learning · Computer Science 2021-02-26 Yang Zhao, Ping Yu, Suchismit Mahapatra, Qinliang Su, Changyou Chen

An Extended Variational Mode Decomposition Algorithm Developed Speech Emotion Recognition Performance

Emotion recognition (ER) from speech signals is a robust approach since it cannot be imitated like facial expression or text based sentiment analysis. Valuable information underlying the emotions are significant for human-computer…

Sound · Computer Science 2023-12-19 David Hason Rudd, Huan Huo, Guandong Xu

Variational Autoencoder for Speech Enhancement with a Noise-Aware Encoder

Recently, a generative variational autoencoder (VAE) has been proposed for speech enhancement to model speech statistics. However, this approach only uses clean speech in the training phase, making the estimation particularly sensitive to…

Audio and Speech Processing · Electrical Eng. & Systems 2021-05-18 Huajian Fang, Guillaume Carbajal, Stefan Wermter, Timo Gerkmann