Related papers: Rethinking and Improving Natural Language Generati…

Layer-Wise Multi-View Learning for Neural Machine Translation

Traditional neural machine translation is limited to the topmost encoder layer's context representation and cannot directly perceive the lower encoder layers. Existing solutions usually rely on the adjustment of network architecture, making…

Computation and Language · Computer Science 2020-11-04 Qiang Wang , Changliang Li , Yue Zhang , Tong Xiao , Jingbo Zhu

Self-Enhancing Multi-filter Sequence-to-Sequence Model

Representation learning is important for solving sequence-to-sequence problems in natural language processing. Representation learning transforms raw data into vector-form representations while preserving their features. However, data with…

Computation and Language · Computer Science 2023-01-12 Yunhao Yang , Zhaokun Xue , Andrew Whinston

Investigating Linguistic Pattern Ordering in Hierarchical Natural Language Generation

Natural language generation (NLG) is a critical component in spoken dialogue system, which can be divided into two phases: (1) sentence planning: deciding the overall sentence structure, (2) surface realization: determining specific word…

Computation and Language · Computer Science 2018-09-21 Shang-Yu Su , Yun-Nung Chen

Middle-Out Decoding

Despite being virtually ubiquitous, sequence-to-sequence models are challenged by their lack of diversity and inability to be externally controlled. In this paper, we speculate that a fundamental shortcoming of sequence generation models is…

Computation and Language · Computer Science 2018-10-30 Shikib Mehri , Leonid Sigal

Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

The sequence-to-sequence (seq2seq) task aims at generating the target sequence based on the given input source sequence. Traditionally, most of the seq2seq task is resolved by the Encoder-Decoder framework which requires an encoder to…

Computation and Language · Computer Science 2023-04-11 Zihao Fu , Wai Lam , Qian Yu , Anthony Man-Cho So , Shengding Hu , Zhiyuan Liu , Nigel Collier

Learning Generic Sentence Representations Using Convolutional Neural Networks

We propose a new encoder-decoder approach to learn distributed sentence representations that are applicable to multiple purposes. The model is learned by using a convolutional neural network as an encoder to map an input sentence into a…

Computation and Language · Computer Science 2017-07-28 Zhe Gan , Yunchen Pu , Ricardo Henao , Chunyuan Li , Xiaodong He , Lawrence Carin

Hierarchical Memory Decoding for Video Captioning

Recent advances of video captioning often employ a recurrent neural network (RNN) as the decoder. However, RNN is prone to diluting long-term information. Recent works have demonstrated memory network (MemNet) has the advantage of storing…

Computer Vision and Pattern Recognition · Computer Science 2020-02-28 Aming Wu , Yahong Han

Context-aware Natural Language Generation with Recurrent Neural Networks

This paper studied generating natural languages at particular contexts or situations. We proposed two novel approaches which encode the contexts into a continuous semantic representation and then decode the semantic representation into text…

Computation and Language · Computer Science 2016-12-01 Jian Tang , Yifan Yang , Sam Carton , Ming Zhang , Qiaozhu Mei

Cross-Lingual Natural Language Generation via Pre-Training

In this work we focus on transferring supervision signals of natural language generation (NLG) tasks between multiple languages. We propose to pretrain the encoder and the decoder of a sequence-to-sequence model under both monolingual and…

Computation and Language · Computer Science 2019-11-25 Zewen Chi , Li Dong , Furu Wei , Wenhui Wang , Xian-Ling Mao , Heyan Huang

Deconvolutional Paragraph Representation Learning

Learning latent representations from long text sequences is an important first step in many natural language processing applications. Recurrent Neural Networks (RNNs) have become a cornerstone for this challenging task. However, the quality…

Computation and Language · Computer Science 2017-09-25 Yizhe Zhang , Dinghan Shen , Guoyin Wang , Zhe Gan , Ricardo Henao , Lawrence Carin

Universal Multimodal Representation for Language Understanding

Representation learning is the foundation of natural language processing (NLP). This work presents new methods to employ visual information as assistant signals to general NLP tasks. For each sentence, we first retrieve a flexible number of…

Computation and Language · Computer Science 2023-01-10 Zhuosheng Zhang , Kehai Chen , Rui Wang , Masao Utiyama , Eiichiro Sumita , Zuchao Li , Hai Zhao

Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network

Despite having impressive vision-language (VL) pretraining with BERT-based encoder for VL understanding, the pretraining of a universal encoder-decoder for both VL understanding and generation remains challenging. The difficulty originates…

Computer Vision and Pattern Recognition · Computer Science 2021-01-28 Yehao Li , Yingwei Pan , Ting Yao , Jingwen Chen , Tao Mei

Neural Text Generation: A Practical Guide

Deep learning methods have recently achieved great empirical success on machine translation, dialogue response generation, summarization, and other text generation tasks. At a high level, the technique has been to train end-to-end neural…

Computation and Language · Computer Science 2017-11-28 Ziang Xie

Natural Language Generation by Hierarchical Decoding with Linguistic Patterns

Natural language generation (NLG) is a critical component in spoken dialogue systems. Classic NLG can be divided into two phases: (1) sentence planning: deciding on the overall sentence structure, (2) surface realization: determining…

Computation and Language · Computer Science 2018-08-10 Shang-Yu Su , Kai-Ling Lo , Yi-Ting Yeh , Yun-Nung Chen

Pixel-Aligned Multi-View Generation with Depth Guided Decoder

The task of image-to-multi-view generation refers to generating novel views of an instance from a single image. Recent methods achieve this by extending text-to-image latent diffusion models to multi-view version, which contains an VAE…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Zhenggang Tang , Peiye Zhuang , Chaoyang Wang , Aliaksandr Siarohin , Yash Kant , Alexander Schwing , Sergey Tulyakov , Hsin-Ying Lee

Making Language Model a Hierarchical Classifier

Decoder-only language models, such as GPT and LLaMA, generally decode on the last layer. Motivated by human's hierarchical thinking capability, we propose that a hierarchical decoder architecture could be built with different layers…

Computation and Language · Computer Science 2025-09-30 Yihong Wang , Zhonglin Jiang , Ningyuan Xi , Yue Zhao , Qingqing Gu , Xiyuan Chen , Hao Wu , Sheng Xu , Hange Zhou , Yong Chen , Luo Ji

End-to-End Video Captioning

Building correspondences across different modalities, such as video and language, has recently become critical in many visual recognition applications, such as video captioning. Inspired by machine translation, recent models tackle this…

Computer Vision and Pattern Recognition · Computer Science 2019-11-11 Silvio Olivastri , Gurkirt Singh , Fabio Cuzzolin

Learning to Compose Representations of Different Encoder Layers towards Improving Compositional Generalization

Recent studies have shown that sequence-to-sequence (seq2seq) models struggle with compositional generalization (CG), i.e., the ability to systematically generalize to unseen compositions of seen components. There is mounting evidence that…

Computation and Language · Computer Science 2023-10-19 Lei Lin , Shuangtao Li , Yafang Zheng , Biao Fu , Shan Liu , Yidong Chen , Xiaodong Shi

Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

Current state-of-the-art machine translation systems are based on encoder-decoder architectures, that first encode the input sequence, and then generate an output sequence based on the input encoding. Both are interfaced with an attention…

Computation and Language · Computer Science 2018-11-02 Maha Elbayad , Laurent Besacier , Jakob Verbeek

Exploiting Deep Representations for Neural Machine Translation

Advanced neural machine translation (NMT) models generally implement encoder and decoder as multiple layers, which allows systems to model complex functions and capture complicated linguistic structures. However, only the top layers of…

Computation and Language · Computer Science 2018-10-25 Zi-Yi Dou , Zhaopeng Tu , Xing Wang , Shuming Shi , Tong Zhang