Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Kyunghyun Cho; Bart van Merrienboer; Caglar Gulcehre; Dzmitry Bahdanau; Fethi Bougares; Holger Schwenk; Yoshua Bengio

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Computation and Language 2014-09-04 v3 Machine Learning Neural and Evolutionary Computing Machine Learning

Authors: Kyunghyun Cho , Bart van Merrienboer , Caglar Gulcehre , Dzmitry Bahdanau , Fethi Bougares , Holger Schwenk , Yoshua Bengio

View on arXiv ↗ PDF ↗

Abstract

In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

Keywords

natural language parsing encoder-decoder architecture recurrent neural network

Cite

@article{arxiv.1406.1078,
  title  = {Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation},
  author = {Kyunghyun Cho and Bart van Merrienboer and Caglar Gulcehre and Dzmitry Bahdanau and Fethi Bougares and Holger Schwenk and Yoshua Bengio},
  journal= {arXiv preprint arXiv:1406.1078},
  year   = {2014}
}

Comments

EMNLP 2014

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Abstract

Keywords

Cite

Comments

Related papers