Related papers: Neural Machine Translation via Binary Code Predict…

Neural Machine Translation by Jointly Learning to Align and Translate

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to…

Computation and Language · Computer Science 2016-05-23 Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio

Vocabulary Manipulation for Neural Machine Translation

In order to capture rich language phenomena, neural machine translation models have to use a large vocabulary size, which requires high computing time and large memory usage. In this paper, we alleviate this issue by introducing a…

Computation and Language · Computer Science 2016-08-02 Haitao Mi, Zhiguo Wang, Abe Ittycheriah

Improving Lexical Choice in Neural Machine Translation

We explore two solutions to the problem of mistranslating rare words in neural machine translation. First, we argue that the standard output layer, which computes the inner product of a vector representing the context with all possible…

Computation and Language · Computer Science 2018-04-19 Toan Q. Nguyen, David Chiang

Vocabulary Selection Strategies for Neural Machine Translation

Classical translation models constrain the space of possible outputs by selecting a subset of translation rules based on the input sentence. Recent work on improving the efficiency of neural translation models adopted a similar strategy by…

Computation and Language · Computer Science 2016-10-04 Gurvan L'Hostis, David Grangier, Michael Auli

Neural System Combination for Machine Translation

Neural machine translation (NMT) becomes a new approach to machine translation and generates much more fluent results compared to statistical machine translation (SMT). However, SMT is usually better than NMT in translation adequacy. It is…

Computation and Language · Computer Science 2017-04-24 Long Zhou, Wenpeng Hu, Jiajun Zhang, Chengqing Zong

Training Neural Machine Translation using Word Embedding-based Loss

In neural machine translation (NMT), the computational cost at the output layer increases with the size of the target-side vocabulary. Using a limited-size vocabulary instead may cause a significant decrease in translation quality. This…

Computation and Language · Computer Science 2018-07-31 Katsuki Chousa, Katsuhito Sudoh, Satoshi Nakamura

Multi-Layer Softmaxing during Training Neural Machine Translation for Flexible Decoding with Fewer Layers

This paper proposes a novel procedure for training an encoder-decoder based deep neural network which compresses NxM models into a single model enabling us to dynamically choose the number of encoder and decoder layers for decoding.…

Computation and Language · Computer Science 2019-08-29 Raj Dabre, Atsushi Fujita

Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU

Attentional sequence-to-sequence models have become the new standard for machine translation, but one challenge of such models is a significant increase in training and decoding cost compared to phrase-based systems. Here, we focus on…

Computation and Language · Computer Science 2017-05-08 Jacob Devlin

Neural Machine Translation with Word Predictions

In the encoder-decoder architecture for neural machine translation (NMT), the hidden states of the recurrent structures in the encoder and decoder carry the crucial information about the sentence. These vectors are generated by parameters…

Computation and Language · Computer Science 2017-08-08 Rongxiang Weng, Shujian Huang, Zaixiang Zheng, Xinyu Dai, Jiajun Chen

Speeding Up Neural Machine Translation Decoding by Cube Pruning

Although neural machine translation has achieved promising results, it suffers from slow translation speed. The direct consequence is that a trade-off has to be made between translation quality and speed, thus its performance cannot come…

Computation and Language · Computer Science 2018-09-11 Wen Zhang, Liang Huang, Yang Feng, Lei Shen, Qun Liu

On Using Very Large Target Vocabulary for Neural Machine Translation

Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite…

Computation and Language · Computer Science 2015-03-19 Sébastien Jean, Kyunghyun Cho, Roland Memisevic, Yoshua Bengio

Incorporating Discrete Translation Lexicons into Neural Machine Translation

Neural machine translation (NMT) often makes mistakes in translating low-frequency content words that are essential to understanding the meaning of the sentence. We propose a method to alleviate this problem by augmenting NMT systems with…

Computation and Language · Computer Science 2016-10-06 Philip Arthur, Graham Neubig, Satoshi Nakamura

Towards Interlingua Neural Machine Translation

Common intermediate language representation in neural machine translation can be used to extend bilingual to multilingual systems by incremental training. In this paper, we propose a new architecture based on introducing an interlingual…

Computation and Language · Computer Science 2019-12-10 Carlos Escolano, Marta R. Costa-jussà, José A. R. Fonollosa

Beyond BLEU: Training Neural Machine Translation with Semantic Similarity

While most neural machine translation (NMT) systems are still trained using maximum likelihood estimation, recent work has demonstrated that optimizing systems to directly improve evaluation metrics such as BLEU can substantially improve…

Computation and Language · Computer Science 2019-09-17 John Wieting, Taylor Berg-Kirkpatrick, Kevin Gimpel, Graham Neubig

A Convolutional Encoder Model for Neural Machine Translation

The prevalent approach to neural machine translation relies on bi-directional LSTMs to encode the source sentence. In this paper we present a faster and simpler architecture based on a succession of convolutional layers. This allows to…

Computation and Language · Computer Science 2017-07-26 Jonas Gehring, Michael Auli, David Grangier, Yann N. Dauphin

On the Importance of Word Boundaries in Character-level Neural Machine Translation

Neural Machine Translation (NMT) models generally perform translation using a fixed-size lexical vocabulary, which is an important bottleneck on their generalization capability and overall translation quality. The standard approach to…

Computation and Language · Computer Science 2019-10-22 Duygu Ataman, Orhan Firat, Mattia A. Di Gangi, Marcello Federico, Alexandra Birch

Bilingual Dictionary-based Language Model Pretraining for Neural Machine Translation

Recent studies have demonstrated a perceivable improvement on the performance of neural machine translation by applying cross-lingual language model pretraining (Lample and Conneau, 2019), especially the Translation Language Modeling (TLM).…

Computation and Language · Computer Science 2021-03-15 Yusen Lin, Jiayong Lin, Shuaicheng Zhang, Haoying Dai

Iterative Refinement for Machine Translation

Existing machine translation decoding algorithms generate translations in a strictly monotonic fashion and never revisit previous decisions. As a result, earlier mistakes cannot be corrected at a later stage. In this paper, we present a…

Computation and Language · Computer Science 2018-04-17 Roman Novak, Michael Auli, David Grangier

Improving Neural Machine Translation through Phrase-based Forced Decoding

Compared to traditional statistical machine translation (SMT), neural machine translation (NMT) often sacrifices adequacy for the sake of fluency. We propose a method to combine the advantages of traditional SMT and NMT by exploiting an…

Computation and Language · Computer Science 2017-11-02 Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura

Neural Machine Translation for Code Generation

Neural machine translation (NMT) methods developed for natural language processing have been shown to be highly successful in automating translation from one natural language to another. Recently, these NMT methods have been adapted to the…

Computation and Language · Computer Science 2023-05-24 Dharma KC, Clayton T. Morrison