Related papers: Asynchronous Bidirectional Decoding for Neural Mac…

Synchronous Bidirectional Neural Machine Translation

Existing approaches to neural machine translation (NMT) generate the target language sequence token by token from left to right. However, this kind of unidirectional decoding framework cannot make full use of the target-side future contexts…

Computation and Language · Computer Science 2019-05-14 Long Zhou, Jiajun Zhang, Chengqing Zong

Asynchronous and Segmented Bidirectional Encoding for NMT

With the rapid advancement of Neural Machine Translation (NMT), enhancing translation efficiency and quality has become a focal point of research. Despite the commendable performance of general models such as the Transformer in various…

Computation and Language · Computer Science 2024-02-26 Jingpu Yang, Zehua Han, Mengyu Xiang, Helin Wang, Yuxiao Huang, Miao Fang

Bi-Decoder Augmented Network for Neural Machine Translation

Neural Machine Translation (NMT) has become a popular technology in recent years, and the encoder-decoder framework is the mainstream among all the methods. It's obvious that the quality of the semantic representations from encoding is very…

Computation and Language · Computer Science 2020-01-15 Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai

Enhanced Neural Machine Translation by Learning from Draft

Neural machine translation (NMT) has recently achieved impressive results. A potential problem of the existing NMT algorithm, however, is that the decoding is conducted from left to right, without considering the right context. This paper…

Computation and Language · Computer Science 2017-10-06 Aodong Li, Shiyue Zhang, Dong Wang, Thomas Fang Zheng

Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder

Most neural machine translation (NMT) models are based on the sequential encoder-decoder framework, which makes no use of syntactic information. In this paper, we improve this model by explicitly incorporating source-side syntactic trees.…

Computation and Language · Computer Science 2017-07-19 Huadong Chen, Shujian Huang, David Chiang, Jiajun Chen

Look Backward and Forward: Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation

Neural Machine Translation (NMT) models are usually trained via a unidirectional decoder, which corresponds to optimizing one-step-ahead prediction. However, this kind of unidirectional decoding framework may incline to focus on local structure…

Computation and Language · Computer Science 2022-03-14 Xuanwei Zhang, Libin Shen, Disheng Pan, Liang Wang, Yanjun Miao

Improving Context-aware Neural Machine Translation with Target-side Context

In recent years, several studies on neural machine translation (NMT) have attempted to use document-level context by using a multi-encoder and two attention mechanisms to read the current and previous sentences to incorporate the context of…

Computation and Language · Computer Science 2019-09-04 Hayahide Yamagishi, Mamoru Komachi

Multi-channel Encoder for Neural Machine Translation

Attention-based Encoder-Decoder has the effective architecture for neural machine translation (NMT), which typically relies on recurrent neural networks (RNN) to build the blocks that will be lately called by attentive reader during the…

Computation and Language · Computer Science 2017-12-07 Hao Xiong, Zhongjun He, Xiaoguang Hu, Hua Wu

An empirical study on the effectiveness of images in Multimodal Neural Machine Translation

In state-of-the-art Neural Machine Translation (NMT), an attention mechanism is used during decoding to enhance the translation. At every step, the decoder uses this mechanism to focus on different parts of the source sentence to gather the…

Computation and Language · Computer Science 2018-05-31 Jean-Benoit Delbrouck, Stéphane Dupont

Bi-Directional Neural Machine Translation with Synthetic Parallel Data

Despite impressive progress in high-resource settings, Neural Machine Translation (NMT) still struggles in low-resource and out-of-domain scenarios, often failing to match the quality of phrase-based translation. We propose a novel…

Computation and Language · Computer Science 2018-05-31 Xing Niu, Michael Denkowski, Marine Carpuat

Temporal Attention Model for Neural Machine Translation

Attention-based Neural Machine Translation (NMT) models suffer from attention deficiency issues as has been observed in recent research. We propose a novel mechanism to address some of these limitations and improve the NMT attention.…

Computation and Language · Computer Science 2016-08-10 Baskaran Sankaran, Haitao Mi, Yaser Al-Onaizan, Abe Ittycheriah

Self-Attentive Residual Decoder for Neural Machine Translation

Neural sequence-to-sequence networks with attention have achieved remarkable performance for machine translation. One of the reasons for their effectiveness is their ability to capture relevant source-side contextual information at each…

Computation and Language · Computer Science 2018-10-02 Lesly Miculicich Werlen, Nikolaos Pappas, Dhananjay Ram, Andrei Popescu-Belis

English-Japanese Neural Machine Translation with Encoder-Decoder-Reconstructor

Neural machine translation (NMT) has recently become popular in the field of machine translation. However, NMT suffers from the problem of repeating or missing words in the translation. To address this problem, Tu et al. (2017) proposed an…

Computation and Language · Computer Science 2017-06-27 Yukio Matsumura, Takayuki Sato, Mamoru Komachi

Deconvolution-Based Global Decoding for Neural Machine Translation

A great proportion of sequence-to-sequence (Seq2Seq) models for Neural Machine Translation (NMT) adopt Recurrent Neural Network (RNN) to generate translation word by word following a sequential order. As the studies of linguistics have…

Computation and Language · Computer Science 2018-06-14 Junyang Lin, Xu Sun, Xuancheng Ren, Shuming Ma, Jinsong Su, Qi Su

Dense Information Flow for Neural Machine Translation

Recently, neural machine translation has achieved remarkable progress by introducing well-designed deep neural networks into its encoder-decoder framework. From the optimization perspective, residual connections are adopted to improve…

Computation and Language · Computer Science 2018-07-03 Yanyao Shen, Xu Tan, Di He, Tao Qin, Tie-Yan Liu

Universal Vector Neural Machine Translation With Effective Attention

Neural Machine Translation (NMT) leverages one or more trained neural networks for the translation of phrases. Sutskever introduced a sequence to sequence based encoder-decoder model which became the standard for NMT based systems.…

Computation and Language · Computer Science 2020-06-11 Satish Mylapore, Ryan Quincy Paul, Joshua Yi, Robert D. Slater

Incorporating Global Visual Features into Attention-Based Neural Machine Translation

We introduce multi-modal, attention-based neural machine translation (NMT) models which incorporate visual features into different parts of both the encoder and the decoder. We utilise global image features extracted using a pre-trained…

Computation and Language · Computer Science 2017-01-24 Iacer Calixto, Qun Liu, Nick Campbell

Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder

In this paper, we present our first attempts in building a multilingual Neural Machine Translation framework under a unified approach. We are then able to employ attention-based NMT for many-to-many multilingual translation tasks. Our…

Computation and Language · Computer Science 2016-11-16 Thanh-Le Ha, Jan Niehues, Alexander Waibel

Training Deeper Neural Machine Translation Models with Transparent Attention

While current state-of-the-art NMT models, such as RNN seq2seq and Transformers, possess a large number of parameters, they are still shallow in comparison to convolutional models used for both text and vision applications. In this work we…

Computation and Language · Computer Science 2018-09-06 Ankur Bapna, Mia Xu Chen, Orhan Firat, Yuan Cao, Yonghui Wu

Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input

Non-autoregressive translation (NAT) models, which remove the dependence on previous target tokens from the inputs of the decoder, achieve significant inference speedup but at the cost of inferior accuracy compared to autoregressive…

Computation and Language · Computer Science 2018-12-27 Junliang Guo, Xu Tan, Di He, Tao Qin, Linli Xu, Tie-Yan Liu