Related papers: Efficient Bidirectional Neural Machine Translation

Sequence Generation: From Both Sides to the Middle

The encoder-decoder framework has achieved promising progress for many sequence generation tasks, such as neural machine translation and text summarization. Such a framework usually generates a sequence token by token from left to right,…

Computation and Language · Computer Science 2019-06-25 Long Zhou, Jiajun Zhang, Chengqing Zong, Heng Yu

Fast Interleaved Bidirectional Sequence Generation

Independence assumptions during sequence generation can speed up inference, but parallel generation of highly inter-dependent tokens comes at a cost in quality. Instead of assuming independence between neighbouring tokens…

Computation and Language · Computer Science 2020-10-28 Biao Zhang, Ivan Titov, Rico Sennrich

Synchronous Bidirectional Neural Machine Translation

Existing approaches to neural machine translation (NMT) generate the target language sequence token by token from left to right. However, this kind of unidirectional decoding framework cannot make full use of the target-side future contexts…

Computation and Language · Computer Science 2019-05-14 Long Zhou, Jiajun Zhang, Chengqing Zong

Asynchronous Bidirectional Decoding for Neural Machine Translation

The dominant neural machine translation (NMT) models apply unified attentional encoder-decoder neural networks for translation. Traditionally, the NMT decoders adopt recurrent neural networks (RNNs) to perform translation in a left-to-right…

Computation and Language · Computer Science 2018-02-06 Xiangwen Zhang, Jinsong Su, Yue Qin, Yang Liu, Rongrong Ji, Hongji Wang

A Framework for Bidirectional Decoding: Case Study in Morphological Inflection

Transformer-based encoder-decoder models that generate outputs in a left-to-right fashion have become standard for sequence-to-sequence tasks. In this paper, we propose a framework for decoding that produces sequences from the "outside-in":…

Computation and Language · Computer Science 2023-10-31 Marc E. Canby, Julia Hockenmaier

Asynchronous and Segmented Bidirectional Encoding for NMT

With the rapid advancement of Neural Machine Translation (NMT), enhancing translation efficiency and quality has become a focal point of research. Despite the commendable performance of general models such as the Transformer in various…

Computation and Language · Computer Science 2024-02-26 Jingpu Yang, Zehua Han, Mengyu Xiang, Helin Wang, Yuxiao Huang, Miao Fang

Synchronous Bidirectional Inference for Neural Sequence Generation

In sequence to sequence generation tasks (e.g. machine translation and abstractive summarization), inference is generally performed in a left-to-right manner to produce the result token by token. The neural approaches, such as LSTM and…

Computation and Language · Computer Science 2019-02-26 Jiajun Zhang, Long Zhou, Yang Zhao, Chengqing Zong

Two-Way Neural Machine Translation: A Proof of Concept for Bidirectional Translation Modeling using a Two-Dimensional Grid

Neural translation models have proven to be effective in capturing sufficient information from a source sentence and generating a high-quality target sentence. However, it is not easy to get the best effect for bidirectional translation,…

Computation and Language · Computer Science 2020-11-25 Parnia Bahar, Christopher Brix, Hermann Ney

Neural Machine Transliteration: Preliminary Results

Machine transliteration is the process of automatically transforming the script of a word from a source language to a target language, while preserving pronunciation. Sequence to sequence learning has recently emerged as a new paradigm in…

Computation and Language · Computer Science 2016-09-15 Amir H. Jadidinejad

Attending to Future Tokens For Bidirectional Sequence Generation

Neural sequence generation is typically performed token-by-token and left-to-right. Whenever a token is generated only previously produced tokens are taken into consideration. In contrast, for problems such as sequence classification,…

Machine Learning · Statistics 2019-09-18 Carolin Lawrence, Bhushan Kotnis, Mathias Niepert

Neural Machine Translation by Jointly Learning to Align and Translate

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to…

Computation and Language · Computer Science 2016-05-23 Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio

Mask-Predict: Parallel Decoding of Conditional Masked Language Models

Most machine translation systems generate text autoregressively from left to right. We, instead, use a masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…

Computation and Language · Computer Science 2019-09-05 Marjan Ghazvininejad, Omer Levy, Yinhan Liu, Luke Zettlemoyer

Context- and Sequence-Aware Convolutional Recurrent Encoder for Neural Machine Translation

A Neural Machine Translation model is a sequence-to-sequence converter based on neural networks. Existing models use recurrent neural networks to construct both the encoder and decoder modules. In alternative research, the recurrent networks…

Computation and Language · Computer Science 2021-05-04 Ritam Mallick, Seba Susan, Vaibhaw Agrawal, Rizul Garg, Prateek Rawal

Neural Machine Translation in Linear Time

We present a novel neural network for processing sequences. The ByteNet is a one-dimensional convolutional neural network that is composed of two parts, one to encode the source sequence and the other to decode the target sequence. The two…

Computation and Language · Computer Science 2017-03-17 Nal Kalchbrenner, Lasse Espeholt, Karen Simonyan, Aaron van den Oord, Alex Graves, Koray Kavukcuoglu

Source-side Prediction for Neural Headline Generation

The encoder-decoder model is widely used in natural language generation tasks. However, the model sometimes suffers from repeated redundant generation, misses important phrases, and includes irrelevant entities. Toward solving these…

Computation and Language · Computer Science 2017-12-25 Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

Sequence Modeling with Unconstrained Generation Order

The dominant approach to sequence generation is to produce a sequence in some predefined order, e.g. left to right. In contrast, we propose a more general model that can generate the output sequence by inserting tokens in any arbitrary…

Computation and Language · Computer Science 2019-11-04 Dmitrii Emelianenko, Elena Voita, Pavel Serdyukov

Bi-Decoder Augmented Network for Neural Machine Translation

Neural Machine Translation (NMT) has become a popular technology in recent years, and the encoder-decoder framework is the mainstream among all the methods. It's obvious that the quality of the semantic representations from encoding is very…

Computation and Language · Computer Science 2020-01-15 Boyuan Pan, Yazheng Yang, Zhou Zhao, Yueting Zhuang, Deng Cai

Deconvolution-Based Global Decoding for Neural Machine Translation

A great proportion of sequence-to-sequence (Seq2Seq) models for Neural Machine Translation (NMT) adopt Recurrent Neural Network (RNN) to generate translation word by word following a sequential order. As the studies of linguistics have…

Computation and Language · Computer Science 2018-06-14 Junyang Lin, Xu Sun, Xuancheng Ren, Shuming Ma, Jinsong Su, Qi Su

Transformer with Bidirectional Decoder for Speech Recognition

Attention-based models have made tremendous progress on end-to-end automatic speech recognition (ASR) recently. However, the conventional transformer-based approaches usually generate the sequence results token by token from left to right,…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-12 Xi Chen, Songyang Zhang, Dandan Song, Peng Ouyang, Shouyi Yin

Non-Autoregressive Machine Translation with Disentangled Context Transformer

State-of-the-art neural machine translation models generate a translation from left to right and every step is conditioned on the previously generated tokens. The sequential nature of this generation process causes fundamental latency in…

Computation and Language · Computer Science 2020-07-01 Jungo Kasai, James Cross, Marjan Ghazvininejad, Jiatao Gu