Related papers: Single-Queue Decoding for Neural Machine Translati…

Beam Search Strategies for Neural Machine Translation

The basic concept in Neural Machine Translation (NMT) is to train a large Neural Network that maximizes the translation performance on a given parallel corpus. NMT is then using a simple left-to-right beam-search decoder to generate new…

Computation and Language · Computer Science 2018-12-19 Markus Freitag , Yaser Al-Onaizan

Learning to Stop in Structured Prediction for Neural Machine Translation

Beam search optimization resolves many issues in neural machine translation. However, this method lacks principled stopping criteria and does not learn how to stop during training, and the model naturally prefers the longer hypotheses…

Computation and Language · Computer Science 2019-06-26 Mingbo Ma , Renjie Zheng , Liang Huang

Quality-Aware Decoding: Unifying Quality Estimation and Decoding

Quality Estimation (QE) models for Neural Machine Translation (NMT) predict the quality of the hypothesis without having access to the reference. An emerging research direction in NMT involves the use of QE models, which have demonstrated…

Computation and Language · Computer Science 2025-06-03 Sai Koneru , Matthias Huck , Miriam Exel , Jan Niehues

High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics

In Neural Machine Translation, it is typically assumed that the sentence with the highest estimated probability should also be the translation with the highest quality as measured by humans. In this work, we question this assumption and…

Computation and Language · Computer Science 2022-04-27 Markus Freitag , David Grangier , Qijun Tan , Bowen Liang

Breaking the Beam Search Curse: A Study of (Re-)Scoring Methods and Stopping Criteria for Neural Machine Translation

Beam search is widely used in neural machine translation, and usually improves translation quality compared to greedy search. It has been widely observed that, however, beam sizes larger than 5 hurt translation quality. We explain why this…

Computation and Language · Computer Science 2018-10-30 Yilin Yang , Liang Huang , Mingbo Ma

Exploring Recombination for Efficient Decoding of Neural Machine Translation

In Neural Machine Translation (NMT), the decoder can capture the features of the entire prediction history with neural connections and representations. This means that partial hypotheses with different prefixes will be regarded differently…

Computation and Language · Computer Science 2018-10-16 Zhisong Zhang , Rui Wang , Masao Utiyama , Eiichiro Sumita , Hai Zhao

Speeding Up Neural Machine Translation Decoding by Cube Pruning

Although neural machine translation has achieved promising results, it suffers from slow translation speed. The direct consequence is that a trade-off has to be made between translation quality and speed, thus its performance can not come…

Computation and Language · Computer Science 2018-09-11 Wen Zhang , Liang Huang , Yang Feng , Lei Shen , Qun Liu

Correcting Length Bias in Neural Machine Translation

We study two problems in neural machine translation (NMT). First, in beam search, whereas a wider beam should in principle help translation, it often hurts NMT. Second, NMT has a tendency to produce translations that are too short. Here, we…

Computation and Language · Computer Science 2018-09-05 Kenton Murray , David Chiang

Digging Errors in NMT: Evaluating and Understanding Model Errors from Partial Hypothesis Space

Solid evaluation of neural machine translation (NMT) is key to its understanding and improvement. Current evaluation of an NMT system is usually built upon a heuristic decoding algorithm (e.g., beam search) and an evaluation metric…

Computation and Language · Computer Science 2022-10-11 Jianhao Yan , Chenming Wu , Fandong Meng , Jie Zhou

Neural Machine Translation via Binary Code Prediction

In this paper, we propose a new method for calculating the output layer in neural machine translation systems. The method is based on predicting a binary code for each word and can reduce computation time/memory requirements of the output…

Computation and Language · Computer Science 2017-04-25 Yusuke Oda , Philip Arthur , Graham Neubig , Koichiro Yoshino , Satoshi Nakamura

A Simple, Fast Diverse Decoding Algorithm for Neural Generation

In this paper, we propose a simple, fast decoding algorithm that fosters diversity in neural generation. The algorithm modifies the standard beam search algorithm by adding an inter-sibling ranking penalty, favoring choosing hypotheses from…

Computation and Language · Computer Science 2016-12-23 Jiwei Li , Will Monroe , Dan Jurafsky

Sampling-Based Approximations to Minimum Bayes Risk Decoding for Neural Machine Translation

In NMT we search for the mode of the model distribution to form predictions. The mode and other high-probability translations found by beam search have been shown to often be inadequate in a number of ways. This prevents improving…

Computation and Language · Computer Science 2022-10-26 Bryan Eikema , Wilker Aziz

When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size)

In neural text generation such as neural machine translation, summarization, and image captioning, beam search is widely used to improve the output text quality. However, in the neural generation setting, hypotheses can finish in different…

Computation and Language · Computer Science 2018-09-05 Liang Huang , Kai Zhao , Mingbo Ma

Vocabulary Selection Strategies for Neural Machine Translation

Classical translation models constrain the space of possible outputs by selecting a subset of translation rules based on the input sentence. Recent work on improving the efficiency of neural translation models adopted a similar strategy by…

Computation and Language · Computer Science 2016-10-04 Gurvan L'Hostis , David Grangier , Michael Auli

A Call for Clarity in Beam Search: How It Works and When It Stops

Text generation with beam search has proven successful in a wide range of applications. We point out that, though largely overlooked in the literature, the commonly-used implementation of beam decoding (e.g., Hugging Face Transformers and…

Computation and Language · Computer Science 2024-02-29 Jungo Kasai , Keisuke Sakaguchi , Ronan Le Bras , Dragomir Radev , Yejin Choi , Noah A. Smith

Analyzing Uncertainty in Neural Machine Translation

Machine translation is a popular test bed for research in neural sequence-to-sequence models but despite much recent research, there is still a lack of understanding of these models. Practitioners report performance degradation with large…

Computation and Language · Computer Science 2018-08-14 Myle Ott , Michael Auli , David Grangier , Marc'Aurelio Ranzato

Identifying Weaknesses in Machine Translation Metrics Through Minimum Bayes Risk Decoding: A Case Study for COMET

Neural metrics have achieved impressive correlation with human judgements in the evaluation of machine translation systems, but before we can safely optimise towards such metrics, we should be aware of (and ideally eliminate) biases toward…

Computation and Language · Computer Science 2022-09-27 Chantal Amrhein , Rico Sennrich

Neural Machine Translation by Jointly Learning to Align and Translate

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to…

Computation and Language · Computer Science 2016-05-23 Dzmitry Bahdanau , Kyunghyun Cho , Yoshua Bengio

Quality-Aware Decoding for Neural Machine Translation

Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to…

Computation and Language · Computer Science 2022-05-03 Patrick Fernandes , António Farinhas , Ricardo Rei , José G. C. de Souza , Perez Ogayo , Graham Neubig , André F. T. Martins

Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective

Neural Machine Translation (NMT) has made remarkable progress over the past years. However, under-translation and over-translation remain two challenging problems in state-of-the-art NMT systems. In this work, we conduct an in-depth…

Computation and Language · Computer Science 2024-05-30 Chenze Shao , Fandong Meng , Jiali Zeng , Jie Zhou