English
Related papers

Related papers: Centroid-Based Efficient Minimum Bayes Risk Decodi…

200 papers

This paper explores Minimum Bayes Risk (MBR) decoding for self-improvement in machine translation (MT), particularly for domain adaptation and low-resource languages. We implement the self-improvement process by fine-tuning the model on its…

Computation and Language · Computer Science 2024-05-21 Kamil Guttmann , Mikołaj Pokrywka , Adrian Charkiewicz , Artur Nowakowski

Minimum Bayes Risk (MBR) decoding is a powerful decoding strategy widely used for text generation tasks, but its quadratic computational complexity limits its practical application. This paper presents a novel approach for approximating MBR…

Computation and Language · Computer Science 2024-06-06 Firas Trabelsi , David Vilar , Mara Finkelstein , Markus Freitag

Minimum Bayes risk (MBR) decoding generates high-quality translations by maximizing the expected utility of output candidates, but it evaluates all pairwise scores over the candidate set; hence, it takes quadratic time with respect to the…

Computation and Language · Computer Science 2025-12-02 Koki Natsumi , Hiroyuki Deguchi , Yusuke Sakai , Hidetaka Kamigaito , Taro Watanabe

Minimum Bayes Risk (MBR) decoding is a text generation technique that has been shown to improve the quality of machine translations, but is expensive, even if a sampling-based approximation is used. Besides requiring a large number of…

Computation and Language · Computer Science 2024-06-04 Jannis Vamvas , Rico Sennrich

While Minimum Bayes Risk (MBR) decoding using metrics such as COMET or MetricX has outperformed traditional decoding methods such as greedy or beam search, it introduces a challenge we refer to as metric bias. As MBR decoding aims to…

Computation and Language · Computer Science 2024-11-07 Geza Kovacs , Daniel Deutsch , Markus Freitag

Minimum Bayes-Risk (MBR) decoding is shown to be a powerful alternative to beam search decoding for a wide range of text generation tasks. However, MBR requires a huge amount of time for inference to compute the MBR objective, which makes…

Artificial Intelligence · Computer Science 2024-06-13 Yuu Jinnai , Kaito Ariu

Maximum a posteriori decoding, a commonly used method for neural machine translation (NMT), aims to maximize the estimated posterior probability. However, high estimated probability does not always lead to high translation quality. Minimum…

Computation and Language · Computer Science 2025-05-27 Boxuan Lyu , Hidetaka Kamigaito , Kotaro Funakoshi , Manabu Okumura

Minimum Bayes risk (MBR) decoding outputs the hypothesis with the highest expected utility over the model distribution for some utility function. It has been shown to improve accuracy over beam search in conditional language generation…

Computation and Language · Computer Science 2023-11-28 Julius Cheng , Andreas Vlachos

Minimum Bayes risk (MBR) decoding is a decision rule of text generation, which selects the hypothesis that maximizes the expected utility and robustly generates higher-quality texts than maximum a posteriori (MAP) decoding. However, it…

Computation and Language · Computer Science 2025-09-17 Hiroyuki Deguchi , Masaaki Nagata

Minimum Bayes Risk (MBR) decoding optimizes output selection by maximizing the expected utility value of an underlying human distribution. While prior work has shown the effectiveness of MBR decoding through empirical evaluation, few…

Computation and Language · Computer Science 2025-06-23 Yuki Ichihara , Yuu Jinnai , Kaito Ariu , Tetsuro Morimura , Eiji Uchibe

Neural metrics have achieved impressive correlation with human judgements in the evaluation of machine translation systems, but before we can safely optimise towards such metrics, we should be aware of (and ideally eliminate) biases toward…

Computation and Language · Computer Science 2022-09-27 Chantal Amrhein , Rico Sennrich

In Neural Machine Translation, it is typically assumed that the sentence with the highest estimated probability should also be the translation with the highest quality as measured by humans. In this work, we question this assumption and…

Computation and Language · Computer Science 2022-04-27 Markus Freitag , David Grangier , Qijun Tan , Bowen Liang

Minimum Bayes risk (MBR) decoding is a decision rule of text generation tasks that outperforms conventional maximum a posterior (MAP) decoding using beam search by selecting high-quality outputs based on a utility function rather than those…

Computation and Language · Computer Science 2024-10-22 Hiroyuki Deguchi , Yusuke Sakai , Hidetaka Kamigaito , Taro Watanabe

Neural Machine Translation (NMT) currently exhibits biases such as producing translations that are too short and overgenerating frequent words, and shows poor robustness to copy noise in training data or domain shift. Recent work has tied…

Computation and Language · Computer Science 2021-05-19 Mathias Müller , Rico Sennrich

For extended periods of time, sequence generation models rely on beam search algorithm to generate output sequence. However, the correctness of beam search degrades when the a model is over-confident about a suboptimal prediction. In this…

Computation and Language · Computer Science 2017-06-09 Raphael Shu , Hideki Nakayama

Minimum Bayes Risk (MBR) decoding has been shown to be a powerful alternative to beam search decoding in a variety of text generation tasks. MBR decoding selects a hypothesis from a pool of hypotheses that has the least expected risk under…

Artificial Intelligence · Computer Science 2024-06-13 Yuu Jinnai , Tetsuro Morimura , Ukyo Honda , Kaito Ariu , Kenshi Abe

Error Span Detection (ESD) extends automatic machine translation (MT) evaluation by localizing translation errors and labeling their severity. Current generative ESD methods typically use Maximum a Posteriori (MAP) decoding, assuming that…

Computation and Language · Computer Science 2026-01-01 Boxuan Lyu , Haiyue Song , Hidetaka Kamigaito , Chenchen Ding , Hideki Tanaka , Masao Utiyama , Kotaro Funakoshi , Manabu Okumura

In NMT we search for the mode of the model distribution to form predictions. The mode and other high-probability translations found by beam search have been shown to often be inadequate in a number of ways. This prevents improving…

Computation and Language · Computer Science 2022-10-26 Bryan Eikema , Wilker Aziz

Recent advances in machine translation (MT) have shown that Minimum Bayes Risk (MBR) decoding can be a powerful alternative to beam search decoding, especially when combined with neural-based utility functions. However, the performance of…

Computation and Language · Computer Science 2023-05-19 Markus Freitag , Behrooz Ghorbani , Patrick Fernandes

Beam search is the most widely used decoding method for neural machine translation (NMT). In practice, the top-1 candidate with the highest log-probability among the n candidates is selected as the preferred one. However, this top-1…

Computation and Language · Computer Science 2022-03-02 Yidan Zhang , Yu Wan , Dayiheng Liu , Baosong Yang , Zhenan He
‹ Prev 1 2 3 10 Next ›