Related papers: Model-Based Minimum Bayes Risk Decoding for Text G…

mbrs: A Library for Minimum Bayes Risk Decoding

Minimum Bayes risk (MBR) decoding is a decision rule for text generation tasks that outperforms conventional maximum a posteriori (MAP) decoding using beam search by selecting high-quality outputs based on a utility function rather than those…

Computation and Language · Computer Science 2024-10-22 Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe

Faster Minimum Bayes Risk Decoding with Confidence-based Pruning

Minimum Bayes risk (MBR) decoding outputs the hypothesis with the highest expected utility over the model distribution for some utility function. It has been shown to improve accuracy over beam search in conditional language generation…

Computation and Language · Computer Science 2023-11-28 Julius Cheng, Andreas Vlachos

Linear-time Minimum Bayes Risk Decoding with Reference Aggregation

Minimum Bayes Risk (MBR) decoding is a text generation technique that has been shown to improve the quality of machine translations, but is expensive, even if a sampling-based approximation is used. Besides requiring a large number of…

Computation and Language · Computer Science 2024-06-04 Jannis Vamvas, Rico Sennrich

Uncertainty-Aware Decoding with Minimum Bayes Risk

Despite their outstanding performance in the majority of scenarios, contemporary language models still occasionally generate undesirable outputs, for example, hallucinated text. While such behaviors have previously been linked to…

Computation and Language · Computer Science 2025-03-10 Nico Daheim, Clara Meister, Thomas Möllenhoff, Iryna Gurevych

Theoretical Guarantees for Minimum Bayes Risk Decoding

Minimum Bayes Risk (MBR) decoding optimizes output selection by maximizing the expected utility value of an underlying human distribution. While prior work has shown the effectiveness of MBR decoding through empirical evaluation, few…

Computation and Language · Computer Science 2025-06-23 Yuki Ichihara, Yuu Jinnai, Kaito Ariu, Tetsuro Morimura, Eiji Uchibe

It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk

Minimum Bayes Risk (MBR) decoding is a method for choosing the outputs of a machine learning system based not on the output with the highest probability, but the output with the lowest risk (expected error) among multiple candidates. It is…

Computation and Language · Computer Science 2023-10-03 Amanda Bertsch, Alex Xie, Graham Neubig, Matthew R. Gormley

Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation

Maximum a posteriori decoding, a commonly used method for neural machine translation (NMT), aims to maximize the estimated posterior probability. However, high estimated probability does not always lead to high translation quality. Minimum…

Computation and Language · Computer Science 2025-05-27 Boxuan Lyu, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura

Case-Based Decision-Theoretic Decoding with Quality Memories

Minimum Bayes risk (MBR) decoding is a decision rule for text generation that selects the hypothesis maximizing the expected utility and robustly generates higher-quality texts than maximum a posteriori (MAP) decoding. However, it…

Computation and Language · Computer Science 2025-09-17 Hiroyuki Deguchi, Masaaki Nagata

Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms

Minimum Bayes Risk (MBR) decoding is a powerful decoding strategy widely used for text generation tasks, but its quadratic computational complexity limits its practical application. This paper presents a novel approach for approximating MBR…

Computation and Language · Computer Science 2024-06-06 Firas Trabelsi, David Vilar, Mara Finkelstein, Markus Freitag

Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation

Recent advances in machine translation (MT) have shown that Minimum Bayes Risk (MBR) decoding can be a powerful alternative to beam search decoding, especially when combined with neural-based utility functions. However, the performance of…

Computation and Language · Computer Science 2023-05-19 Markus Freitag, Behrooz Ghorbani, Patrick Fernandes

High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics

In Neural Machine Translation, it is typically assumed that the sentence with the highest estimated probability should also be the translation with the highest quality as measured by humans. In this work, we question this assumption and…

Computation and Language · Computer Science 2022-04-27 Markus Freitag, David Grangier, Qijun Tan, Bowen Liang

Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding

One of the most important challenges in text generation systems is to produce outputs that are not only correct but also diverse. Recently, Minimum Bayes-Risk (MBR) decoding has gained prominence for generating sentences of the highest…

Computation and Language · Computer Science 2024-06-13 Yuu Jinnai, Ukyo Honda, Tetsuro Morimura, Peinan Zhang

Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech Recognition

Recent work has shown that sample-based Minimum Bayes Risk (MBR) decoding outperforms beam search in text-to-text generation tasks, such as machine translation, text summarization, and image captioning. On the other hand, beam search is the…

Computation and Language · Computer Science 2026-05-14 Yuu Jinnai

On the True Distribution Approximation of Minimum Bayes-Risk Decoding

Minimum Bayes-risk (MBR) decoding has recently gained renewed attention in text generation. MBR decoding considers texts sampled from a model as pseudo-references and selects the text with the highest similarity to the others. Therefore,…

Computation and Language · Computer Science 2024-04-02 Atsumoto Ohashi, Ukyo Honda, Tetsuro Morimura, Yuu Jinnai

Sampling-Based Approximations to Minimum Bayes Risk Decoding for Neural Machine Translation

In NMT we search for the mode of the model distribution to form predictions. The mode and other high-probability translations found by beam search have been shown to often be inadequate in a number of ways. This prevents improving…

Computation and Language · Computer Science 2022-10-26 Bryan Eikema, Wilker Aziz

Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport

Document-level text generation tasks are known to be more difficult than sentence-level text generation tasks as they require the understanding of longer context to generate high-quality texts. In this paper, we investigate the adaption of…

Computation and Language · Computer Science 2025-05-30 Yuu Jinnai

Agreement-Constrained Probabilistic Minimum Bayes Risk Decoding

Minimum Bayes risk (MBR) decoding generates high-quality translations by maximizing the expected utility of output candidates, but it evaluates all pairwise scores over the candidate set; hence, it takes quadratic time with respect to the…

Computation and Language · Computer Science 2025-12-02 Koki Natsumi, Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding

Minimum Bayes-Risk (MBR) decoding has been shown to be a powerful alternative to beam search decoding for a wide range of text generation tasks. However, MBR requires a huge amount of time for inference to compute the MBR objective, which makes…

Artificial Intelligence · Computer Science 2024-06-13 Yuu Jinnai, Kaito Ariu

Improving Minimum Bayes Risk Decoding with Multi-Prompt

While instruction fine-tuned LLMs are effective text generators, sensitivity to prompt construction makes performance unstable and sub-optimal in practice. Relying on a single "best" prompt cannot capture all differing approaches to a…

Computation and Language · Computer Science 2024-10-07 David Heineman, Yao Dou, Wei Xu

Later-stage Minimum Bayes-Risk Decoding for Neural Machine Translation

For a long time, sequence generation models have relied on the beam search algorithm to generate output sequences. However, the correctness of beam search degrades when a model is over-confident about a suboptimal prediction. In this…

Computation and Language · Computer Science 2017-06-09 Raphael Shu, Hideki Nakayama