Related papers: A Simple, Fast Diverse Decoding Algorithm for Neur…

DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation

Despite their growing capabilities, language models still frequently reproduce content from their training data, generate repetitive text, and favor common grammatical patterns and vocabulary. A possible cause is the decoding strategy: the…

Computation and Language · Computer Science 2026-01-15 Giorgio Franceschelli , Mirco Musolesi

BSDAR: Beam Search Decoding with Attention Reward in Neural Keyphrase Generation

This study mainly investigates two common decoding problems in neural keyphrase generation: sequence length bias and beam diversity. To tackle the problems, we introduce a beam search decoding strategy based on word-level and ngram-level…

Computation and Language · Computer Science 2023-10-31 Iftitahu Ni'mah , Vlado Menkovski , Mykola Pechenizkiy

Determinantal Beam Search

Beam search is a go-to strategy for decoding neural sequence models. The algorithm can naturally be viewed as a subset optimization problem, albeit one where the corresponding set function does not reflect interactions between candidates.…

Computation and Language · Computer Science 2023-06-26 Clara Meister , Martina Forster , Ryan Cotterell

On Decoding Strategies for Neural Text Generators

When generating text from probabilistic models, the chosen decoding strategy has a profound effect on the resulting text. Yet the properties elicited by various decoding strategies do not always transfer across natural language generation…

Computation and Language · Computer Science 2022-03-30 Gian Wiher , Clara Meister , Ryan Cotterell

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models

Neural sequence models are widely used to model time-series data. Equally ubiquitous is the usage of beam search (BS) as an approximate inference algorithm to decode output sequences from these models. BS explores the search space in a…

Artificial Intelligence · Computer Science 2018-10-23 Ashwin K Vijayakumar , Michael Cogswell , Ramprasath R. Selvaraju , Qing Sun , Stefan Lee , David Crandall , Dhruv Batra

Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation

Decoding from the output distributions of large language models to produce high-quality text is a complex challenge in language modeling. Various approaches, such as beam search, sampling with temperature, $k-$sampling, nucleus…

Computation and Language · Computer Science 2024-10-22 Esteban Garces Arias , Julian Rodemann , Meimingwei Li , Christian Heumann , Matthias Aßenmacher

Comparison of Diverse Decoding Methods from Conditional Language Models

While conditional language models have greatly improved in their ability to output high-quality natural language, many NLP applications benefit from being able to generate a diverse set of candidate sequences. Diverse decoding strategies…

Computation and Language · Computer Science 2019-06-18 Daphne Ippolito , Reno Kriz , Maria Kustikova , João Sedoc , Chris Callison-Burch

Avoidance Decoding for Diverse Multi-Branch Story Generation

Large Language Models (LLMs) often generate repetitive and monotonous outputs, especially in tasks like story generation, due to limited creative diversity when given the same input prompt. To address this challenge, we propose a novel…

Computation and Language · Computer Science 2025-09-04 Kyeongman Park , Nakyeong Yang , Kyomin Jung

Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding

One of the most important challenges in text generation systems is to produce outputs that are not only correct but also diverse. Recently, Minimum Bayes-Risk (MBR) decoding has gained prominence for generating sentences of the highest…

Computation and Language · Computer Science 2024-06-13 Yuu Jinnai , Ukyo Honda , Tetsuro Morimura , Peinan Zhang

Twist Decoding: Diverse Generators Guide Each Other

Many language generation models are now available for a wide range of generation tasks, including machine translation and summarization. Combining such diverse models may lead to further progress, but ensembling generation models is…

Computation and Language · Computer Science 2022-10-31 Jungo Kasai , Keisuke Sakaguchi , Ronan Le Bras , Hao Peng , Ximing Lu , Dragomir Radev , Yejin Choi , Noah A. Smith

Beam Search Strategies for Neural Machine Translation

The basic concept in Neural Machine Translation (NMT) is to train a large Neural Network that maximizes the translation performance on a given parallel corpus. NMT is then using a simple left-to-right beam-search decoder to generate new…

Computation and Language · Computer Science 2018-12-19 Markus Freitag , Yaser Al-Onaizan

Single-Queue Decoding for Neural Machine Translation

Neural machine translation models rely on the beam search algorithm for decoding. In practice, we found that the quality of hypotheses in the search space is negatively affected owing to the fixed beam size. To mitigate this problem, we…

Computation and Language · Computer Science 2017-07-11 Raphael Shu , Hideki Nakayama

Data Diversification: A Simple Strategy For Neural Machine Translation

We introduce Data Diversification: a simple but effective strategy to boost neural machine translation (NMT) performance. It diversifies the training data by using the predictions of multiple forward and backward models and then merging…

Computation and Language · Computer Science 2020-10-06 Xuan-Phi Nguyen , Shafiq Joty , Wu Kui , Ai Ti Aw

Best-$k$ Search Algorithm for Neural Text Generation

Modern natural language generation paradigms require a good decoding strategy to obtain quality sequences out of the model. Beam search yields high-quality but low diversity outputs; stochastic approaches suffer from high variance and…

Computation and Language · Computer Science 2022-11-23 Jiacheng Xu , Caiming Xiong , Silvio Savarese , Yingbo Zhou

Massive-scale Decoding for Text Generation using Lattices

Conditional neural text generation models generate high-quality outputs, but often concentrate around a mode when what we really want is a diverse set of options. We present a search algorithm to construct lattices encoding a massive number…

Computation and Language · Computer Science 2022-05-04 Jiacheng Xu , Siddhartha Reddy Jonnalagadda , Greg Durrett

Trading Off Diversity and Quality in Natural Language Generation

For open-ended language generation tasks such as storytelling and dialogue, choosing the right decoding algorithm is critical to controlling the tradeoff between generation quality and diversity. However, there presently exists no consensus…

Computation and Language · Computer Science 2020-04-23 Hugh Zhang , Daniel Duckworth , Daphne Ippolito , Arvind Neelakantan

Near Maximum Likelihood Decoding with Deep Learning

A novel and efficient neural decoder algorithm is proposed. The proposed decoder is based on the neural Belief Propagation algorithm and the Automorphism Group. By combining neural belief propagation with permutations from the Automorphism…

Information Theory · Computer Science 2018-01-10 Eliya Nachmani , Yaron Bachar , Elad Marciano , David Burshtein , Yair Be'ery

Incremental Beam Manipulation for Natural Language Generation

The performance of natural language generation systems has improved substantially with modern neural networks. At test time they typically employ beam search to avoid locally optimal but globally suboptimal predictions. However, due to…

Computation and Language · Computer Science 2021-03-18 James Hargreaves , Andreas Vlachos , Guy Emerson

Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention

Multiple heads decoding accelerates the inference of Large Language Models (LLMs) by predicting next several tokens simultaneously. It generates and verifies multiple candidate sequences in parallel via tree attention with a fixed…

Computer Vision and Pattern Recognition · Computer Science 2025-02-11 Zhendong Zhang

Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation

The decoding algorithm is critical for open-ended text generation, transforming latent representations into coherent and meaningful outputs. This paper investigates the self-reinforcement effect in text generation and the effectiveness of a…

Computation and Language · Computer Science 2023-10-24 Wenhong Zhu , Hongkun Hao , Rui Wang