Related papers: A Simple, Fast Diverse Decoding Algorithm for Neur…

200 papers

Despite their growing capabilities, language models still frequently reproduce content from their training data, generate repetitive text, and favor common grammatical patterns and vocabulary. A possible cause is the decoding strategy: the…

Computation and Language · Computer Science 2026-01-15 Giorgio Franceschelli, Mirco Musolesi

This study mainly investigates two common decoding problems in neural keyphrase generation: sequence length bias and beam diversity. To tackle the problems, we introduce a beam search decoding strategy based on word-level and ngram-level…

Computation and Language · Computer Science 2023-10-31 Iftitahu Ni'mah, Vlado Menkovski, Mykola Pechenizkiy

Beam search is a go-to strategy for decoding neural sequence models. The algorithm can naturally be viewed as a subset optimization problem, albeit one where the corresponding set function does not reflect interactions between candidates.…

Computation and Language · Computer Science 2023-06-26 Clara Meister, Martina Forster, Ryan Cotterell

When generating text from probabilistic models, the chosen decoding strategy has a profound effect on the resulting text. Yet the properties elicited by various decoding strategies do not always transfer across natural language generation…

Computation and Language · Computer Science 2022-03-30 Gian Wiher, Clara Meister, Ryan Cotterell

Neural sequence models are widely used to model time-series data. Equally ubiquitous is the usage of beam search (BS) as an approximate inference algorithm to decode output sequences from these models. BS explores the search space in a…

Artificial Intelligence · Computer Science 2018-10-23 Ashwin K Vijayakumar, Michael Cogswell, Ramprasath R. Selvaraju, Qing Sun, Stefan Lee, David Crandall, Dhruv Batra

Decoding from the output distributions of large language models to produce high-quality text is a complex challenge in language modeling. Various approaches, such as beam search, sampling with temperature, $k$-sampling, nucleus…

Computation and Language · Computer Science 2024-10-22 Esteban Garces Arias, Julian Rodemann, Meimingwei Li, Christian Heumann, Matthias Aßenmacher

While conditional language models have greatly improved in their ability to output high-quality natural language, many NLP applications benefit from being able to generate a diverse set of candidate sequences. Diverse decoding strategies…

Computation and Language · Computer Science 2019-06-18 Daphne Ippolito, Reno Kriz, Maria Kustikova, João Sedoc, Chris Callison-Burch

Large Language Models (LLMs) often generate repetitive and monotonous outputs, especially in tasks like story generation, due to limited creative diversity when given the same input prompt. To address this challenge, we propose a novel…

Computation and Language · Computer Science 2025-09-04 Kyeongman Park, Nakyeong Yang, Kyomin Jung

One of the most important challenges in text generation systems is to produce outputs that are not only correct but also diverse. Recently, Minimum Bayes-Risk (MBR) decoding has gained prominence for generating sentences of the highest…

Computation and Language · Computer Science 2024-06-13 Yuu Jinnai, Ukyo Honda, Tetsuro Morimura, Peinan Zhang

Many language generation models are now available for a wide range of generation tasks, including machine translation and summarization. Combining such diverse models may lead to further progress, but ensembling generation models is…

Computation and Language · Computer Science 2022-10-31 Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Hao Peng, Ximing Lu, Dragomir Radev, Yejin Choi, Noah A. Smith

The basic concept in Neural Machine Translation (NMT) is to train a large Neural Network that maximizes the translation performance on a given parallel corpus. NMT is then using a simple left-to-right beam-search decoder to generate new…

Computation and Language · Computer Science 2018-12-19 Markus Freitag, Yaser Al-Onaizan

Neural machine translation models rely on the beam search algorithm for decoding. In practice, we found that the quality of hypotheses in the search space is negatively affected owing to the fixed beam size. To mitigate this problem, we…

Computation and Language · Computer Science 2017-07-11 Raphael Shu, Hideki Nakayama

We introduce Data Diversification: a simple but effective strategy to boost neural machine translation (NMT) performance. It diversifies the training data by using the predictions of multiple forward and backward models and then merging…

Computation and Language · Computer Science 2020-10-06 Xuan-Phi Nguyen, Shafiq Joty, Wu Kui, Ai Ti Aw

Modern natural language generation paradigms require a good decoding strategy to obtain quality sequences out of the model. Beam search yields high-quality but low diversity outputs; stochastic approaches suffer from high variance and…

Computation and Language · Computer Science 2022-11-23 Jiacheng Xu, Caiming Xiong, Silvio Savarese, Yingbo Zhou

Conditional neural text generation models generate high-quality outputs, but often concentrate around a mode when what we really want is a diverse set of options. We present a search algorithm to construct lattices encoding a massive number…

Computation and Language · Computer Science 2022-05-04 Jiacheng Xu, Siddhartha Reddy Jonnalagadda, Greg Durrett

For open-ended language generation tasks such as storytelling and dialogue, choosing the right decoding algorithm is critical to controlling the tradeoff between generation quality and diversity. However, there presently exists no consensus…

Computation and Language · Computer Science 2020-04-23 Hugh Zhang, Daniel Duckworth, Daphne Ippolito, Arvind Neelakantan

A novel and efficient neural decoder algorithm is proposed. The proposed decoder is based on the neural Belief Propagation algorithm and the Automorphism Group. By combining neural belief propagation with permutations from the Automorphism…

Information Theory · Computer Science 2018-01-10 Eliya Nachmani, Yaron Bachar, Elad Marciano, David Burshtein, Yair Be'ery

The performance of natural language generation systems has improved substantially with modern neural networks. At test time they typically employ beam search to avoid locally optimal but globally suboptimal predictions. However, due to…

Computation and Language · Computer Science 2021-03-18 James Hargreaves, Andreas Vlachos, Guy Emerson

Multiple heads decoding accelerates the inference of Large Language Models (LLMs) by predicting next several tokens simultaneously. It generates and verifies multiple candidate sequences in parallel via tree attention with a fixed…

Computer Vision and Pattern Recognition · Computer Science 2025-02-11 Zhendong Zhang

The decoding algorithm is critical for open-ended text generation, transforming latent representations into coherent and meaningful outputs. This paper investigates the self-reinforcement effect in text generation and the effectiveness of a…

Computation and Language · Computer Science 2023-10-24 Wenhong Zhu, Hongkun Hao, Rui Wang