Related papers: Massive-scale Decoding for Text Generation using L…

200 papers

We propose Composition Sampling, a simple but effective method to generate diverse outputs for conditional generation of higher quality compared to previous stochastic decoding strategies. It builds on recently proposed plan-based neural…

Computation and Language · Computer Science 2022-03-30 Shashi Narayan, Gonçalo Simões, Yao Zhao, Joshua Maynez, Dipanjan Das, Michael Collins, Mirella Lapata

Despite their growing capabilities, language models still frequently reproduce content from their training data, generate repetitive text, and favor common grammatical patterns and vocabulary. A possible cause is the decoding strategy: the…

Computation and Language · Computer Science 2026-01-15 Giorgio Franceschelli, Mirco Musolesi

Decoding strategies for generative large language models (LLMs) are a critical but often underexplored aspect of text generation tasks. Guided by specific hyperparameters, these strategies aim to transform the raw probability distributions…

Computation and Language · Computer Science 2024-12-17 Esteban Garces Arias, Meimingwei Li, Christian Heumann, Matthias Aßenmacher

Large language models (LMs) based on Transformers can generate plausible long texts. In this paper, we explore how this generation can be further controlled at decoding time to satisfy certain constraints (e.g. being non-toxic,…

Computation and Language · Computer Science 2022-05-05 Antoine Chaffin, Vincent Claveau, Ewa Kijak

A latent-variable model is introduced for text matching, inferring sentence representations by jointly optimizing generative and discriminative objectives. To alleviate typical optimization challenges in latent-variable models for text, we…

Computation and Language · Computer Science 2017-11-23 Dinghan Shen, Yizhe Zhang, Ricardo Henao, Qinliang Su, Lawrence Carin

When generating text from probabilistic models, the chosen decoding strategy has a profound effect on the resulting text. Yet the properties elicited by various decoding strategies do not always transfer across natural language generation…

Computation and Language · Computer Science 2022-03-30 Gian Wiher, Clara Meister, Ryan Cotterell

Decoding from the output distributions of large language models to produce high-quality text is a complex challenge in language modeling. Various approaches, such as beam search, sampling with temperature, $k$-sampling, nucleus…

Computation and Language · Computer Science 2024-10-22 Esteban Garces Arias, Julian Rodemann, Meimingwei Li, Christian Heumann, Matthias Aßenmacher

Conditional text generation often requires lexical constraints, i.e., which words should or shouldn't be included in the output text. While the dominant recipe for conditional text generation has been large-scale pretrained language models…

Computation and Language · Computer Science 2021-04-22 Ximing Lu, Peter West, Rowan Zellers, Ronan Le Bras, Chandra Bhagavatula, Yejin Choi

While conditional language models have greatly improved in their ability to output high-quality natural language, many NLP applications benefit from being able to generate a diverse set of candidate sequences. Diverse decoding strategies…

Computation and Language · Computer Science 2019-06-18 Daphne Ippolito, Reno Kriz, Maria Kustikova, João Sedoc, Chris Callison-Burch

Deep LSTM is an ideal candidate for text recognition. However text recognition involves some initial image processing steps like segmentation of lines and words which can induce error to the recognition system. Without segmentation,…

Computer Vision and Pattern Recognition · Computer Science 2015-02-27 Anupama Ray, Sai Rajeswar, Santanu Chaudhury

Generating long and coherent text is an important but challenging task, particularly for open-ended language generation tasks such as story generation. Despite the success in modeling intra-sentence coherence, existing generation models…

Computation and Language · Computer Science 2021-05-20 Jian Guan, Xiaoxi Mao, Changjie Fan, Zitao Liu, Wenbiao Ding, Minlie Huang

Text generation aims to produce human-like natural language output for down-stream tasks. It covers a wide range of applications like machine translation, document summarization, dialogue generation and so on. Recently deep neural…

Computation and Language · Computer Science 2022-03-07 Xiaoyu Shen

Current language models decode text token by token according to probabilistic distribution, and determining the appropriate candidates for the next token is crucial to ensure generation quality. This study introduces adaptive decoding, a…

Computation and Language · Computer Science 2024-06-04 Wenhong Zhu, Hongkun Hao, Zhiwei He, Yiming Ai, Rui Wang

When developing text classification models for real world applications, one major challenge is the difficulty to collect sufficient data for all text classes. In this work, we address this challenge by utilizing large language models (LLMs)…

Computation and Language · Computer Science 2025-08-15 Chenhao Xue, Yuanzhe Jin, Adrian Carrasco-Revilla, Joyraj Chakraborty, Min Chen

Deep learning methods have recently achieved great empirical success on machine translation, dialogue response generation, summarization, and other text generation tasks. At a high level, the technique has been to train end-to-end neural…

Computation and Language · Computer Science 2017-11-28 Ziang Xie

Many language generation models are now available for a wide range of generation tasks, including machine translation and summarization. Combining such diverse models may lead to further progress, but ensembling generation models is…

Computation and Language · Computer Science 2022-10-31 Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Hao Peng, Ximing Lu, Dragomir Radev, Yejin Choi, Noah A. Smith

Large pre-trained language models (LMs) have been shown to perform surprisingly well when fine-tuned on tasks that require commonsense and world knowledge. However, in end-to-end architectures, it is difficult to explain what is the…

Computation and Language · Computer Science 2020-04-14 Veronica Latcinnik, Jonathan Berant

One of the most striking findings in modern research on large language models (LLMs) is that scaling up compute during training leads to better results. However, less attention has been given to the benefits of scaling compute during…

Computation and Language · Computer Science 2024-11-21 Sean Welleck, Amanda Bertsch, Matthew Finlayson, Hailey Schoelkopf, Alex Xie, Graham Neubig, Ilia Kulikov, Zaid Harchaoui

Deep-learning models for language generation tasks tend to produce repetitive output. Various methods have been proposed to encourage lexical diversity during decoding, but this often comes at a cost to the perceived fluency and adequacy of…

Computation and Language · Computer Science 2021-09-22 Giulio Zhou, Gerasimos Lampouras

Recent advances in large language models (LLMs) have advanced the development of multilingual speech and machine translation through their reduced representation errors and incorporated external knowledge. However, both translation tasks…

Computation and Language · Computer Science 2024-05-17 Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng