Related papers: Locally Typical Sampling

On the probability-quality paradox in language generation

When generating natural language from neural probabilistic models, high probability does not always coincide with high quality: It has often been observed that mode-seeking decoding methods, i.e., those that produce high-probability text…

Computation and Language · Computer Science 2022-04-01 Clara Meister, Gian Wiher, Tiago Pimentel, Ryan Cotterell

On the Efficacy of Sampling Adapters

Sampling is a common strategy for generating text from probabilistic models, yet standard ancestral sampling often results in text that is incoherent or ungrammatical. To alleviate this issue, various modifications to a model's sampling…

Computation and Language · Computer Science 2024-01-08 Clara Meister, Tiago Pimentel, Luca Malagutti, Ethan G. Wilcox, Ryan Cotterell

Linguistic Structure from a Bottleneck on Sequential Information Processing

Human language has a distinct systematic structure, where utterances break into individually meaningful words which are combined to form phrases. We show that natural-language-like systematicity arises in codes that are constrained by a…

Computation and Language · Computer Science 2025-11-19 Richard Futrell, Michael Hahn

Language Model Evaluation Beyond Perplexity

We propose an alternate approach to quantifying how well language models learn natural language: we ask how well they match the statistical tendencies of natural language. To answer this question, we analyze whether text generated from…

Computation and Language · Computer Science 2021-08-31 Clara Meister, Ryan Cotterell

DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation

Despite their growing capabilities, language models still frequently reproduce content from their training data, generate repetitive text, and favor common grammatical patterns and vocabulary. A possible cause is the decoding strategy: the…

Computation and Language · Computer Science 2026-01-15 Giorgio Franceschelli, Mirco Musolesi

Language Model Decoding as Direct Metrics Optimization

Despite the remarkable advances in language modeling, current mainstream decoding methods still struggle to generate texts that align with human texts across different aspects. In particular, sampling-based methods produce less-repetitive…

Computation and Language · Computer Science 2024-06-06 Haozhe Ji, Pei Ke, Hongning Wang, Minlie Huang

The Curious Case of Neural Text Degeneration

Despite considerable advancements with deep neural language models, the enigma of neural text degeneration persists when these models are tested as text generators. The counter-intuitive empirical observation is that even though the use of…

Computation and Language · Computer Science 2020-02-18 Ari Holtzman, Jan Buys, Li Du, Maxwell Forbes, Yejin Choi

Neural Text Generation with Unlikelihood Training

Neural text generation is a key tool in natural language applications, but it is well known there are major problems at its core. In particular, standard likelihood training and decoding leads to dull and repetitive outputs. While some…

Machine Learning · Computer Science 2019-09-30 Sean Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, Jason Weston

Taming Repetition in Dialogue Generation

The wave of pre-training language models has been continuously improving the quality of the machine-generated conversations; however, some of the generated responses still suffer from excessive repetition, sometimes repeating words from…

Computation and Language · Computer Science 2021-12-17 Yadong Xi, Jiashu Pu, Xiaoxi Mao

Deterministic or probabilistic? The psychology of LLMs as random number generators

Large Language Models (LLMs) have transformed text generation through inherently probabilistic context-aware mechanisms, mimicking human natural language. In this paper, we systematically investigate the performance of various LLMs when…

Computation and Language · Computer Science 2025-02-28 Javier Coronado-Blázquez

Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems

Natural language generation (NLG) is a critical component of spoken dialogue and it has a significant impact both on usability and perceived quality. Most NLG systems in common use employ rules and heuristics and tend to generate rigid and…

Computation and Language · Computer Science 2015-08-27 Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, Steve Young

Neural Language Generation: Formulation, Methods, and Evaluation

Recent advances in neural network-based generative modeling have reignited the hopes in having computer systems capable of seamlessly conversing with humans and able to understand natural language. Neural architectures have been employed to…

Computation and Language · Computer Science 2020-08-03 Cristina Garbacea, Qiaozhu Mei

Evaluating Distributional Distortion in Neural Language Modeling

A fundamental characteristic of natural language is the high rate at which speakers produce novel expressions. Because of this novelty, a heavy-tail of rare events accounts for a significant amount of the total probability mass of…

Computation and Language · Computer Science 2022-03-25 Benjamin LeBrun, Alessandro Sordoni, Timothy J. O'Donnell

A Computational Framework for Slang Generation

Slang is a common type of informal language, but its flexible nature and paucity of data resources present challenges for existing natural language systems. We take an initial step toward machine generation of slang by developing a…

Computation and Language · Computer Science 2021-05-25 Zhewei Sun, Richard Zemel, Yang Xu

Entropy in Large Language Models

In this study, the output of large language models (LLM) is considered an information source generating an unlimited sequence of symbols drawn from a finite alphabet. Given the probabilistic nature of modern LLMs, we assume a probabilistic…

Computation and Language · Computer Science 2026-02-24 Marco Scharringhausen

Why is constrained neural language generation particularly challenging?

Recent advances in deep neural language models combined with the capacity of large scale datasets have accelerated the development of natural language generation systems that produce fluent and coherent texts (to various degrees of success)…

Computation and Language · Computer Science 2025-04-15 Cristina Garbacea, Qiaozhu Mei

Pragmatically Informative Text Generation

We improve the informativeness of models for conditional text generation using techniques from computational pragmatics. These techniques formulate language production as a game between speakers and listeners, in which a speaker should…

Computation and Language · Computer Science 2019-04-05 Sheng Shen, Daniel Fried, Jacob Andreas, Dan Klein

Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution

We present a setup for training, evaluating and interpreting neural language models, that uses artificial, language-like data. The data is generated using a massive probabilistic grammar (based on state-split PCFGs), that is itself derived…

Computation and Language · Computer Science 2023-10-24 Jaap Jumelet, Willem Zuidema

Curious Case of Language Generation Evaluation Metrics: A Cautionary Tale

Automatic evaluation of language generation systems is a well-studied problem in Natural Language Processing. While novel metrics are proposed every year, a few popular metrics remain as the de facto metrics to evaluate tasks such as image…

Computation and Language · Computer Science 2020-10-27 Ozan Caglayan, Pranava Madhyastha, Lucia Specia

Truncation Sampling as Language Model Desmoothing

Long samples of text from neural language models can be of poor quality. Truncation sampling algorithms, like top-$p$ or top-$k$, address this by setting some words' probabilities to zero at each step. This work provides framing for the…

Computation and Language · Computer Science 2022-10-28 John Hewitt, Christopher D. Manning, Percy Liang