Related papers: Language Model Evaluation Beyond Perplexity

Evaluating Distributional Distortion in Neural Language Modeling

A fundamental characteristic of natural language is the high rate at which speakers produce novel expressions. Because of this novelty, a heavy-tail of rare events accounts for a significant amount of the total probability mass of…

Computation and Language · Computer Science 2022-03-25 Benjamin LeBrun , Alessandro Sordoni , Timothy J. O'Donnell

Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution

We present a setup for training, evaluating and interpreting neural language models, that uses artificial, language-like data. The data is generated using a massive probabilistic grammar (based on state-split PCFGs), that is itself derived…

Computation and Language · Computer Science 2023-10-24 Jaap Jumelet , Willem Zuidema

Locally Typical Sampling

Today's probabilistic language generators fall short when it comes to producing coherent and fluent text despite the fact that the underlying models perform well under standard metrics, e.g., perplexity. This discrepancy has puzzled the…

Computation and Language · Computer Science 2025-06-06 Clara Meister , Tiago Pimentel , Gian Wiher , Ryan Cotterell

Do language models accommodate their users? A study of linguistic convergence

While large language models (LLMs) are generally considered proficient in generating language, how similar their language usage is to that of humans remains understudied. In this paper, we test whether models exhibit linguistic convergence,…

Computation and Language · Computer Science 2026-02-13 Terra Blevins , Susanne Schmalwieser , Benjamin Roth

Do language models learn typicality judgments from text?

Building on research arguing for the possibility of conceptual and categorical knowledge acquisition through statistics contained in language, we evaluate predictive language models (LMs) -- informed solely by textual input -- on a…

Computation and Language · Computer Science 2021-05-10 Kanishka Misra , Allyson Ettinger , Julia Taylor Rayz

Using Artificial Populations to Study Psychological Phenomena in Neural Models

The recent proliferation of research into transformer based natural language processing has led to a number of studies which attempt to detect the presence of human-like cognitive behavior in the models. We contend that, as is true of human…

Computation and Language · Computer Science 2024-04-01 Jesse Roberts , Kyle Moore , Drew Wilenzick , Doug Fisher

A Distributional Perspective on Word Learning in Neural Language Models

Language models (LMs) are increasingly being studied as models of human language learners. Due to the nascency of the field, it is not well-established whether LMs exhibit similar learning dynamics to humans, and there are few direct…

Computation and Language · Computer Science 2025-02-11 Filippo Ficarra , Ryan Cotterell , Alex Warstadt

An Overview on Language Models: Recent Developments and Outlook

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei , Yun-Cheng Wang , Bin Wang , C. -C. Jay Kuo

Evaluating Neural Language Models as Cognitive Models of Language Acquisition

The success of neural language models (LMs) on many technological tasks has brought about their potential relevance as scientific theories of language despite some clear differences between LM training and child language acquisition. In…

Computation and Language · Computer Science 2026-03-30 Héctor Javier Vázquez Martínez , Annika Lea Heuser , Charles Yang , Jordan Kodner

The Grammar-Learning Trajectories of Neural Language Models

The learning trajectories of linguistic phenomena in humans provide insight into linguistic representation, beyond what can be gleaned from inspecting the behavior of an adult speaker. To apply a similar approach to analyze neural language…

Computation and Language · Computer Science 2022-04-07 Leshem Choshen , Guy Hacohen , Daphna Weinshall , Omri Abend

What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages

What can large language models learn? By definition, language models (LM) are distributions over strings. Therefore, an intuitive way of addressing the above question is to formalize it as a matter of learnability of classes of…

Computation and Language · Computer Science 2025-01-14 Nadav Borenstein , Anej Svete , Robin Chan , Josef Valvoda , Franz Nowak , Isabelle Augenstein , Eleanor Chodroff , Ryan Cotterell

Neural Language Models are not Born Equal to Fit Brain Data, but Training Helps

Neural Language Models (NLMs) have made tremendous advances during the last years, achieving impressive performance on various linguistic tasks. Capitalizing on this, studies in neuroscience have started to use NLMs to study neural activity…

Artificial Intelligence · Computer Science 2022-07-08 Alexandre Pasquiou , Yair Lakretz , John Hale , Bertrand Thirion , Christophe Pallier

Language Model Decoding as Direct Metrics Optimization

Despite the remarkable advances in language modeling, current mainstream decoding methods still struggle to generate texts that align with human texts across different aspects. In particular, sampling-based methods produce less-repetitive…

Computation and Language · Computer Science 2024-06-06 Haozhe Ji , Pei Ke , Hongning Wang , Minlie Huang

Integrating Linguistic Theory and Neural Language Models

Transformer-based language models have recently achieved remarkable results in many natural language tasks. However, performance on leaderboards is generally achieved by leveraging massive amounts of training data, and rarely by encoding…

Computation and Language · Computer Science 2022-07-21 Bai Li

Are LLMs Models of Distributional Semantics? A Case Study on Quantifiers

Distributional semantics is the linguistic theory that a word's meaning can be derived from its distribution in natural language (i.e., its use). Language models are commonly viewed as an implementation of distributional semantics, as they…

Computation and Language · Computer Science 2024-10-21 Zhang Enyan , Zewei Wang , Michael A. Lepori , Ellie Pavlick , Helena Aparicio

Is my model perplexed for the right reason? Contrasting LLMs' Benchmark Behavior with Token-Level Perplexity

Standard evaluations of Large language models (LLMs) focus on task performance, offering limited insight into whether correct behavior reflects appropriate underlying mechanisms and risking confirmation bias. We introduce a simple,…

Computation and Language · Computer Science 2026-04-01 Zoë Prins , Samuele Punzo , Frank Wildenburg , Giovanni Cinà , Sandro Pezzelle

Do Neural Models Learn Systematicity of Monotonicity Inference in Natural Language?

Despite the success of language models using neural networks, it remains unclear to what extent neural models have the generalization ability to perform inferences. In this paper, we introduce a method for evaluating whether neural models…

Computation and Language · Computer Science 2020-05-05 Hitomi Yanaka , Koji Mineshima , Daisuke Bekki , Kentaro Inui

On the Efficacy of Sampling Adapters

Sampling is a common strategy for generating text from probabilistic models, yet standard ancestral sampling often results in text that is incoherent or ungrammatical. To alleviate this issue, various modifications to a model's sampling…

Computation and Language · Computer Science 2024-01-08 Clara Meister , Tiago Pimentel , Luca Malagutti , Ethan G. Wilcox , Ryan Cotterell

Detecting and Exorcising Statistical Demons from Language Models with Anti-Models of Negative Data

It's been said that "Language Models are Unsupervised Multitask Learners." Indeed, self-supervised language models trained on "positive" examples of English text generalize in desirable ways to many natural language tasks. But if such…

Computation and Language · Computer Science 2020-10-23 Michael L. Wick , Kate Silverstein , Jean-Baptiste Tristan , Adam Pocock , Mark Johnson

Language Models as Models of Language

This chapter critically examines the potential contributions of modern language models to theoretical linguistics. Despite their focus on engineering goals, these models' ability to acquire sophisticated linguistic knowledge from mere…

Computation and Language · Computer Science 2024-08-15 Raphaël Millière