English
Related papers

Related papers: Language Model Evaluation Beyond Perplexity

200 papers

A fundamental characteristic of natural language is the high rate at which speakers produce novel expressions. Because of this novelty, a heavy-tail of rare events accounts for a significant amount of the total probability mass of…

Computation and Language · Computer Science 2022-03-25 Benjamin LeBrun , Alessandro Sordoni , Timothy J. O'Donnell

We present a setup for training, evaluating and interpreting neural language models, that uses artificial, language-like data. The data is generated using a massive probabilistic grammar (based on state-split PCFGs), that is itself derived…

Computation and Language · Computer Science 2023-10-24 Jaap Jumelet , Willem Zuidema

Today's probabilistic language generators fall short when it comes to producing coherent and fluent text despite the fact that the underlying models perform well under standard metrics, e.g., perplexity. This discrepancy has puzzled the…

Computation and Language · Computer Science 2025-06-06 Clara Meister , Tiago Pimentel , Gian Wiher , Ryan Cotterell

While large language models (LLMs) are generally considered proficient in generating language, how similar their language usage is to that of humans remains understudied. In this paper, we test whether models exhibit linguistic convergence,…

Computation and Language · Computer Science 2026-02-13 Terra Blevins , Susanne Schmalwieser , Benjamin Roth

Building on research arguing for the possibility of conceptual and categorical knowledge acquisition through statistics contained in language, we evaluate predictive language models (LMs) -- informed solely by textual input -- on a…

Computation and Language · Computer Science 2021-05-10 Kanishka Misra , Allyson Ettinger , Julia Taylor Rayz

The recent proliferation of research into transformer based natural language processing has led to a number of studies which attempt to detect the presence of human-like cognitive behavior in the models. We contend that, as is true of human…

Computation and Language · Computer Science 2024-04-01 Jesse Roberts , Kyle Moore , Drew Wilenzick , Doug Fisher

Language models (LMs) are increasingly being studied as models of human language learners. Due to the nascency of the field, it is not well-established whether LMs exhibit similar learning dynamics to humans, and there are few direct…

Computation and Language · Computer Science 2025-02-11 Filippo Ficarra , Ryan Cotterell , Alex Warstadt

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei , Yun-Cheng Wang , Bin Wang , C. -C. Jay Kuo

The success of neural language models (LMs) on many technological tasks has brought about their potential relevance as scientific theories of language despite some clear differences between LM training and child language acquisition. In…

Computation and Language · Computer Science 2026-03-30 Héctor Javier Vázquez Martínez , Annika Lea Heuser , Charles Yang , Jordan Kodner

The learning trajectories of linguistic phenomena in humans provide insight into linguistic representation, beyond what can be gleaned from inspecting the behavior of an adult speaker. To apply a similar approach to analyze neural language…

Computation and Language · Computer Science 2022-04-07 Leshem Choshen , Guy Hacohen , Daphna Weinshall , Omri Abend

What can large language models learn? By definition, language models (LM) are distributions over strings. Therefore, an intuitive way of addressing the above question is to formalize it as a matter of learnability of classes of…

Computation and Language · Computer Science 2025-01-14 Nadav Borenstein , Anej Svete , Robin Chan , Josef Valvoda , Franz Nowak , Isabelle Augenstein , Eleanor Chodroff , Ryan Cotterell

Neural Language Models (NLMs) have made tremendous advances during the last years, achieving impressive performance on various linguistic tasks. Capitalizing on this, studies in neuroscience have started to use NLMs to study neural activity…

Artificial Intelligence · Computer Science 2022-07-08 Alexandre Pasquiou , Yair Lakretz , John Hale , Bertrand Thirion , Christophe Pallier

Despite the remarkable advances in language modeling, current mainstream decoding methods still struggle to generate texts that align with human texts across different aspects. In particular, sampling-based methods produce less-repetitive…

Computation and Language · Computer Science 2024-06-06 Haozhe Ji , Pei Ke , Hongning Wang , Minlie Huang

Transformer-based language models have recently achieved remarkable results in many natural language tasks. However, performance on leaderboards is generally achieved by leveraging massive amounts of training data, and rarely by encoding…

Computation and Language · Computer Science 2022-07-21 Bai Li

Distributional semantics is the linguistic theory that a word's meaning can be derived from its distribution in natural language (i.e., its use). Language models are commonly viewed as an implementation of distributional semantics, as they…

Computation and Language · Computer Science 2024-10-21 Zhang Enyan , Zewei Wang , Michael A. Lepori , Ellie Pavlick , Helena Aparicio

Standard evaluations of Large language models (LLMs) focus on task performance, offering limited insight into whether correct behavior reflects appropriate underlying mechanisms and risking confirmation bias. We introduce a simple,…

Computation and Language · Computer Science 2026-04-01 Zoë Prins , Samuele Punzo , Frank Wildenburg , Giovanni Cinà , Sandro Pezzelle

Despite the success of language models using neural networks, it remains unclear to what extent neural models have the generalization ability to perform inferences. In this paper, we introduce a method for evaluating whether neural models…

Computation and Language · Computer Science 2020-05-05 Hitomi Yanaka , Koji Mineshima , Daisuke Bekki , Kentaro Inui

Sampling is a common strategy for generating text from probabilistic models, yet standard ancestral sampling often results in text that is incoherent or ungrammatical. To alleviate this issue, various modifications to a model's sampling…

Computation and Language · Computer Science 2024-01-08 Clara Meister , Tiago Pimentel , Luca Malagutti , Ethan G. Wilcox , Ryan Cotterell

It's been said that "Language Models are Unsupervised Multitask Learners." Indeed, self-supervised language models trained on "positive" examples of English text generalize in desirable ways to many natural language tasks. But if such…

Computation and Language · Computer Science 2020-10-23 Michael L. Wick , Kate Silverstein , Jean-Baptiste Tristan , Adam Pocock , Mark Johnson

This chapter critically examines the potential contributions of modern language models to theoretical linguistics. Despite their focus on engineering goals, these models' ability to acquire sophisticated linguistic knowledge from mere…

Computation and Language · Computer Science 2024-08-15 Raphaël Millière
‹ Prev 1 2 3 10 Next ›