Related papers: Multilingual Language Models Predict Human Reading…

Cross-Lingual Transfer of Cognitive Processing Complexity

When humans read a text, their eye movements are influenced by the structural complexity of the input sentences. This cognitive phenomenon holds across languages and recent studies indicate that multilingual language models utilize…

Computation and Language · Computer Science 2023-02-28 Charlotte Pouw , Nora Hollenstein , Lisa Beinborn

Vectors from Larger Language Models Predict Human Reading Time and fMRI Data More Poorly when Dimensionality Expansion is Controlled

The impressive linguistic abilities of large language models (LLMs) have recommended them as models of human sentence processing, with some conjecturing a positive 'quality-power' relationship (Wilcox et al., 2023), in which language…

Computation and Language · Computer Science 2025-05-20 Yi-Chien Lin , Hongao Zhu , William Schuler

Measuring the Impact of (Psycho-)Linguistic and Readability Features and Their Spill Over Effects on the Prediction of Eye Movement Patterns

There is a growing interest in the combined use of NLP and machine learning methods to predict gaze patterns during naturalistic reading. While promising results have been obtained through the use of transformer-based language models,…

Computation and Language · Computer Science 2022-03-16 Daniel Wiechmann , Yu Qiao , Elma Kerz , Justus Mattern

Zero Shot Crosslingual Eye-Tracking Data Prediction using Multilingual Transformer Models

Eye tracking data during reading is a useful source of information to understand the cognitive processes that take place during language comprehension processes. Different languages account for different brain triggers , however there seems…

Computation and Language · Computer Science 2022-03-31 Harshvardhan Srivastava

Collateral facilitation in humans and language models

Are the predictions of humans and language models affected by similar things? Research suggests that while comprehending language, humans make predictions about upcoming words, with more predictable words being processed more easily.…

Computation and Language · Computer Science 2022-11-11 James A. Michaelov , Benjamin K. Bergen

Probing for Reading Times

Probing has shown that language model representations encode rich linguistic information, but it remains unclear whether they also capture cognitive signals about human processing. In this work, we probe language model representations for…

Computation and Language · Computer Science 2026-04-22 Eleftheria Tsipidi , Samuel Kiegeland , Francesco Ignazio Re , Tianyang Xu , Mario Giulianelli , Karolina Stanczak , Ryan Cotterell

On the Predictive Power of Neural Language Models for Human Real-Time Comprehension Behavior

Human reading behavior is tuned to the statistics of natural language: the time it takes human subjects to read a word can be predicted from estimates of the word's probability in context. However, it remains an open question what…

Computation and Language · Computer Science 2020-06-04 Ethan Gotlieb Wilcox , Jon Gauthier , Jennifer Hu , Peng Qian , Roger Levy

Probing Large Language Models from A Human Behavioral Perspective

Large Language Models (LLMs) have emerged as dominant foundational models in modern NLP. However, the understanding of their prediction processes and internal mechanisms, such as feed-forward networks (FFN) and multi-head self-attention…

Computation and Language · Computer Science 2024-04-16 Xintong Wang , Xiaoyu Li , Xingshan Li , Chris Biemann

Beyond Human-Like Processing: Large Language Models Perform Equivalently on Forward and Backward Scientific Text

The impressive performance of large language models (LLMs) has led to their consideration as models of human language processing. Instead, we suggest that the success of LLMs arises from the flexibility of the transformer learning…

Computation and Language · Computer Science 2024-11-19 Xiaoliang Luo , Michael Ramscar , Bradley C. Love

To model human linguistic prediction, make LLMs less superhuman

When we read, we make predictions about upcoming words; these predictions influence our reading behavior. The success of large language models (LLMs), which, like humans, make predictions about upcoming words, has motivated their use as…

Computation and Language · Computer Science 2026-05-27 Byung-Doh Oh , Tal Linzen

Transformer-Based Language Model Surprisal Predicts Human Reading Times Best with About Two Billion Training Tokens

Recent psycholinguistic studies have drawn conflicting conclusions about the relationship between the quality of a language model and the ability of its surprisal estimates to predict human reading times, which has been speculated to be due…

Computation and Language · Computer Science 2023-10-24 Byung-Doh Oh , William Schuler

Advancements in Natural Language Processing: Exploring Transformer-Based Architectures for Text Understanding

Natural Language Processing (NLP) has witnessed a transformative leap with the advent of transformer-based architectures, which have significantly enhanced the ability of machines to understand and generate human-like text. This paper…

Computation and Language · Computer Science 2025-03-27 Tianhao Wu , Yu Wang , Ngoc Quach

N-gram-like Language Models Predict Reading Time Best

Recent work has found that contemporary language models such as transformers can become so good at next-word prediction that the probabilities they calculate become worse for predicting reading time. In this paper, we propose that this can…

Computation and Language · Computer Science 2026-03-11 James A. Michaelov , Roger P. Levy

A Primer on Pretrained Multilingual Language Models

Multilingual Language Models (\MLLMs) such as mBERT, XLM, XLM-R, \textit{etc.} have emerged as a viable option for bringing the power of pretraining to a large number of languages. Given their success in zero-shot transfer learning, there…

Computation and Language · Computer Science 2021-12-24 Sumanth Doddapaneni , Gowtham Ramesh , Mitesh M. Khapra , Anoop Kunchukuttan , Pratyush Kumar

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey

Large, pre-trained transformer-based language models such as BERT have drastically changed the Natural Language Processing (NLP) field. We present a survey of recent work that uses these large language models to solve NLP tasks via…

Computation and Language · Computer Science 2021-11-03 Bonan Min , Hayley Ross , Elior Sulem , Amir Pouran Ben Veyseh , Thien Huu Nguyen , Oscar Sainz , Eneko Agirre , Ilana Heinz , Dan Roth

Surprisal from Larger Transformer-based Language Models Predicts fMRI Data More Poorly

There has been considerable interest in using surprisal from Transformer-based language models (LMs) as predictors of human sentence processing difficulty. Recent work has observed an inverse scaling relationship between Transformers'…

Computation and Language · Computer Science 2026-02-04 Yi-Chien Lin , William Schuler

Pushing on Personality Detection from Verbal Behavior: A Transformer Meets Text Contours of Psycholinguistic Features

Research at the intersection of personality psychology, computer science, and linguistics has recently focused increasingly on modeling and predicting personality from language use. We report two major improvements in predicting personality…

Computation and Language · Computer Science 2022-04-12 Elma Kerz , Yu Qiao , Sourabh Zanwar , Daniel Wiechmann

Comparative Approaches to Sentiment Analysis Using Datasets in Major European and Arabic Languages

This study explores transformer-based models such as BERT, mBERT, and XLM-R for multi-lingual sentiment analysis across diverse linguistic structures. Key contributions include the identification of XLM-R superior adaptability in…

Computation and Language · Computer Science 2025-01-23 Mikhail Krasitskii , Olga Kolesnikova , Liliana Chanona Hernandez , Grigori Sidorov , Alexander Gelbukh

Do Multilingual LLMs Think In English?

Large language models (LLMs) have multilingual capabilities and can solve tasks across various languages. However, we show that current LLMs make key decisions in a representation space closest to English, regardless of their input and…

Computation and Language · Computer Science 2025-02-24 Lisa Schut , Yarin Gal , Sebastian Farquhar

An Investigation of Language Model Interpretability via Sentence Editing

Pre-trained language models (PLMs) like BERT are being used for almost all language-related tasks, but interpreting their behavior still remains a significant challenge and many important questions remain largely unanswered. In this work,…

Computation and Language · Computer Science 2021-09-28 Samuel Stevens , Yu Su