Related papers: Racing Thoughts: Explaining Contextualization Erro…

When Does Context Help? Error Dynamics of Contextual Information in Large Language Models

Contextual information at inference time, such as demonstrations, retrieved knowledge, or interaction history, can substantially improve large language models (LLMs) without parameter updates, yet its theoretical role remains poorly…

Computation and Language · Computer Science 2026-02-10 Dingzirui Wang , Xuanliang Zhang , Keyan Xu , Qingfu Zhu , Wanxiang Che , Yang Deng

On the Robustness of Transformers against Context Hijacking for Linear Classification

Transformer-based Large Language Models (LLMs) have demonstrated powerful in-context learning capabilities. However, their predictions can be disrupted by factually correct context, a phenomenon known as context hijacking, revealing a…

Computation and Language · Computer Science 2025-02-24 Tianle Li , Chenyang Zhang , Xingwu Chen , Yuan Cao , Difan Zou

Conditional and Modal Reasoning in Large Language Models

The reasoning abilities of large language models (LLMs) are the topic of a growing body of research in AI and cognitive science. In this paper, we probe the extent to which twenty-nine LLMs are able to distinguish logically correct…

Computation and Language · Computer Science 2024-10-15 Wesley H. Holliday , Matthew Mandelkern , Cedegao E. Zhang

Do Large Language Models Understand Logic or Just Mimick Context?

Over the past few years, the abilities of large language models (LLMs) have received extensive attention, which have performed exceptionally well in complicated scenarios such as logical reasoning and symbolic inference. A significant…

Computation and Language · Computer Science 2024-02-20 Junbing Yan , Chengyu Wang , Jun Huang , Wei Zhang

Evaluating Large Language Models (LLMs) in Financial NLP: A Comparative Study on Financial Report Analysis

Large language models (LLMs) are increasingly used to support the analysis of complex financial disclosures, yet their reliability, behavioral consistency, and transparency remain insufficiently understood in high-stakes settings. This…

Computation and Language · Computer Science 2026-01-21 Md Talha Mohsin

Sensivity of LLMs' Explanations to the Training Randomness:Context, Class & Task Dependencies

Transformer models are now a cornerstone in natural language processing. Yet, explaining their decisions remains a challenge. It was shown recently that the same model trained on the same data with a different randomness can lead to very…

Computation and Language · Computer Science 2026-03-10 Romain Loncour , Jérémie Bogaert , François-Xavier Standaert

Did the Cat Drink the Coffee? Challenging Transformers with Generalized Event Knowledge

Prior research has explored the ability of computational models to predict a word semantic fit with a given predicate. While much work has been devoted to modeling the typicality relation between verbs and arguments in isolation, in this…

Computation and Language · Computer Science 2021-07-26 Paolo Pedinotti , Giulia Rambelli , Emmanuele Chersoni , Enrico Santus , Alessandro Lenci , Philippe Blache

Insights into LLM Long-Context Failures: When Transformers Know but Don't Tell

Large Language Models (LLMs) exhibit positional bias, struggling to utilize information from the middle or end of long contexts. Our study explores LLMs' long-context reasoning by probing their hidden representations. We find that while…

Computation and Language · Computer Science 2024-10-08 Taiming Lu , Muhan Gao , Kuai Yu , Adam Byerly , Daniel Khashabi

Action Contextualization: Adaptive Task Planning and Action Tuning using Large Language Models

Large Language Models (LLMs) present a promising frontier in robotic task planning by leveraging extensive human knowledge. Nevertheless, the current literature often overlooks the critical aspects of robots' adaptability and error…

Robotics · Computer Science 2024-11-27 Sthithpragya Gupta , Kunpeng Yao , Loïc Niederhauser , Aude Billard

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning

In recent years, pre-trained large language models (LLMs) have demonstrated remarkable efficiency in achieving an inference-time few-shot learning capability known as in-context learning. However, existing literature has highlighted the…

Computation and Language · Computer Science 2024-02-14 Xinyi Wang , Wanrong Zhu , Michael Saxon , Mark Steyvers , William Yang Wang

A Theoretical Understanding of Self-Correction through In-context Alignment

Going beyond mimicking limited human experiences, recent studies show initial evidence that, like humans, large language models (LLMs) are capable of improving their abilities purely by self-correction, i.e., correcting previous responses…

Machine Learning · Computer Science 2024-11-19 Yifei Wang , Yuyang Wu , Zeming Wei , Stefanie Jegelka , Yisen Wang

Is More Context Always Better? Examining LLM Reasoning Capability for Time Interval Prediction

Large Language Models (LLMs) have demonstrated impressive capabilities in reasoning and prediction across different domains. Yet, their ability to infer temporal regularities from structured behavioral data remains underexplored. This paper…

Artificial Intelligence · Computer Science 2026-01-27 Yanan Cao , Farnaz Fallahi , Murali Mohana Krishna Dandu , Lalitesh Morishetti , Kai Zhao , Luyi Ma , Sinduja Subramaniam , Jianpeng Xu , Evren Korpeoglu , Kaushiki Nag , Sushant Kumar , Kannan Achan

Contextual Drag: How Errors in the Context Affect LLM Reasoning

Central to many self-improvement pipelines for large language models (LLMs) is the assumption that models can improve by reflecting on past mistakes. We study a phenomenon termed contextual drag: the presence of failed attempts in the…

Computation and Language · Computer Science 2026-03-04 Yun Cheng , Xingyu Zhu , Haoyu Zhao , Sanjeev Arora

Can Large Language Models Develop Gambling Addiction?

This study identifies the specific conditions under which large language models exhibit human-like gambling addiction patterns, providing critical insights into their decision-making mechanisms and AI safety. We analyze LLM decision-making…

Artificial Intelligence · Computer Science 2025-12-22 Seungpil Lee , Donghyeon Shin , Yunjeong Lee , Sundong Kim

Can Large Language Models Understand Context?

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of LLMs encompasses various…

Computation and Language · Computer Science 2024-02-02 Yilun Zhu , Joel Ruben Antony Moniz , Shruti Bhargava , Jiarui Lu , Dhivya Piraviperumal , Site Li , Yuan Zhang , Hong Yu , Bo-Hsiang Tseng

Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context

Human processing of idioms relies on understanding the contextual sentences in which idioms occur, as well as language-intrinsic features such as frequency and speaker-intrinsic factors like familiarity. While LLMs have shown high…

Computation and Language · Computer Science 2025-07-17 Maggie Mi , Aline Villavicencio , Nafise Sadat Moosavi

If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models

Conditional acceptability refers to how plausible a conditional statement is perceived to be. It plays an important role in communication and reasoning, as it influences how individuals interpret implications, assess arguments, and make…

Computation and Language · Computer Science 2026-03-20 Jasmin Orth , Philipp Mondorf , Barbara Plank

Context Collapse: In-Context Learning and Model Collapse

This thesis investigates two key phenomena in large language models (LLMs): in-context learning (ICL) and model collapse. We study ICL in a linear transformer with tied weights trained on linear regression tasks, and show that minimising…

Artificial Intelligence · Computer Science 2026-01-06 Josef Ott

What Do Language Models Learn in Context? The Structured Task Hypothesis

Large language models (LLMs) exhibit an intriguing ability to learn a novel task from in-context examples presented in a demonstration, termed in-context learning (ICL). Understandably, a swath of research has been dedicated to uncovering…

Computation and Language · Computer Science 2024-08-06 Jiaoda Li , Yifan Hou , Mrinmaya Sachan , Ryan Cotterell

SimLM: Can Language Models Infer Parameters of Physical Systems?

Several machine learning methods aim to learn or reason about complex physical systems. A common first-step towards reasoning is to infer system parameters from observations of its behavior. In this paper, we investigate the performance of…

Computation and Language · Computer Science 2024-02-07 Sean Memery , Mirella Lapata , Kartic Subr