English
Related papers

Related papers: Racing Thoughts: Explaining Contextualization Erro…

200 papers

Contextual information at inference time, such as demonstrations, retrieved knowledge, or interaction history, can substantially improve large language models (LLMs) without parameter updates, yet its theoretical role remains poorly…

Computation and Language · Computer Science 2026-02-10 Dingzirui Wang , Xuanliang Zhang , Keyan Xu , Qingfu Zhu , Wanxiang Che , Yang Deng

Transformer-based Large Language Models (LLMs) have demonstrated powerful in-context learning capabilities. However, their predictions can be disrupted by factually correct context, a phenomenon known as context hijacking, revealing a…

Computation and Language · Computer Science 2025-02-24 Tianle Li , Chenyang Zhang , Xingwu Chen , Yuan Cao , Difan Zou

The reasoning abilities of large language models (LLMs) are the topic of a growing body of research in AI and cognitive science. In this paper, we probe the extent to which twenty-nine LLMs are able to distinguish logically correct…

Computation and Language · Computer Science 2024-10-15 Wesley H. Holliday , Matthew Mandelkern , Cedegao E. Zhang

Over the past few years, the abilities of large language models (LLMs) have received extensive attention, which have performed exceptionally well in complicated scenarios such as logical reasoning and symbolic inference. A significant…

Computation and Language · Computer Science 2024-02-20 Junbing Yan , Chengyu Wang , Jun Huang , Wei Zhang

Large language models (LLMs) are increasingly used to support the analysis of complex financial disclosures, yet their reliability, behavioral consistency, and transparency remain insufficiently understood in high-stakes settings. This…

Computation and Language · Computer Science 2026-01-21 Md Talha Mohsin

Transformer models are now a cornerstone in natural language processing. Yet, explaining their decisions remains a challenge. It was shown recently that the same model trained on the same data with a different randomness can lead to very…

Computation and Language · Computer Science 2026-03-10 Romain Loncour , Jérémie Bogaert , François-Xavier Standaert

Prior research has explored the ability of computational models to predict a word semantic fit with a given predicate. While much work has been devoted to modeling the typicality relation between verbs and arguments in isolation, in this…

Computation and Language · Computer Science 2021-07-26 Paolo Pedinotti , Giulia Rambelli , Emmanuele Chersoni , Enrico Santus , Alessandro Lenci , Philippe Blache

Large Language Models (LLMs) exhibit positional bias, struggling to utilize information from the middle or end of long contexts. Our study explores LLMs' long-context reasoning by probing their hidden representations. We find that while…

Computation and Language · Computer Science 2024-10-08 Taiming Lu , Muhan Gao , Kuai Yu , Adam Byerly , Daniel Khashabi

Large Language Models (LLMs) present a promising frontier in robotic task planning by leveraging extensive human knowledge. Nevertheless, the current literature often overlooks the critical aspects of robots' adaptability and error…

Robotics · Computer Science 2024-11-27 Sthithpragya Gupta , Kunpeng Yao , Loïc Niederhauser , Aude Billard

In recent years, pre-trained large language models (LLMs) have demonstrated remarkable efficiency in achieving an inference-time few-shot learning capability known as in-context learning. However, existing literature has highlighted the…

Computation and Language · Computer Science 2024-02-14 Xinyi Wang , Wanrong Zhu , Michael Saxon , Mark Steyvers , William Yang Wang

Going beyond mimicking limited human experiences, recent studies show initial evidence that, like humans, large language models (LLMs) are capable of improving their abilities purely by self-correction, i.e., correcting previous responses…

Machine Learning · Computer Science 2024-11-19 Yifei Wang , Yuyang Wu , Zeming Wei , Stefanie Jegelka , Yisen Wang

Large Language Models (LLMs) have demonstrated impressive capabilities in reasoning and prediction across different domains. Yet, their ability to infer temporal regularities from structured behavioral data remains underexplored. This paper…

Central to many self-improvement pipelines for large language models (LLMs) is the assumption that models can improve by reflecting on past mistakes. We study a phenomenon termed contextual drag: the presence of failed attempts in the…

Computation and Language · Computer Science 2026-03-04 Yun Cheng , Xingyu Zhu , Haoyu Zhao , Sanjeev Arora

This study identifies the specific conditions under which large language models exhibit human-like gambling addiction patterns, providing critical insights into their decision-making mechanisms and AI safety. We analyze LLM decision-making…

Artificial Intelligence · Computer Science 2025-12-22 Seungpil Lee , Donghyeon Shin , Yunjeong Lee , Sundong Kim

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of LLMs encompasses various…

Computation and Language · Computer Science 2024-02-02 Yilun Zhu , Joel Ruben Antony Moniz , Shruti Bhargava , Jiarui Lu , Dhivya Piraviperumal , Site Li , Yuan Zhang , Hong Yu , Bo-Hsiang Tseng

Human processing of idioms relies on understanding the contextual sentences in which idioms occur, as well as language-intrinsic features such as frequency and speaker-intrinsic factors like familiarity. While LLMs have shown high…

Computation and Language · Computer Science 2025-07-17 Maggie Mi , Aline Villavicencio , Nafise Sadat Moosavi

Conditional acceptability refers to how plausible a conditional statement is perceived to be. It plays an important role in communication and reasoning, as it influences how individuals interpret implications, assess arguments, and make…

Computation and Language · Computer Science 2026-03-20 Jasmin Orth , Philipp Mondorf , Barbara Plank

This thesis investigates two key phenomena in large language models (LLMs): in-context learning (ICL) and model collapse. We study ICL in a linear transformer with tied weights trained on linear regression tasks, and show that minimising…

Artificial Intelligence · Computer Science 2026-01-06 Josef Ott

Large language models (LLMs) exhibit an intriguing ability to learn a novel task from in-context examples presented in a demonstration, termed in-context learning (ICL). Understandably, a swath of research has been dedicated to uncovering…

Computation and Language · Computer Science 2024-08-06 Jiaoda Li , Yifan Hou , Mrinmaya Sachan , Ryan Cotterell

Several machine learning methods aim to learn or reason about complex physical systems. A common first-step towards reasoning is to infer system parameters from observations of its behavior. In this paper, we investigate the performance of…

Computation and Language · Computer Science 2024-02-07 Sean Memery , Mirella Lapata , Kartic Subr
‹ Prev 1 2 3 10 Next ›