Related papers: Parallel Structures in Pre-training Data Yield In-…

From Unstructured Data to In-Context Learning: Exploring What Tasks Can Be Learned and When

Large language models (LLMs) like transformers demonstrate impressive in-context learning (ICL) capabilities, allowing them to make predictions for new tasks based on prompt exemplars without parameter updates. While existing ICL theories…

Machine Learning · Computer Science 2024-11-12 Kevin Christian Wibisono, Yixin Wang

What Do Language Models Learn in Context? The Structured Task Hypothesis

Large language models (LLMs) exhibit an intriguing ability to learn a novel task from in-context examples presented in a demonstration, termed in-context learning (ICL). Understandably, a swath of research has been dedicated to uncovering…

Computation and Language · Computer Science 2024-08-06 Jiaoda Li, Yifan Hou, Mrinmaya Sachan, Ryan Cotterell

Understanding In-Context Learning via Supportive Pretraining Data

In-context learning (ICL) improves language models' performance on a variety of NLP tasks by simply demonstrating a handful of examples at inference time. It is not well understood why ICL ability emerges, as the model has never been…

Computation and Language · Computer Science 2023-06-28 Xiaochuang Han, Daniel Simig, Todor Mihaylov, Yulia Tsvetkov, Asli Celikyilmaz, Tianlu Wang

In-Context Learning Learns Label Relationships but Is Not Conventional Learning

The predictions of Large Language Models (LLMs) on downstream tasks often improve significantly when including examples of the input-label relationship in the context. However, there is currently no consensus about how this in-context…

Computation and Language · Computer Science 2024-03-14 Jannik Kossen, Yarin Gal, Tom Rainforth

Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting

Language models have the ability to perform in-context learning (ICL), allowing them to flexibly adapt their behavior based on context. This contrasts with in-weights learning (IWL), where memorized information is encoded in model…

Computation and Language · Computer Science 2025-03-04 Suraj Anand, Michael A. Lepori, Jack Merullo, Ellie Pavlick

Context Collapse: In-Context Learning and Model Collapse

This thesis investigates two key phenomena in large language models (LLMs): in-context learning (ICL) and model collapse. We study ICL in a linear transformer with tied weights trained on linear regression tasks, and show that minimising…

Artificial Intelligence · Computer Science 2026-01-06 Josef Ott

The Strong Pull of Prior Knowledge in Large Language Models and Its Impact on Emotion Recognition

In-context Learning (ICL) has emerged as a powerful paradigm for performing natural language tasks with Large Language Models (LLM) without updating the models' parameters, in contrast to the traditional gradient-based finetuning. The…

Computation and Language · Computer Science 2025-08-11 Georgios Chochlakis, Alexandros Potamianos, Kristina Lerman, Shrikanth Narayanan

Understanding Emergent In-Context Learning from a Kernel Regression Perspective

Large language models (LLMs) have initiated a paradigm shift in transfer learning. In contrast to the classic pretraining-then-finetuning procedure, in order to use LLMs for downstream prediction tasks, one only needs to provide a few…

Computation and Language · Computer Science 2025-09-16 Chi Han, Ziqi Wang, Han Zhao, Heng Ji

A Framework for Quantifying How Pre-Training and Context Benefit In-Context Learning

Pre-trained large language models have demonstrated a strong ability to learn from context, known as in-context learning (ICL). Despite a surge of recent applications that leverage such capabilities, it is by no means clear, at least…

Artificial Intelligence · Computer Science 2025-10-28 Bingqing Song, Jiaxiang Li, Rong Wang, Songtao Lu, Mingyi Hong

Unlocking In-Context Learning for Natural Datasets Beyond Language Modelling

Large Language Models (LLMs) exhibit In-Context Learning (ICL), which enables the model to perform new tasks conditioning only on the examples provided in the context without updating the model's weights. While ICL offers fast adaptation…

Computation and Language · Computer Science 2025-10-07 Jelena Bratulić, Sudhanshu Mittal, David T. Hoffmann, Samuel Böhm, Robin Tibor Schirrmeister, Tonio Ball, Christian Rupprecht, Thomas Brox

How Does the Pretraining Distribution Shape In-Context Learning? Task Selection, Generalization, and Robustness

The emergence of in-context learning (ICL) in large language models (LLMs) remains poorly understood despite its consistent effectiveness, enabling models to adapt to new tasks from only a handful of examples. To clarify and improve these…

Machine Learning · Computer Science 2025-10-02 Waïss Azizian, Ali Hasan

In-Context Language Learning: Architectures and Algorithms

Large-scale neural language models exhibit a remarkable capacity for in-context learning (ICL): they can infer novel functions from datasets provided as input. Most of our current understanding of when and how ICL arises comes from LMs…

Computation and Language · Computer Science 2024-01-31 Ekin Akyürek, Bailin Wang, Yoon Kim, Jacob Andreas

Towards the Effect of Examples on In-Context Learning: A Theoretical Case Study

In-context learning (ICL) has emerged as a powerful capability for large language models (LLMs) to adapt to downstream tasks by leveraging a few (demonstration) examples. Despite its effectiveness, the mechanism behind ICL remains…

Machine Learning · Computer Science 2025-06-03 Pengfei He, Yingqian Cui, Han Xu, Hui Liu, Makoto Yamada, Jiliang Tang, Yue Xing

Revisiting In-Context Learning with Long Context Language Models

In-Context Learning (ICL) is a technique by which language models make predictions based on examples provided in their input context. Previously, their context window size imposed a limit on the number of examples that can be shown, making…

Computation and Language · Computer Science 2025-05-29 Jinheon Baek, Sun Jae Lee, Prakhar Gupta, Geunseob Oh, Siddharth Dalmia, Prateek Kolhar

What do vision-language models see in the context? Investigating multimodal in-context learning

In-context learning (ICL) enables Large Language Models (LLMs) to learn tasks from demonstration examples without parameter updates. Although it has been extensively studied in LLMs, its effectiveness in Vision-Language Models (VLMs)…

Machine Learning · Computer Science 2025-10-29 Gabriel O. dos Santos, Esther Colombini, Sandra Avila

Understanding Generalization and Forgetting in In-Context Continual Learning

In-context learning (ICL) derives its power from enabling Large Language Models to adapt to new tasks via prompt-based reasoning alone, entirely bypassing the need for parameter updates. Existing theories primarily study ICL in single-task…

Machine Learning · Computer Science 2026-05-28 Guangyu Li, Meng Ding, Lijie Hu

In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax

In-context learning (ICL) is now a common method for teaching large language models (LLMs) new tasks: given labeled examples in the input context, the LLM learns to perform the task without weight updates. Do models guided via ICL infer the…

Computation and Language · Computer Science 2024-04-11 Aaron Mueller, Albert Webson, Jackson Petty, Tal Linzen

Context Structure Reshapes the Representational Geometry of Language Models

Large Language Models (LLMs) have been shown to organize the representations of input sequences into straighter neural trajectories in their deep layers, which has been hypothesized to facilitate next-token prediction via linear…

Computation and Language · Computer Science 2026-02-02 Eghbal A. Hosseini, Yuxuan Li, Yasaman Bahri, Declan Campbell, Andrew Kyle Lampinen

Does In-Context Learning Really Learn? Rethinking How Large Language Models Respond and Solve Tasks via In-Context Learning

In-context Learning (ICL) has emerged as a powerful capability alongside the development of scaled-up large language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks…

Computation and Language · Computer Science 2024-07-24 Quanyu Long, Yin Wu, Wenya Wang, Sinno Jialin Pan

Dr.ICL: Demonstration-Retrieved In-context Learning

In-context learning (ICL), teaching a large language model (LLM) to perform a task with few-shot demonstrations rather than adjusting the model parameters, has emerged as a strong paradigm for using LLMs. While early studies primarily used…

Computation and Language · Computer Science 2023-05-24 Man Luo, Xin Xu, Zhuyun Dai, Panupong Pasupat, Mehran Kazemi, Chitta Baral, Vaiva Imbrasaite, Vincent Y Zhao