Related papers: In-Context Function Learning in Large Language Mod…

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning

In recent years, pre-trained large language models (LLMs) have demonstrated remarkable efficiency in achieving an inference-time few-shot learning capability known as in-context learning. However, existing literature has highlighted the…

Computation and Language · Computer Science · 2024-02-14 · Xinyi Wang, Wanrong Zhu, Michael Saxon, Mark Steyvers, William Yang Wang

Learning vs Retrieval: The Role of In-Context Examples in Regression with Large Language Models

Generative Large Language Models (LLMs) are capable of being in-context learners. However, the underlying mechanism of in-context learning (ICL) is still a major research question, and experimental research results about how models exploit…

Computation and Language · Computer Science · 2025-02-11 · Aliakbar Nafar, Kristen Brent Venable, Parisa Kordjamshidi

An Explanation of In-context Learning as Implicit Bayesian Inference

Large language models (LMs) such as GPT-3 have the surprising ability to do in-context learning, where the model learns to do a downstream task simply by conditioning on a prompt consisting of input-output examples. The LM learns from these…

Computation and Language · Computer Science · 2022-07-22 · Sang Michael Xie, Aditi Raghunathan, Percy Liang, Tengyu Ma

Large Language Models Can Take False First Steps at Inference-time Planning

Large language models (LLMs) have been shown to acquire sequence-level planning abilities during training, yet their planning behavior exhibited at inference time often appears short-sighted and inconsistent with these capabilities. We…

Artificial Intelligence · Computer Science · 2026-02-04 · Haijiang Yan, Jian-Qiao Zhu, Adam Sanborn

Understanding Emergent In-Context Learning from a Kernel Regression Perspective

Large language models (LLMs) have initiated a paradigm shift in transfer learning. In contrast to the classic pretraining-then-finetuning procedure, in order to use LLMs for downstream prediction tasks, one only needs to provide a few…

Computation and Language · Computer Science · 2025-09-16 · Chi Han, Ziqi Wang, Han Zhao, Heng Ji

A Theory of Emergent In-Context Learning as Implicit Structure Induction

Scaling large language models (LLMs) leads to an emergent capacity to learn in-context from example demonstrations. Despite progress, theoretical understanding of this phenomenon remains limited. We argue that in-context learning relies on…

Computation and Language · Computer Science · 2023-03-15 · Michael Hahn, Navin Goyal

In-Context Learning Dynamics with Random Binary Sequences

Large language models (LLMs) trained on huge corpora of text datasets demonstrate intriguing capabilities, achieving state-of-the-art performance on tasks they were not explicitly trained for. The precise nature of LLM capabilities is often…

Artificial Intelligence · Computer Science · 2024-04-17 · Eric J. Bigelow, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Tomer D. Ullman

Probing the Decision Boundaries of In-context Learning in Large Language Models

In-context learning is a key paradigm in large language models (LLMs) that enables them to generalize to new tasks and domains by simply prompting these models with a few exemplars without explicit parameter updates. Many attempts have been…

Machine Learning · Computer Science · 2024-12-11 · Siyan Zhao, Tung Nguyen, Aditya Grover

Can a Large Language Model Learn Matrix Functions In Context?

Large Language Models (LLMs) have demonstrated the ability to solve complex tasks through In-Context Learning (ICL), where models learn from a few input-output pairs without explicit fine-tuning. In this paper, we explore the capacity of…

Machine Learning · Computer Science · 2024-11-26 · Paimon Goulart, Evangelos E. Papalexakis

Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?

Large language models (LMs) are able to in-context learn -- perform a new task via inference alone by conditioning on a few input-label pairs (demonstrations) and making predictions for new inputs. However, there has been little…

Computation and Language · Computer Science · 2022-10-21 · Sewon Min, Xinxi Lyu, Ari Holtzman, Mikel Artetxe, Mike Lewis, Hannaneh Hajishirzi, Luke Zettlemoyer

The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design

Pretraining Neural Language Models (NLMs) over a large corpus involves chunking the text into training examples, which are contiguous text segments of sizes processable by the neural architecture. We highlight a bias introduced by this…

Computation and Language · Computer Science · 2022-03-22 · Yoav Levine, Noam Wies, Daniel Jannai, Dan Navon, Yedid Hoshen, Amnon Shashua

Large Language Models are Biased Reinforcement Learners

In-context learning enables large language models (LLMs) to perform a variety of tasks, including learning to make reward-maximizing choices in simple bandit tasks. Given their potential use as (autonomous) decision-making agents, it is…

Computation and Language · Computer Science · 2024-05-21 · William M. Hayes, Nicolas Yax, Stefano Palminteri

What Can Transformers Learn In-Context? A Case Study of Simple Function Classes

In-context learning refers to the ability of a model to condition on a prompt sequence consisting of in-context examples (input-output pairs corresponding to some task) along with a new query input, and generate the corresponding output.…

Computation and Language · Computer Science · 2023-08-15 · Shivam Garg, Dimitris Tsipras, Percy Liang, Gregory Valiant

Provable Low-Frequency Bias of In-Context Learning of Representations

In-context learning (ICL) enables large language models (LLMs) to acquire new behaviors from the input sequence alone without any parameter updates. Recent studies have shown that ICL can surpass the original meaning learned in pretraining…

Machine Learning · Computer Science · 2025-07-31 · Yongyi Yang, Hidenori Tanaka, Wei Hu

The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities

Large Language Models (LLMs), trained on extensive web-scale corpora, have demonstrated remarkable abilities across diverse tasks, especially as they are scaled up. Nevertheless, even state-of-the-art models struggle in certain cases,…

Computation and Language · Computer Science · 2025-01-16 · Irina Bigoulaeva, Harish Tayyar Madabushi, Iryna Gurevych

Context Collapse: In-Context Learning and Model Collapse

This thesis investigates two key phenomena in large language models (LLMs): in-context learning (ICL) and model collapse. We study ICL in a linear transformer with tied weights trained on linear regression tasks, and show that minimising…

Artificial Intelligence · Computer Science · 2026-01-06 · Josef Ott

Investigating the Learning Behaviour of In-context Learning: A Comparison with Supervised Learning

Large language models (LLMs) have shown remarkable capacity for in-context learning (ICL), where learning a new task from just a few training examples is done without being explicitly pre-trained. However, despite the success of LLMs, there…

Computation and Language · Computer Science · 2023-08-02 · Xindi Wang, Yufei Wang, Can Xu, Xiubo Geng, Bowen Zhang, Chongyang Tao, Frank Rudzicz, Robert E. Mercer, Daxin Jiang

Large Language Models Know What Makes Exemplary Contexts

In-context learning (ICL) has proven to be a significant capability with the advancement of Large Language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without…

Computation and Language · Computer Science · 2024-08-21 · Quanyu Long, Jianda Chen, Wenya Wang, Sinno Jialin Pan

Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations

In-context learning (ICL) is an important paradigm for adapting large language models (LLMs) to new tasks, but the generalization behavior of ICL remains poorly understood. We investigate the inductive biases of ICL from the perspective of…

Computation and Language · Computer Science · 2023-05-23 · Chenglei Si, Dan Friedman, Nitish Joshi, Shi Feng, Danqi Chen, He He

Does In-Context Learning Really Learn? Rethinking How Large Language Models Respond and Solve Tasks via In-Context Learning

In-context Learning (ICL) has emerged as a powerful capability alongside the development of scaled-up large language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks…

Computation and Language · Computer Science · 2024-07-24 · Quanyu Long, Yin Wu, Wenya Wang, Sinno Jialin Pan