Related papers: Parallel Structures in Pre-training Data Yield In-…

200 papers

Large language models (LLMs) like transformers demonstrate impressive in-context learning (ICL) capabilities, allowing them to make predictions for new tasks based on prompt exemplars without parameter updates. While existing ICL theories…

Machine Learning · Computer Science 2024-11-12 Kevin Christian Wibisono, Yixin Wang

Large language models (LLMs) exhibit an intriguing ability to learn a novel task from in-context examples presented in a demonstration, termed in-context learning (ICL). Understandably, a swath of research has been dedicated to uncovering…

Computation and Language · Computer Science 2024-08-06 Jiaoda Li, Yifan Hou, Mrinmaya Sachan, Ryan Cotterell

In-context learning (ICL) improves language models' performance on a variety of NLP tasks by simply demonstrating a handful of examples at inference time. It is not well understood why ICL ability emerges, as the model has never been…

Computation and Language · Computer Science 2023-06-28 Xiaochuang Han, Daniel Simig, Todor Mihaylov, Yulia Tsvetkov, Asli Celikyilmaz, Tianlu Wang

The predictions of Large Language Models (LLMs) on downstream tasks often improve significantly when including examples of the input–label relationship in the context. However, there is currently no consensus about how this in-context…

Computation and Language · Computer Science 2024-03-14 Jannik Kossen, Yarin Gal, Tom Rainforth

Language models have the ability to perform in-context learning (ICL), allowing them to flexibly adapt their behavior based on context. This contrasts with in-weights learning (IWL), where memorized information is encoded in model…

Computation and Language · Computer Science 2025-03-04 Suraj Anand, Michael A. Lepori, Jack Merullo, Ellie Pavlick

This thesis investigates two key phenomena in large language models (LLMs): in-context learning (ICL) and model collapse. We study ICL in a linear transformer with tied weights trained on linear regression tasks, and show that minimising…

Artificial Intelligence · Computer Science 2026-01-06 Josef Ott

In-context Learning (ICL) has emerged as a powerful paradigm for performing natural language tasks with Large Language Models (LLM) without updating the models' parameters, in contrast to the traditional gradient-based finetuning. The…

Computation and Language · Computer Science 2025-08-11 Georgios Chochlakis, Alexandros Potamianos, Kristina Lerman, Shrikanth Narayanan

Large language models (LLMs) have initiated a paradigm shift in transfer learning. In contrast to the classic pretraining-then-finetuning procedure, in order to use LLMs for downstream prediction tasks, one only needs to provide a few…

Computation and Language · Computer Science 2025-09-16 Chi Han, Ziqi Wang, Han Zhao, Heng Ji

Pre-trained large language models have demonstrated a strong ability to learn from context, known as in-context learning (ICL). Despite a surge of recent applications that leverage such capabilities, it is by no means clear, at least…

Artificial Intelligence · Computer Science 2025-10-28 Bingqing Song, Jiaxiang Li, Rong Wang, Songtao Lu, Mingyi Hong

Large Language Models (LLMs) exhibit In-Context Learning (ICL), which enables the model to perform new tasks conditioning only on the examples provided in the context without updating the model's weights. While ICL offers fast adaptation…

The emergence of in-context learning (ICL) in large language models (LLMs) remains poorly understood despite its consistent effectiveness, enabling models to adapt to new tasks from only a handful of examples. To clarify and improve these…

Machine Learning · Computer Science 2025-10-02 Waïss Azizian, Ali Hasan

Large-scale neural language models exhibit a remarkable capacity for in-context learning (ICL): they can infer novel functions from datasets provided as input. Most of our current understanding of when and how ICL arises comes from LMs…

Computation and Language · Computer Science 2024-01-31 Ekin Akyürek, Bailin Wang, Yoon Kim, Jacob Andreas

In-context learning (ICL) has emerged as a powerful capability for large language models (LLMs) to adapt to downstream tasks by leveraging a few (demonstration) examples. Despite its effectiveness, the mechanism behind ICL remains…

Machine Learning · Computer Science 2025-06-03 Pengfei He, Yingqian Cui, Han Xu, Hui Liu, Makoto Yamada, Jiliang Tang, Yue Xing

In-Context Learning (ICL) is a technique by which language models make predictions based on examples provided in their input context. Previously, their context window size imposed a limit on the number of examples that can be shown, making…

Computation and Language · Computer Science 2025-05-29 Jinheon Baek, Sun Jae Lee, Prakhar Gupta, Geunseob Oh, Siddharth Dalmia, Prateek Kolhar

In-context learning (ICL) enables Large Language Models (LLMs) to learn tasks from demonstration examples without parameter updates. Although it has been extensively studied in LLMs, its effectiveness in Vision-Language Models (VLMs)…

Machine Learning · Computer Science 2025-10-29 Gabriel O. dos Santos, Esther Colombini, Sandra Avila

In-context learning (ICL) derives its power from enabling Large Language Models to adapt to new tasks via prompt-based reasoning alone, entirely bypassing the need for parameter updates. Existing theories primarily study ICL in single-task…

Machine Learning · Computer Science 2026-05-28 Guangyu Li, Meng Ding, Lijie Hu

In-context learning (ICL) is now a common method for teaching large language models (LLMs) new tasks: given labeled examples in the input context, the LLM learns to perform the task without weight updates. Do models guided via ICL infer the…

Computation and Language · Computer Science 2024-04-11 Aaron Mueller, Albert Webson, Jackson Petty, Tal Linzen

Large Language Models (LLMs) have been shown to organize the representations of input sequences into straighter neural trajectories in their deep layers, which has been hypothesized to facilitate next-token prediction via linear…

Computation and Language · Computer Science 2026-02-02 Eghbal A. Hosseini, Yuxuan Li, Yasaman Bahri, Declan Campbell, Andrew Kyle Lampinen

In-context Learning (ICL) has emerged as a powerful capability alongside the development of scaled-up large language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks…

Computation and Language · Computer Science 2024-07-24 Quanyu Long, Yin Wu, Wenya Wang, Sinno Jialin Pan

In-context learning (ICL), teaching a large language model (LLM) to perform a task with few-shot demonstrations rather than adjusting the model parameters, has emerged as a strong paradigm for using LLMs. While early studies primarily used…

Computation and Language · Computer Science 2023-05-24 Man Luo, Xin Xu, Zhuyun Dai, Panupong Pasupat, Mehran Kazemi, Chitta Baral, Vaiva Imbrasaite, Vincent Y Zhao