Related papers: Conditional Language Learning with Context

Teaching LLMs How to Learn with Contextual Fine-Tuning

Prompting Large Language Models (LLMs), or providing context on the expected model of operation, is an effective way to steer the outputs of such models to satisfy human desiderata after they have been trained. But in rapidly evolving…

Machine Learning · Computer Science 2025-08-08 Younwoo Choi, Muhammad Adil Asif, Ziwen Han, John Willes, Rahul G. Krishnan

Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models

Transformer-based large language models exhibit in-context learning, enabling adaptation to downstream tasks via few-shot prompting with demonstrations. In practice, such models are often fine-tuned to improve zero-shot performance on…

Computation and Language · Computer Science 2026-02-27 Chungpa Lee, Jy-yong Sohn, Kangwook Lee

In-Context Learning (and Unlearning) of Length Biases

Large language models have demonstrated strong capabilities to learn in-context, where exemplar input-output pairings are appended to the prompt for demonstration. However, existing work has demonstrated the ability of models to learn…

Computation and Language · Computer Science 2025-02-11 Stephanie Schoch, Yangfeng Ji

Can language models learn from explanations in context?

Language Models (LMs) can perform new tasks by adapting to a few in-context examples. For humans, explanations that connect examples to task principles can improve learning. We therefore investigate whether explanations of few-shot examples…

Computation and Language · Computer Science 2022-10-11 Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. McClelland, Jane X. Wang, Felix Hill

Selecting Informative Contexts Improves Language Model Finetuning

Language model fine-tuning is essential for modern natural language processing, but is computationally expensive and time-consuming. Further, the effectiveness of fine-tuning is limited by the inclusion of training examples that negatively…

Computation and Language · Computer Science 2022-05-23 Richard Antonello, Nicole Beckage, Javier Turek, Alexander Huth

Meta-learning via Language Model In-context Tuning

The goal of meta-learning is to learn to adapt to a new task with only a few labeled examples. To tackle this problem in NLP, we propose in-context tuning, which recasts adaptation and prediction as a simple sequence prediction…

Computation and Language · Computer Science 2022-04-13 Yanda Chen, Ruiqi Zhong, Sheng Zha, George Karypis, He He

Fine-Tune Language Models as Multi-Modal Differential Equation Solvers

In the growing domain of scientific machine learning, in-context operator learning has shown notable potential in building foundation models, as in this framework the model is trained to learn operators and solve differential equations…

Machine Learning · Computer Science 2024-02-02 Liu Yang, Siting Liu, Stanley J. Osher

When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations

Context-based fine-tuning methods, including prompting, in-context learning, soft prompting (also known as prompt tuning), and prefix-tuning, have gained popularity due to their ability to often match the performance of full fine-tuning…

Machine Learning · Computer Science 2024-04-10 Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

Understanding and Enhancing the Use of Context for Machine Translation

To understand and infer meaning in language, neural models have to learn complicated nuances. Discovering distinctive linguistic phenomena from data is not an easy task. For instance, lexical ambiguity is a fundamental feature of language…

Computation and Language · Computer Science 2021-02-23 Marzieh Fadaee

Pre-Training to Learn in Context

In-context learning, where pre-trained language models learn to perform tasks from task examples and instructions in their contexts, has attracted much attention in the NLP community. However, the ability of in-context learning is not fully…

Computation and Language · Computer Science 2023-05-17 Yuxian Gu, Li Dong, Furu Wei, Minlie Huang

What Context Features Can Transformer Language Models Use?

Transformer-based language models benefit from conditioning on contexts of hundreds to thousands of previous tokens. What aspects of these contexts contribute to accurate model prediction? We describe a series of experiments that measure…

Computation and Language · Computer Science 2021-06-17 Joe O'Connor, Jacob Andreas

When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars

The ability to acquire latent semantics is one of the key properties that determines the performance of language models. One convenient approach to invoke this ability is to prepend metadata (e.g. URLs, domains, and styles) at the beginning…

Computation and Language · Computer Science 2025-07-29 Rei Higuchi, Ryotaro Kawata, Naoki Nishikawa, Kazusato Oko, Shoichiro Yamaguchi, Sosuke Kobayashi, Seiya Tokui, Kohei Hayashi, Daisuke Okanohara, Taiji Suzuki

Measuring and Increasing Context Usage in Context-Aware Machine Translation

Recent work in neural machine translation has demonstrated both the necessity and feasibility of using inter-sentential context -- context from sentences other than those currently being translated. However, while many current methods…

Computation and Language · Computer Science 2021-06-03 Patrick Fernandes, Kayo Yin, Graham Neubig, André F. T. Martins

Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning

A growing body of literature has focused on detailing the linguistic knowledge embedded in large, pretrained language models. Existing work has shown that non-linguistic biases in models can drive model behavior away from linguistic…

Computation and Language · Computer Science 2021-06-03 Forrest Davis, Marten van Schijndel

Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

Transformer-based language models excel at in-context learning (ICL), where they can adapt to new tasks based on contextual examples, without parameter updates. In a specific form of ICL, which we refer to as contextual recall,…

Machine Learning · Computer Science 2026-03-24 Bhavya Vasudeva, Puneesh Deora, Alberto Bietti, Vatsal Sharan, Christos Thrampoulidis

The broader spectrum of in-context learning

The ability of language models to learn a task from a few examples in context has generated substantial interest. Here, we provide a perspective that situates this type of supervised few-shot learning within a much broader spectrum of…

Computation and Language · Computer Science 2025-06-06 Andrew Kyle Lampinen, Stephanie C. Y. Chan, Aaditya K. Singh, Murray Shanahan

Does Neural Machine Translation Benefit from Larger Context?

We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when…

Machine Learning · Statistics 2017-04-19 Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho

Context Tuning for In-Context Optimization

We introduce Context Tuning, a simple and effective method to significantly enhance few-shot adaptation of language models (LLMs) without fine-tuning model parameters. While prompt-based adaptation techniques have demonstrated the…

Computation and Language · Computer Science 2025-11-04 Jack Lu, Ryan Teehan, Zhenbang Yang, Mengye Ren

In-Context Learning Dynamics with Random Binary Sequences

Large language models (LLMs) trained on huge corpora of text datasets demonstrate intriguing capabilities, achieving state-of-the-art performance on tasks they were not explicitly trained for. The precise nature of LLM capabilities is often…

Artificial Intelligence · Computer Science 2024-04-17 Eric J. Bigelow, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Tomer D. Ullman

In-context Learning in Presence of Spurious Correlations

Large language models exhibit a remarkable capacity for in-context learning, where they learn to solve tasks given a few examples. Recent work has shown that transformers can be trained to perform simple regression tasks in-context. This…

Machine Learning · Computer Science 2026-04-03 Hrayr Harutyunyan, Rafayel Darbinyan, Samvel Karapetyan, Hrant Khachatrian