Related papers: Conditional Language Learning with Context

200 papers

Prompting Large Language Models (LLMs), or providing context on the expected model of operation, is an effective way to steer the outputs of such models to satisfy human desiderata after they have been trained. But in rapidly evolving…

Machine Learning · Computer Science 2025-08-08 Younwoo Choi, Muhammad Adil Asif, Ziwen Han, John Willes, Rahul G. Krishnan

Transformer-based large language models exhibit in-context learning, enabling adaptation to downstream tasks via few-shot prompting with demonstrations. In practice, such models are often fine-tuned to improve zero-shot performance on…

Computation and Language · Computer Science 2026-02-27 Chungpa Lee, Jy-yong Sohn, Kangwook Lee

Large language models have demonstrated strong capabilities to learn in-context, where exemplar input-output pairings are appended to the prompt for demonstration. However, existing work has demonstrated the ability of models to learn…

Computation and Language · Computer Science 2025-02-11 Stephanie Schoch, Yangfeng Ji

Language Models (LMs) can perform new tasks by adapting to a few in-context examples. For humans, explanations that connect examples to task principles can improve learning. We therefore investigate whether explanations of few-shot examples…

Language model fine-tuning is essential for modern natural language processing, but is computationally expensive and time-consuming. Further, the effectiveness of fine-tuning is limited by the inclusion of training examples that negatively…

Computation and Language · Computer Science 2022-05-23 Richard Antonello, Nicole Beckage, Javier Turek, Alexander Huth

The goal of meta-learning is to learn to adapt to a new task with only a few labeled examples. To tackle this problem in NLP, we propose in-context tuning, which recasts adaptation and prediction as a simple sequence prediction…

Computation and Language · Computer Science 2022-04-13 Yanda Chen, Ruiqi Zhong, Sheng Zha, George Karypis, He He

In the growing domain of scientific machine learning, in-context operator learning has shown notable potential in building foundation models, as in this framework the model is trained to learn operators and solve differential equations…

Machine Learning · Computer Science 2024-02-02 Liu Yang, Siting Liu, Stanley J. Osher

Context-based fine-tuning methods, including prompting, in-context learning, soft prompting (also known as prompt tuning), and prefix-tuning, have gained popularity due to their ability to often match the performance of full fine-tuning…

Machine Learning · Computer Science 2024-04-10 Aleksandar Petrov, Philip H. S. Torr, Adel Bibi

To understand and infer meaning in language, neural models have to learn complicated nuances. Discovering distinctive linguistic phenomena from data is not an easy task. For instance, lexical ambiguity is a fundamental feature of language…

Computation and Language · Computer Science 2021-02-23 Marzieh Fadaee

In-context learning, where pre-trained language models learn to perform tasks from task examples and instructions in their contexts, has attracted much attention in the NLP community. However, the ability of in-context learning is not fully…

Computation and Language · Computer Science 2023-05-17 Yuxian Gu, Li Dong, Furu Wei, Minlie Huang

Transformer-based language models benefit from conditioning on contexts of hundreds to thousands of previous tokens. What aspects of these contexts contribute to accurate model prediction? We describe a series of experiments that measure…

Computation and Language · Computer Science 2021-06-17 Joe O'Connor, Jacob Andreas

The ability to acquire latent semantics is one of the key properties that determines the performance of language models. One convenient approach to invoke this ability is to prepend metadata (e.g. URLs, domains, and styles) at the beginning…

Recent work in neural machine translation has demonstrated both the necessity and feasibility of using inter-sentential context -- context from sentences other than those currently being translated. However, while many current methods…

Computation and Language · Computer Science 2021-06-03 Patrick Fernandes, Kayo Yin, Graham Neubig, André F. T. Martins

A growing body of literature has focused on detailing the linguistic knowledge embedded in large, pretrained language models. Existing work has shown that non-linguistic biases in models can drive model behavior away from linguistic…

Computation and Language · Computer Science 2021-06-03 Forrest Davis, Marten van Schijndel

Transformer-based language models excel at in-context learning (ICL), where they can adapt to new tasks based on contextual examples, without parameter updates. In a specific form of ICL, which we refer to as contextual recall,…

Machine Learning · Computer Science 2026-03-24 Bhavya Vasudeva, Puneesh Deora, Alberto Bietti, Vatsal Sharan, Christos Thrampoulidis

The ability of language models to learn a task from a few examples in context has generated substantial interest. Here, we provide a perspective that situates this type of supervised few-shot learning within a much broader spectrum of…

Computation and Language · Computer Science 2025-06-06 Andrew Kyle Lampinen, Stephanie C. Y. Chan, Aaditya K. Singh, Murray Shanahan

We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when…

Machine Learning · Statistics 2017-04-19 Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho

We introduce Context Tuning, a simple and effective method to significantly enhance few-shot adaptation of language models (LLMs) without fine-tuning model parameters. While prompt-based adaptation techniques have demonstrated the…

Computation and Language · Computer Science 2025-11-04 Jack Lu, Ryan Teehan, Zhenbang Yang, Mengye Ren

Large language models (LLMs) trained on huge corpora of text datasets demonstrate intriguing capabilities, achieving state-of-the-art performance on tasks they were not explicitly trained for. The precise nature of LLM capabilities is often…

Artificial Intelligence · Computer Science 2024-04-17 Eric J. Bigelow, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Tomer D. Ullman

Large language models exhibit a remarkable capacity for in-context learning, where they learn to solve tasks given a few examples. Recent work has shown that transformers can be trained to perform simple regression tasks in-context. This…

Machine Learning · Computer Science 2026-04-03 Hrayr Harutyunyan, Rafayel Darbinyan, Samvel Karapetyan, Hrant Khachatrian