Related papers: Context is Environment

200 papers

In-context learning is a key paradigm in large language models (LLMs) that enables them to generalize to new tasks and domains by simply prompting these models with a few exemplars without explicit parameter updates. Many attempts have been…

Machine Learning · Computer Science 2024-12-11 Siyan Zhao, Tung Nguyen, Aditya Grover

While Reinforcement Learning (RL) has made great strides towards solving increasingly complicated problems, many algorithms are still brittle to even slight environmental changes. Contextual Reinforcement Learning (cRL) provides a…

In-context learning (ICL) is now a common method for teaching large language models (LLMs) new tasks: given labeled examples in the input context, the LLM learns to perform the task without weight updates. Do models guided via ICL infer the…

Computation and Language · Computer Science 2024-04-11 Aaron Mueller, Albert Webson, Jackson Petty, Tal Linzen

Generative Large Language Models (LLMs) are capable of being in-context learners. However, the underlying mechanism of in-context learning (ICL) is still a major research question, and experimental research results about how models exploit…

Computation and Language · Computer Science 2025-02-11 Aliakbar Nafar, Kristen Brent Venable, Parisa Kordjamshidi

Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of LLMs encompasses various…

Computation and Language · Computer Science 2024-02-02 Yilun Zhu, Joel Ruben Antony Moniz, Shruti Bhargava, Jiarui Lu, Dhivya Piraviperumal, Site Li, Yuan Zhang, Hong Yu, Bo-Hsiang Tseng

In-Context Learning (ICL) is a technique by which language models make predictions based on examples provided in their input context. Previously, their context window size imposed a limit on the number of examples that can be shown, making…

Computation and Language · Computer Science 2025-05-29 Jinheon Baek, Sun Jae Lee, Prakhar Gupta, Geunseob Oh, Siddharth Dalmia, Prateek Kolhar

In-context learning (ICL) has proven to be a significant capability with the advancement of Large Language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without…

Computation and Language · Computer Science 2024-08-21 Quanyu Long, Jianda Chen, Wenya Wang, Sinno Jialin Pan

Large language models (LLMs) trained on huge corpora of text datasets demonstrate intriguing capabilities, achieving state-of-the-art performance on tasks they were not explicitly trained for. The precise nature of LLM capabilities is often…

Artificial Intelligence · Computer Science 2024-04-17 Eric J. Bigelow, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Tomer D. Ullman

The performance of Large Language Models (LLMs) is fundamentally determined by the contextual information provided during inference. This survey introduces Context Engineering, a formal discipline that transcends simple prompt design to…

Question-answering (QA) is a significant application of Large Language Models (LLMs), shaping chatbot capabilities across healthcare, education, and customer service. However, widespread LLM integration presents a challenge for small…

Computation and Language · Computer Science 2023-09-06 Md Adnan Arefeen, Biplob Debnath, Srimat Chakradhar

Most existing large language models (LLMs) are expensive to adapt after deployment, especially when a task requires newly produced information or niche domain knowledge. Recent work has shown that, by manipulating and optimizing their…

Computation and Language · Computer Science 2026-05-15 Zeyu Huang, Adhiguna Kuncoro, Qixuan Feng, Jiajun Shen, Lucio Dery, Arthur Szlam, Marc'Aurelio Ranzato

With the increasing capabilities of large language models (LLMs), in-context learning (ICL) has emerged as a new paradigm for natural language processing (NLP), where LLMs make predictions based on contexts augmented with a few examples. It…

Computation and Language · Computer Science 2024-10-08 Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Tianyu Liu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui

Recent developments in text classification using Large Language Models (LLMs) in the social sciences suggest that costs can be cut significantly, while performance can sometimes rival existing computational methods. However, with a wide…

Computation and Language · Computer Science 2026-03-27 Erkan Gunes, Christoffer Florczak, Tevfik Murat Yildirim

In-context learning refers to the emerging ability of large language models (LLMs) to perform a target task without additional training, utilizing demonstrations of the task. Recent studies aim to enhance in-context learning performance by…

Computation and Language · Computer Science 2024-12-02 Junyong Kang, Donghyun Son, Hwanjun Song, Buru Chang

Large Language Models (LLMs) have been widely used as general-purpose AI agents showing comparable performance on many downstream tasks. However, existing work shows that it is challenging for LLMs to integrate structured data (e.g. KG,…

Computation and Language · Computer Science 2024-02-23 Younghun Lee, Sungchul Kim, Tong Yu, Ryan A. Rossi, Xiang Chen

Large language models (LLMs) excel in abstractive summarization tasks, delivering fluent and pertinent summaries. Recent advancements have extended their capabilities to handle long-input contexts, exceeding 100k tokens. However, in…

Computation and Language · Computer Science 2024-11-15 Mathieu Ravaut, Aixin Sun, Nancy F. Chen, Shafiq Joty

Contextual information at inference time, such as demonstrations, retrieved knowledge, or interaction history, can substantially improve large language models (LLMs) without parameter updates, yet its theoretical role remains poorly…

Computation and Language · Computer Science 2026-02-10 Dingzirui Wang, Xuanliang Zhang, Keyan Xu, Qingfu Zhu, Wanxiang Che, Yang Deng

We formalize a new concept for LLMs, context-enhanced learning. It involves standard gradient-based learning on text except that the context is enhanced with additional data on which no auto-regressive gradients are computed. This setting…

Machine Learning · Computer Science 2025-06-06 Xingyu Zhu, Abhishek Panigrahi, Sanjeev Arora

Large language models (LLMs) are increasingly strong contenders in machine translation. In this work, we focus on document-level translation, where some words cannot be translated without context from outside the sentence. Specifically, we…

Computation and Language · Computer Science 2025-02-17 Wafaa Mohammed, Vlad Niculae

In-context Learning (ICL) has emerged as a powerful capability alongside the development of scaled-up large language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks…

Computation and Language · Computer Science 2024-07-24 Quanyu Long, Yin Wu, Wenya Wang, Sinno Jialin Pan