Related papers: Learning to Search Effective Example Sequences for…

$Se^2$: Sequential Example Selection for In-Context Learning

The remarkable capability of large language models (LLMs) for in-context learning (ICL) needs to be activated by demonstration examples. Prior work has extensively explored the selection of examples for ICL, predominantly following the…

Computation and Language · Computer Science 2024-06-07 Haoyu Liu , Jianfeng Liu , Shaohan Huang , Yuefeng Zhan , Hao Sun , Weiwei Deng , Furu Wei , Qi Zhang

Large Language Models Know What Makes Exemplary Contexts

In-context learning (ICL) has proven to be a significant capability with the advancement of Large Language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without…

Computation and Language · Computer Science 2024-08-21 Quanyu Long , Jianda Chen , Wenya Wang , Sinno Jialin Pan

In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting

In-context learning (ICL) of large language models (LLMs) has attracted increasing attention in the community where LLMs make predictions only based on instructions augmented with a few examples. Existing example selection methods for ICL…

Computation and Language · Computer Science 2024-08-26 Haowei Du , Dongyan Zhao

Revisiting In-Context Learning with Long Context Language Models

In-Context Learning (ICL) is a technique by which language models make predictions based on examples provided in their input context. Previously, their context window size imposed a limit on the number of examples that can be shown, making…

Computation and Language · Computer Science 2025-05-29 Jinheon Baek , Sun Jae Lee , Prakhar Gupta , Geunseob Oh , Siddharth Dalmia , Prateek Kolhar

Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities

Large language models (LLMs) can adapt to new tasks through in-context learning (ICL) based on a few examples presented in dialogue history without any model parameter update. Despite such convenience, the performance of ICL heavily depends…

Computation and Language · Computer Science 2024-06-18 Siyin Wang , Chao-Han Huck Yang , Ji Wu , Chao Zhang

Sequence-to-Sequence Learning as Beam-Search Optimization

Sequence-to-Sequence (seq2seq) modeling has rapidly become an important general-purpose NLP tool that has proven effective for many text-generation and sequence-labeling tasks. Seq2seq builds on deep neural language modeling and inherits…

Computation and Language · Computer Science 2016-11-11 Sam Wiseman , Alexander M. Rush

A Learning Search Algorithm for the Restricted Longest Common Subsequence Problem

This paper addresses the Restricted Longest Common Subsequence (RLCS) problem, an extension of the well-known Longest Common Subsequence (LCS) problem. This problem has significant applications in bioinformatics, particularly for…

Artificial Intelligence · Computer Science 2024-10-17 Marko Djukanović , Jaume Reixach , Ana Nikolikj , Tome Eftimov , Aleksandar Kartelj , Christian Blum

Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning

Large Language Models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities from few-shot demonstration exemplars. While recent learning-based demonstration selection methods have proven beneficial to ICL by choosing…

Machine Learning · Computer Science 2024-10-16 Hui Liu , Wenya Wang , Hao Sun , Chris Xing Tian , Chenqi Kong , Xin Dong , Haoliang Li

Context-Enhanced Contrastive Search for Improved LLM Text Generation

Recently, Large Language Models (LLMs) have demonstrated remarkable advancements in Natural Language Processing (NLP). However, generating high-quality text that balances coherence, diversity, and relevance remains challenging. Traditional…

Computation and Language · Computer Science 2025-05-01 Jaydip Sen , Rohit Pandey , Hetvi Waghela

Coverage-based Example Selection for In-Context Learning

In-context learning (ICL), the ability of large language models to perform novel tasks by conditioning on a prompt with a few task examples, requires these examples to be informative about the test instance. The standard approach of…

Computation and Language · Computer Science 2023-11-08 Shivanshu Gupta , Matt Gardner , Sameer Singh

Learning to Retrieve In-Context Examples for Large Language Models

Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the…

Computation and Language · Computer Science 2024-01-29 Liang Wang , Nan Yang , Furu Wei

Complementary Explanations for Effective In-Context Learning

Large language models (LLMs) have exhibited remarkable capabilities in learning from explanations in prompts, but there has been limited understanding of exactly how these explanations function or why they are effective. This work aims to…

Computation and Language · Computer Science 2023-06-14 Xi Ye , Srinivasan Iyer , Asli Celikyilmaz , Ves Stoyanov , Greg Durrett , Ramakanth Pasunuru

RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning

Recent developments in large pre-trained language models have enabled unprecedented performance on a variety of downstream tasks. Achieving best performance with these models often leverages in-context learning, where a model performs a…

Computation and Language · Computer Science 2024-04-17 Alexander Scarlatos , Andrew Lan

Finding Support Examples for In-Context Learning

Additionally, the strong dependency among in-context examples makes it an NP-hard combinatorial optimization problem and enumerating all permutations is infeasible. Hence we propose LENS, a fiLter-thEN-Search method to tackle this challenge…

Computation and Language · Computer Science 2023-10-10 Xiaonan Li , Xipeng Qiu

Lost in Sequence: Do Large Language Models Understand Sequential Recommendation?

Large Language Models (LLMs) have recently emerged as promising tools for recommendation thanks to their advanced textual understanding ability and context-awareness. Despite the current practice of training and evaluating LLM-based…

Information Retrieval · Computer Science 2025-06-12 Sein Kim , Hongseok Kang , Kibum Kim , Jiwan Kim , Donghyun Kim , Minchul Yang , Kwangjin Oh , Julian McAuley , Chanyoung Park

Learning vs Retrieval: The Role of In-Context Examples in Regression with Large Language Models

Generative Large Language Models (LLMs) are capable of being in-context learners. However, the underlying mechanism of in-context learning (ICL) is still a major research question, and experimental research results about how models exploit…

Computation and Language · Computer Science 2025-02-11 Aliakbar Nafar , Kristen Brent Venable , Parisa Kordjamshidi

Efficient Sequence Training of Attention Models using Approximative Recombination

Sequence discriminative training is a great tool to improve the performance of an automatic speech recognition system. It does, however, necessitate a sum over all possible word sequences, which is intractable to compute in practice.…

Computation and Language · Computer Science 2022-04-22 Nils-Philipp Wynands , Wilfried Michel , Jan Rosendahl , Ralf Schlüter , Hermann Ney

Learning To Retrieve Prompts for In-Context Learning

In-context learning is a recent paradigm in natural language understanding, where a large pre-trained language model (LM) observes a test instance and a few training examples as its input, and directly decodes the output without any update…

Computation and Language · Computer Science 2022-05-10 Ohad Rubin , Jonathan Herzig , Jonathan Berant

GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks

In-context Learning (ICL) is the ability of Large Language Models (LLMs) to perform new tasks when conditioned on prompts comprising a few task examples. However, ICL performance can be critically sensitive to the choice of examples. To…

Computation and Language · Computer Science 2024-02-23 Shivanshu Gupta , Clemens Rosenbaum , Ethan R. Elenberg

Improving In-Context Learning with Small Language Model Ensembles

Large language models (LLMs) have shown impressive capabilities across various tasks, but their performance on domain-specific tasks remains limited. While methods like retrieval augmented generation and fine-tuning can help to address…

Computation and Language · Computer Science 2024-12-23 M. Mehdi Mojarradi , Lingyi Yang , Robert McCraith , Adam Mahdi