Related papers: Learning to Search Effective Example Sequences for…

200 papers

The remarkable capability of large language models (LLMs) for in-context learning (ICL) needs to be activated by demonstration examples. Prior work has extensively explored the selection of examples for ICL, predominantly following the…

Computation and Language · Computer Science · 2024-06-07 · Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang

In-context learning (ICL) has proven to be a significant capability with the advancement of Large Language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without…

Computation and Language · Computer Science · 2024-08-21 · Quanyu Long, Jianda Chen, Wenya Wang, Sinno Jialin Pan

In-context learning (ICL) of large language models (LLMs) has attracted increasing attention in the community where LLMs make predictions only based on instructions augmented with a few examples. Existing example selection methods for ICL…

Computation and Language · Computer Science · 2024-08-26 · Haowei Du, Dongyan Zhao

In-Context Learning (ICL) is a technique by which language models make predictions based on examples provided in their input context. Previously, their context window size imposed a limit on the number of examples that can be shown, making…

Computation and Language · Computer Science · 2025-05-29 · Jinheon Baek, Sun Jae Lee, Prakhar Gupta, Geunseob Oh, Siddharth Dalmia, Prateek Kolhar

Large language models (LLMs) can adapt to new tasks through in-context learning (ICL) based on a few examples presented in dialogue history without any model parameter update. Despite such convenience, the performance of ICL heavily depends…

Computation and Language · Computer Science · 2024-06-18 · Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang

Sequence-to-Sequence (seq2seq) modeling has rapidly become an important general-purpose NLP tool that has proven effective for many text-generation and sequence-labeling tasks. Seq2seq builds on deep neural language modeling and inherits…

Computation and Language · Computer Science · 2016-11-11 · Sam Wiseman, Alexander M. Rush

This paper addresses the Restricted Longest Common Subsequence (RLCS) problem, an extension of the well-known Longest Common Subsequence (LCS) problem. This problem has significant applications in bioinformatics, particularly for…

Artificial Intelligence · Computer Science · 2024-10-17 · Marko Djukanović, Jaume Reixach, Ana Nikolikj, Tome Eftimov, Aleksandar Kartelj, Christian Blum

Large Language Models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities from few-shot demonstration exemplars. While recent learning-based demonstration selection methods have proven beneficial to ICL by choosing…

Machine Learning · Computer Science · 2024-10-16 · Hui Liu, Wenya Wang, Hao Sun, Chris Xing Tian, Chenqi Kong, Xin Dong, Haoliang Li

Recently, Large Language Models (LLMs) have demonstrated remarkable advancements in Natural Language Processing (NLP). However, generating high-quality text that balances coherence, diversity, and relevance remains challenging. Traditional…

Computation and Language · Computer Science · 2025-05-01 · Jaydip Sen, Rohit Pandey, Hetvi Waghela

In-context learning (ICL), the ability of large language models to perform novel tasks by conditioning on a prompt with a few task examples, requires these examples to be informative about the test instance. The standard approach of…

Computation and Language · Computer Science · 2023-11-08 · Shivanshu Gupta, Matt Gardner, Sameer Singh

Large language models (LLMs) have demonstrated their ability to learn in-context, allowing them to perform various tasks based on a few input-output examples. However, the effectiveness of in-context learning is heavily reliant on the…

Computation and Language · Computer Science · 2024-01-29 · Liang Wang, Nan Yang, Furu Wei

Large language models (LLMs) have exhibited remarkable capabilities in learning from explanations in prompts, but there has been limited understanding of exactly how these explanations function or why they are effective. This work aims to…

Computation and Language · Computer Science · 2023-06-14 · Xi Ye, Srinivasan Iyer, Asli Celikyilmaz, Ves Stoyanov, Greg Durrett, Ramakanth Pasunuru

Recent developments in large pre-trained language models have enabled unprecedented performance on a variety of downstream tasks. Achieving best performance with these models often leverages in-context learning, where a model performs a…

Computation and Language · Computer Science · 2024-04-17 · Alexander Scarlatos, Andrew Lan

Additionally, the strong dependency among in-context examples makes it an NP-hard combinatorial optimization problem and enumerating all permutations is infeasible. Hence we propose LENS, a fiLter-thEN-Search method to tackle this challenge…

Computation and Language · Computer Science · 2023-10-10 · Xiaonan Li, Xipeng Qiu

Large Language Models (LLMs) have recently emerged as promising tools for recommendation thanks to their advanced textual understanding ability and context-awareness. Despite the current practice of training and evaluating LLM-based…

Information Retrieval · Computer Science · 2025-06-12 · Sein Kim, Hongseok Kang, Kibum Kim, Jiwan Kim, Donghyun Kim, Minchul Yang, Kwangjin Oh, Julian McAuley, Chanyoung Park

Generative Large Language Models (LLMs) are capable of being in-context learners. However, the underlying mechanism of in-context learning (ICL) is still a major research question, and experimental research results about how models exploit…

Computation and Language · Computer Science · 2025-02-11 · Aliakbar Nafar, Kristen Brent Venable, Parisa Kordjamshidi

Sequence discriminative training is a great tool to improve the performance of an automatic speech recognition system. It does, however, necessitate a sum over all possible word sequences, which is intractable to compute in practice.…

Computation and Language · Computer Science · 2022-04-22 · Nils-Philipp Wynands, Wilfried Michel, Jan Rosendahl, Ralf Schlüter, Hermann Ney

In-context learning is a recent paradigm in natural language understanding, where a large pre-trained language model (LM) observes a test instance and a few training examples as its input, and directly decodes the output without any update…

Computation and Language · Computer Science · 2022-05-10 · Ohad Rubin, Jonathan Herzig, Jonathan Berant

In-context Learning (ICL) is the ability of Large Language Models (LLMs) to perform new tasks when conditioned on prompts comprising a few task examples. However, ICL performance can be critically sensitive to the choice of examples. To…

Computation and Language · Computer Science · 2024-02-23 · Shivanshu Gupta, Clemens Rosenbaum, Ethan R. Elenberg

Large language models (LLMs) have shown impressive capabilities across various tasks, but their performance on domain-specific tasks remains limited. While methods like retrieval augmented generation and fine-tuning can help to address…

Computation and Language · Computer Science · 2024-12-23 · M. Mehdi Mojarradi, Lingyi Yang, Robert McCraith, Adam Mahdi