Related papers: Context-Aware Meta-Learning

Large (Vision) Language Models are Unsupervised In-Context Learners

Recent advances in large language and vision-language models have enabled zero-shot inference, allowing models to solve new tasks without task-specific training. Various adaptation techniques such as prompt engineering, In-Context Learning…

Machine Learning · Computer Science 2025-04-04 Artyom Gadetsky , Andrei Atanov , Yulun Jiang , Zhitong Gao , Ghazal Hosseini Mighan , Amir Zamir , Maria Brbic

Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

Large pre-trained language models (LMs) such as GPT-3 have acquired a surprising ability to perform zero-shot learning. For example, to classify sentiment without any training examples, we can "prompt" the LM with the review and the label…

Computation and Language · Computer Science 2021-09-09 Ruiqi Zhong , Kristy Lee , Zheng Zhang , Dan Klein

Meta-learning via Language Model In-context Tuning

The goal of meta-learning is to learn to adapt to a new task with only a few labeled examples. To tackle this problem in NLP, we propose $\textit{in-context tuning}$, which recasts adaptation and prediction as a simple sequence prediction…

Computation and Language · Computer Science 2022-04-13 Yanda Chen , Ruiqi Zhong , Sheng Zha , George Karypis , He He

What Makes Good Examples for Visual In-Context Learning?

Large-scale models trained on broad data have recently become the mainstream architecture in computer vision due to their strong generalization performance. In this paper, the main focus is on an emergent ability in large vision models,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-02 Yuanhan Zhang , Kaiyang Zhou , Ziwei Liu

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning

In recent years, pre-trained large language models (LLMs) have demonstrated remarkable efficiency in achieving an inference-time few-shot learning capability known as in-context learning. However, existing literature has highlighted the…

Computation and Language · Computer Science 2024-02-14 Xinyi Wang , Wanrong Zhu , Michael Saxon , Mark Steyvers , William Yang Wang

Meta-in-context learning in large language models

Large language models have shown tremendous performance in a variety of tasks. In-context learning -- the ability to improve at a task after being provided with a number of demonstrations -- is seen as one of the main contributors to their…

Computation and Language · Computer Science 2023-05-23 Julian Coda-Forno , Marcel Binz , Zeynep Akata , Matthew Botvinick , Jane X. Wang , Eric Schulz

Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning

Large Language Models (LLMs), with their remarkable ability to tackle challenging and unseen reasoning problems, hold immense potential for tabular learning, that is vital for many real-world applications. In this paper, we propose a novel…

Machine Learning · Computer Science 2024-05-07 Sungwon Han , Jinsung Yoon , Sercan O Arik , Tomas Pfister

GPT4Image: Large Pre-trained Models Help Vision Models Learn Better on Perception Task

The upsurge in pre-trained large models started by ChatGPT has swept across the entire deep learning community. Such powerful models demonstrate advanced generative ability and multimodal understanding capability, which quickly set new…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Ning Ding , Yehui Tang , Zhongqian Fu , Chao Xu , Kai Han , Yunhe Wang

MetaICL: Learning to Learn In Context

We introduce MetaICL (Meta-training for In-Context Learning), a new meta-training framework for few-shot learning where a pretrained language model is tuned to do in-context learning on a large set of training tasks. This meta-training…

Computation and Language · Computer Science 2022-05-04 Sewon Min , Mike Lewis , Luke Zettlemoyer , Hannaneh Hajishirzi

In-Context Learning for Text Classification with Many Labels

In-context learning (ICL) using large language models for tasks with many labels is challenging due to the limited context window, which makes it difficult to fit a sufficient number of examples in the prompt. In this paper, we use a…

Computation and Language · Computer Science 2023-12-07 Aristides Milios , Siva Reddy , Dzmitry Bahdanau

Meta Learning Deep Visual Words for Fast Video Object Segmentation

Personal robots and driverless cars need to be able to operate in novel environments and thus quickly and efficiently learn to recognise new object classes. We address this problem by considering the task of video object segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2020-08-18 Harkirat Singh Behl , Mohammad Najafi , Anurag Arnab , Philip H. S. Torr

Large Language Models in Wireless Application Design: In-Context Learning-enhanced Automatic Network Intrusion Detection

Large language models (LLMs), especially generative pre-trained transformers (GPTs), have recently demonstrated outstanding ability in information comprehension and problem-solving. This has motivated many studies in applying LLMs to…

Machine Learning · Computer Science 2024-05-21 Han Zhang , Akram Bin Sediq , Ali Afana , Melike Erol-Kantarci

Why Can GPT Learn In-Context? Language Models Implicitly Perform Gradient Descent as Meta-Optimizers

Large pretrained language models have shown surprising in-context learning (ICL) ability. With a few demonstration input-label pairs, they can predict the label for an unseen input without parameter updates. Despite the great success in…

Computation and Language · Computer Science 2023-05-16 Damai Dai , Yutao Sun , Li Dong , Yaru Hao , Shuming Ma , Zhifang Sui , Furu Wei

Enhancing In-Context Learning with Answer Feedback for Multi-Span Question Answering

Whereas the recent emergence of large language models (LLMs) like ChatGPT has exhibited impressive general performance, it still has a large gap with fully-supervised models on specific tasks such as multi-span question answering. Previous…

Computation and Language · Computer Science 2023-06-08 Zixian Huang , Jiaying Zhou , Gengyang Xiao , Gong Cheng

Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models

Transformer-based large language models exhibit in-context learning, enabling adaptation to downstream tasks via few-shot prompting with demonstrations. In practice, such models are often fine-tuned to improve zero-shot performance on…

Computation and Language · Computer Science 2026-02-27 Chungpa Lee , Jy-yong Sohn , Kangwook Lee

Learning Electromagnetic Metamaterial Physics With ChatGPT

Large language models (LLMs) such as ChatGPT, Gemini, LlaMa, and Claude are trained on massive quantities of text parsed from the internet and have shown a remarkable ability to respond to complex prompts in a manner often indistinguishable…

Optics · Physics 2025-04-01 Darui Lu , Yang Deng , Jordan M. Malof , Willie J. Padilla

Unsupervised Meta-Learning via In-Context Learning

Unsupervised meta-learning aims to learn feature representations from unsupervised datasets that can transfer to downstream tasks with limited labeled data. In this paper, we propose a novel approach to unsupervised meta-learning that…

Machine Learning · Computer Science 2025-02-11 Anna Vettoruzzo , Lorenzo Braccaioli , Joaquin Vanschoren , Marlena Nowaczyk

Estimating Large Language Model Capabilities without Labeled Test Data

Large Language Models (LLMs) have the impressive ability to perform in-context learning (ICL) from only a few examples, but the success of ICL varies widely from task to task. Thus, it is important to quickly determine whether ICL is…

Computation and Language · Computer Science 2023-10-27 Harvey Yiyun Fu , Qinyuan Ye , Albert Xu , Xiang Ren , Robin Jia

FineText: Text Classification via Attention-based Language Model Fine-tuning

Training deep neural networks from scratch on natural language processing (NLP) tasks requires significant amount of manually labeled text corpus and substantial time to converge, which usually cannot be satisfied by the customers. In this…

Computation and Language · Computer Science 2019-10-29 Yunzhe Tao , Saurabh Gupta , Satyapriya Krishna , Xiong Zhou , Orchid Majumder , Vineet Khare

CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning

Recent advancements in Contrastive Language-Image Pre-training (CLIP) have demonstrated notable success in self-supervised representation learning across various tasks. However, the existing CLIP-like approaches often demand extensive GPU…

Computer Vision and Pattern Recognition · Computer Science 2024-07-31 Yuexi Du , Brian Chang , Nicha C. Dvornek