Related papers: Improving and Simplifying Pattern Exploiting Train…

Active PETs: Active Data Annotation Prioritisation for Few-Shot Claim Verification with Pattern Exploiting Training

To mitigate the impact of the scarcity of labelled data on fact-checking systems, we focus on few-shot claim verification. Despite recent work on few-shot classification by proposing advanced language models, there is a dearth of research…

Computation and Language · Computer Science 2022-10-12 Xia Zeng, Arkaitz Zubiaga

Few-shot learning for sentence pair classification and its applications in software engineering

Few-shot learning, the ability to train models with access to limited data, has become increasingly popular in the natural language processing (NLP) domain, as large language models such as GPT and T0 have been empirically shown to achieve…

Software Engineering · Computer Science 2023-06-16 Robert Kraig Helmeczi, Mucahit Cevik, Savas Yıldırım

Few-Shot Text Generation with Pattern-Exploiting Training

Providing pretrained language models with simple task descriptions in natural language enables them to solve some tasks in a fully unsupervised fashion. Moreover, when combined with regular learning from examples, this idea yields…

Computation and Language · Computer Science 2021-10-05 Timo Schick, Hinrich Schütze

Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference

Some NLP tasks can be solved in a fully unsupervised fashion by providing a pretrained language model with "task descriptions" in natural language (e.g., Radford et al., 2019). While this approach underperforms its supervised counterpart,…

Computation and Language · Computer Science 2021-01-26 Timo Schick, Hinrich Schütze

PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning

Few-shot learning and parameter-efficient fine-tuning (PEFT) are crucial to overcome the challenges of data scarcity and ever growing language model sizes. This applies in particular to specialized scientific domains, where researchers…

Computation and Language · Computer Science 2025-09-18 Jonas Rieger, Mattes Ruckdeschel, Gregor Wiedemann

Revisiting Self-Training for Few-Shot Learning of Language Model

As unlabeled data carry rich task-relevant information, they are proven useful for few-shot learning of language model. The question is how to effectively make use of such data. In this work, we revisit the self-training technique for…

Computation and Language · Computer Science 2021-10-05 Yiming Chen, Yan Zhang, Chen Zhang, Grandee Lee, Ran Cheng, Haizhou Li

Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning

Recent studies have revealed the intriguing few-shot learning ability of pretrained language models (PLMs): They can quickly adapt to a new task when fine-tuned on a small amount of labeled data formulated as prompts, without requiring…

Computation and Language · Computer Science 2023-05-15 Yu Meng, Martin Michalski, Jiaxin Huang, Yu Zhang, Tarek Abdelzaher, Jiawei Han

Evaluating the fairness of task-adaptive pretraining on unlabeled test data before few-shot text classification

Few-shot learning benchmarks are critical for evaluating modern NLP techniques. It is possible, however, that benchmarks favor methods which easily make use of unlabeled text, because researchers can use unlabeled text from the test set to…

Computation and Language · Computer Science 2024-10-03 Kush Dubey

Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models

Active learning is an iterative labeling process that is used to obtain a small labeled subset, despite the absence of labeled data, thereby making it possible to train a model for supervised tasks such as text classification. While active learning…

Computation and Language · Computer Science 2024-10-07 Christopher Schröder, Gerhard Heyer

Uncertainty-aware Parameter-Efficient Self-training for Semi-supervised Language Understanding

The recent success of large pre-trained language models (PLMs) heavily hinges on massive labeled data, which typically produces inferior performance in low-resource scenarios. To remedy this dilemma, we study self-training as one of the…

Machine Learning · Computer Science 2023-10-23 Jianing Wang, Qiushi Sun, Nuo Chen, Chengyu Wang, Jun Huang, Ming Gao, Xiang Li

The Benefits of Label-Description Training for Zero-Shot Text Classification

Pretrained language models have improved zero-shot text classification by allowing the transfer of semantic knowledge from the training data in order to classify among specific label sets in downstream tasks. We propose a simple way to…

Computation and Language · Computer Science 2023-10-24 Lingyu Gao, Debanjan Ghosh, Kevin Gimpel

Pre-Trained Vision-Language Models as Partial Annotators

Pre-trained vision-language models learn massive data to model unified representations of images and natural languages, which can be widely applied to downstream machine learning tasks. In addition to zero-shot inference, in order to better…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Qian-Wei Wang, Yuqiu Xie, Letian Zhang, Zimo Liu, Shu-Tao Xia

Few-shot Named Entity Recognition with Cloze Questions

Despite the huge and continuous advances in computational linguistics, the lack of annotated data for Named Entity Recognition (NER) is still a challenging issue, especially in low-resource languages and when domain knowledge is required…

Computation and Language · Computer Science 2021-11-25 Valerio La Gatta, Vincenzo Moscato, Marco Postiglione, Giancarlo Sperlì

Predictions For Pre-training Language Models

Language model pre-training has proven to be useful in many language understanding tasks. In this paper, we investigate whether it is still helpful to add the self-training method in the pre-training step and the fine-tuning step. Towards…

Computation and Language · Computer Science 2023-02-17 Tong Guo

Effectiveness of Pre-training for Few-shot Intent Classification

This paper investigates the effectiveness of pre-training for few-shot intent classification. While existing paradigms commonly further pre-train language models such as BERT on a vast amount of unlabeled corpus, we find it highly effective…

Computation and Language · Computer Science 2024-09-17 Haode Zhang, Yuwei Zhang, Li-Ming Zhan, Jiaxin Chen, Guangyuan Shi, Albert Y. S. Lam, Xiao-Ming Wu

Efficient Few-Shot Learning Without Prompts

Recent few-shot methods, such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET), have achieved impressive results in label-scarce settings. However, they are difficult to employ since they are subject to high…

Computation and Language · Computer Science 2022-09-23 Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat, Moshe Wasserblat, Oren Pereg

Towards Zero-Label Language Learning

This paper explores zero-label learning in Natural Language Processing (NLP), whereby no human-annotated data is used anywhere during training and models are trained purely on synthetic data. At the core of our framework is a novel approach…

Computation and Language · Computer Science 2021-09-21 Zirui Wang, Adams Wei Yu, Orhan Firat, Yuan Cao

Learning Instructions with Unlabeled Data for Zero-Shot Cross-Task Generalization

Training language models to learn from human instructions for zero-shot cross-task generalization has attracted much attention in NLP communities. Recently, instruction tuning (IT), which fine-tunes a pre-trained language model on a massive…

Computation and Language · Computer Science 2022-10-18 Yuxian Gu, Pei Ke, Xiaoyan Zhu, Minlie Huang

Meta-learning for Few-shot Natural Language Processing: A Survey

Few-shot natural language processing (NLP) refers to NLP tasks that are accompanied by merely a handful of labeled examples. This is a real-world challenge that an AI system must learn to handle. Usually we rely on collecting more…

Computation and Language · Computer Science 2020-07-21 Wenpeng Yin

Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning

Most existing few-shot learning (FSL) methods require a large amount of labeled data in meta-training, which is a major limit. To reduce the requirement of labels, a semi-supervised meta-training (SSMT) setting has been proposed for FSL,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xingping Dong, Tianran Ouyang, Shengcai Liao, Bo Du, Ling Shao