Related papers: Zero-Shot Text Classification with Self-Training

The Benefits of Label-Description Training for Zero-Shot Text Classification

Pretrained language models have improved zero-shot text classification by allowing the transfer of semantic knowledge from the training data in order to classify among specific label sets in downstream tasks. We propose a simple way to…

Computation and Language · Computer Science 2023-10-24 Lingyu Gao , Debanjan Ghosh , Kevin Gimpel

Zero-Shot Text Classification via Self-Supervised Tuning

Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this…

Computation and Language · Computer Science 2023-05-26 Chaoqun Liu , Wenxuan Zhang , Guizhen Chen , Xiaobao Wu , Anh Tuan Luu , Chip Hong Chang , Lidong Bing

Zero-shot Text Classification With Generative Language Models

This work investigates the use of natural language to enable zero-shot model adaptation to new tasks. We use text and metadata from social commenting platforms as a source for a simple pretraining task. We then provide the language model…

Computation and Language · Computer Science 2019-12-24 Raul Puri , Bryan Catanzaro

Train Once, Test Anywhere: Zero-Shot Learning for Text Classification

Zero-shot Learners are models capable of predicting unseen classes. In this work, we propose a Zero-shot Learning approach for text categorization. Our method involves training model on a large corpus of sentences to learn the relationship…

Computation and Language · Computer Science 2017-12-27 Pushpankar Kumar Pushp , Muktabh Mayank Srivastava

Label Agnostic Pre-training for Zero-shot Text Classification

Conventional approaches to text classification typically assume the existence of a fixed set of predefined labels to which a given text can be classified. However, in real-world applications, there exists an infinite label space for…

Computation and Language · Computer Science 2023-05-29 Christopher Clarke , Yuzhao Heng , Yiping Kang , Krisztian Flautner , Lingjia Tang , Jason Mars

Uncertainty-aware Self-training for Text Classification with Few Labels

Recent success of large-scale pre-trained language models crucially hinge on fine-tuning them on large amounts of labeled data for the downstream task, that are typically expensive to acquire. In this work, we study self-training as one of…

Computation and Language · Computer Science 2020-06-30 Subhabrata Mukherjee , Ahmed Hassan Awadallah

Flexible Job Classification with Zero-Shot Learning

Using a taxonomy to organize information requires classifying objects (documents, images, etc) with appropriate taxonomic classes. The flexible nature of zero-shot learning is appealing for this task because it allows classifiers to…

Computation and Language · Computer Science 2022-09-27 Thom Lake

Large Language Models Are Zero-Shot Text Classifiers

Retrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science 2023-12-05 Zhiqiang Wang , Yiran Pang , Yanbin Lin

Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective

We propose a new paradigm for zero-shot learners that is format agnostic, i.e., it is compatible with any format and applicable to a list of language tasks, such as text classification, commonsense reasoning, coreference resolution, and…

Computation and Language · Computer Science 2022-10-19 Ping Yang , Junjie Wang , Ruyi Gan , Xinyu Zhu , Lin Zhang , Ziwei Wu , Xinyu Gao , Jiaxing Zhang , Tetsuya Sakai

Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models

Zero-shot audio classification aims to recognize and classify a sound class that the model has never seen during training. This paper presents a novel approach for zero-shot audio classification using automatically generated sound attribute…

Sound · Computer Science 2024-07-22 Xuenan Xu , Pingyue Zhang , Ming Yan , Ji Zhang , Mengyue Wu

What Makes Pre-trained Language Models Better Zero-shot Learners?

Current methods for prompt learning in zeroshot scenarios widely rely on a development set with sufficient human-annotated data to select the best-performing prompt template a posteriori. This is not ideal because in a realworld zero-shot…

Computation and Language · Computer Science 2023-05-17 Jinghui Lu , Dongsheng Zhu , Weidong Han , Rui Zhao , Brian Mac Namee , Fei Tan

Zero-Shot Robustification of Zero-Shot Models

Zero-shot inference is a powerful paradigm that enables the use of large pretrained models for downstream classification tasks without further training. However, these models are vulnerable to inherited biases that can impact their…

Machine Learning · Computer Science 2024-02-13 Dyah Adila , Changho Shin , Linrong Cai , Frederic Sala

Learning to Name Classes for Vision and Language Models

Large scale vision and language models can achieve impressive zero-shot recognition performance by mapping class specific text queries to image content. Two distinct challenges that remain however, are high sensitivity to the choice of…

Computer Vision and Pattern Recognition · Computer Science 2023-04-05 Sarah Parisot , Yongxin Yang , Steven McDonagh

Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing

Few-shot and zero-shot text classification aim to recognize samples from novel classes with limited labeled samples or no labeled samples at all. While prevailing methods have shown promising performance via transferring knowledge from seen…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Han Liu , Siyang Zhao , Xiaotong Zhang , Feng Zhang , Wei Wang , Fenglong Ma , Hongyang Chen , Hong Yu , Xianchao Zhang

Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

Large pre-trained language models (LMs) such as GPT-3 have acquired a surprising ability to perform zero-shot learning. For example, to classify sentiment without any training examples, we can "prompt" the LM with the review and the label…

Computation and Language · Computer Science 2021-09-09 Ruiqi Zhong , Kristy Lee , Zheng Zhang , Dan Klein

Text2Model: Text-based Model Induction for Zero-shot Image Classification

We address the challenge of building task-agnostic classifiers using only text descriptions, demonstrating a unified approach to image classification, 3D point cloud classification, and action recognition from scenes. Unlike approaches that…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Ohad Amosy , Tomer Volk , Eilam Shapira , Eyal Ben-David , Roi Reichart , Gal Chechik

Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models

Classifying scanned documents is a challenging problem that involves image, layout, and text analysis for document understanding. Nevertheless, for certain benchmark datasets, notably RVL-CDIP, the state of the art is closing in to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Anna Scius-Bertrand , Michael Jungo , Lars Vögtlin , Jean-Marc Spat , Andreas Fischer

Language Models for Text Classification: Is In-Context Learning Enough?

Recent foundational language models have shown state-of-the-art performance in many NLP tasks in zero- and few-shot settings. An advantage of these models over more standard approaches based on fine-tuning is the ability to understand…

Computation and Language · Computer Science 2024-04-16 Aleksandra Edwards , Jose Camacho-Collados

A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks

Autoregressive language models, pretrained using large text corpora to do well on next word prediction, have been successful at solving many downstream tasks, even with zero-shot usage. However, there is little theoretical understanding of…

Computation and Language · Computer Science 2021-04-15 Nikunj Saunshi , Sadhika Malladi , Sanjeev Arora

Beyond Prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Representations

Recent work has demonstrated that pre-trained language models (PLMs) are zero-shot learners. However, most existing zero-shot methods involve heavy human engineering or complicated self-training pipelines, hindering their application to new…

Computation and Language · Computer Science 2022-11-24 Yu Fei , Ping Nie , Zhao Meng , Roger Wattenhofer , Mrinmaya Sachan