Related papers: Building Efficient Universal Classifiers with Natu…

Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference

While discriminative neural network classifiers are generally preferred, recent work has shown advantages of generative classifiers in term of data efficiency and robustness. In this paper, we focus on natural language inference (NLI). We…

Computation and Language · Computer Science 2020-10-09 Xiaoan Ding , Tianyu Liu , Baobao Chang , Zhifang Sui , Kevin Gimpel

Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification

Generative AI offers a simple, prompt-based alternative to fine-tuning smaller BERT-style LLMs for text classification tasks. This promises to eliminate the need for manually labeled training data and task-specific model training. However,…

Computation and Language · Computer Science 2024-08-19 Martin Juan José Bucher , Marco Martini

Large Language Models Are Zero-Shot Text Classifiers

Retrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science 2023-12-05 Zhiqiang Wang , Yiran Pang , Yanbin Lin

Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data

Large language models (LLMs) have demonstrated remarkable success in NLP tasks. However, there is a paucity of studies that attempt to evaluate their performances on social media-based health-related natural language processing tasks, which…

Computation and Language · Computer Science 2024-03-29 Yuting Guo , Anthony Ovadje , Mohammed Ali Al-Garadi , Abeed Sarker

It's All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers

While encoder-only models such as BERT and ModernBERT are ubiquitous in real-world NLP applications, their conventional reliance on task-specific classification heads can limit their applicability compared to decoder-based large language…

Computation and Language · Computer Science 2025-02-11 Benjamin Clavié , Nathan Cooper , Benjamin Warner

Leveraging Open-Source Large Language Models for Native Language Identification

Native Language Identification (NLI) - the task of identifying the native language (L1) of a person based on their writing in the second language (L2) - has applications in forensics, marketing, and second language acquisition.…

Computation and Language · Computer Science 2025-01-22 Yee Man Ng , Ilia Markov

Universal Language Model Fine-tuning for Text Classification

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective…

Computation and Language · Computer Science 2018-05-24 Jeremy Howard , Sebastian Ruder

Text Classification in the LLM Era -- Where do we stand?

Large Language Models revolutionized NLP and showed dramatic performance improvements across several tasks. In this paper, we investigated the role of such language models in text classification and how they compare with other approaches…

Computation and Language · Computer Science 2025-02-21 Sowmya Vajjala , Shwetali Shimangaud

Generating Training Data with Language Models: Towards Zero-Shot Language Understanding

Pretrained language models (PLMs) have demonstrated remarkable performance in various natural language processing tasks: Unidirectional PLMs (e.g., GPT) are well known for their superior text generation capabilities; bidirectional PLMs…

Computation and Language · Computer Science 2022-10-13 Yu Meng , Jiaxin Huang , Yu Zhang , Jiawei Han

Lessons from the Use of Natural Language Inference (NLI) in Requirements Engineering Tasks

We investigate the use of Natural Language Inference (NLI) in automating requirements engineering tasks. In particular, we focus on three tasks: requirements classification, identification of requirements specification defects, and…

Software Engineering · Computer Science 2024-05-09 Mohamad Fazelnia , Viktoria Koscinski , Spencer Herzog , Mehdi Mirakhorli

Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science

Instruction-tuned Large Language Models (LLMs) have exhibited impressive language understanding and the capacity to generate responses that follow specific prompts. However, due to the computational demands associated with training these…

Computation and Language · Computer Science 2024-03-26 Yida Mu , Ben P. Wu , William Thorne , Ambrose Robinson , Nikolaos Aletras , Carolina Scarton , Kalina Bontcheva , Xingyi Song

Language Models for Text Classification: Is In-Context Learning Enough?

Recent foundational language models have shown state-of-the-art performance in many NLP tasks in zero- and few-shot settings. An advantage of these models over more standard approaches based on fine-tuning is the ability to understand…

Computation and Language · Computer Science 2024-04-16 Aleksandra Edwards , Jose Camacho-Collados

Universal Self-Adaptive Prompting

A hallmark of modern large language models (LLMs) is their impressive general zero-shot and few-shot abilities, often elicited through in-context learning (ICL) via prompting. However, while highly coveted and being the most general,…

Computation and Language · Computer Science 2023-10-24 Xingchen Wan , Ruoxi Sun , Hootan Nakhost , Hanjun Dai , Julian Martin Eisenschlos , Sercan O. Arik , Tomas Pfister

Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection

Large language models (LLMs) are very proficient text generators. We leverage this capability of LLMs to generate task-specific data via zero-shot prompting and promote cross-lingual transfer for low-resource target languages. Given…

Computation and Language · Computer Science 2024-07-16 Barah Fazili , Ashish Sunil Agrawal , Preethi Jyothi

Zero-shot Text Classification With Generative Language Models

This work investigates the use of natural language to enable zero-shot model adaptation to new tasks. We use text and metadata from social commenting platforms as a source for a simple pretraining task. We then provide the language model…

Computation and Language · Computer Science 2019-12-24 Raul Puri , Bryan Catanzaro

Deep Natural Language Feature Learning for Interpretable Prediction

We propose a general method to break down a main complex task into a set of intermediary easier sub-tasks, which are formulated in natural language as binary questions related to the final target task. Our method allows for representing…

Computation and Language · Computer Science 2024-02-02 Felipe Urrutia , Cristian Buc , Valentin Barriere

ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval

With the development of large language models (LLMs), zero-shot learning has attracted much attention for various NLP tasks. Different from prior works that generate training data with billion-scale natural language generation (NLG) models,…

Computation and Language · Computer Science 2023-05-19 Yue Yu , Yuchen Zhuang , Rongzhi Zhang , Yu Meng , Jiaming Shen , Chao Zhang

What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models

Large language models (LLMs) have been effectively used for many computer vision tasks, including image classification. In this paper, we present a simple yet effective approach for zero-shot image classification using multimodal LLMs.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Abdelrahman Abdelhamed , Mahmoud Afifi , Alec Go

On Bilingual Lexicon Induction with Large Language Models

Bilingual Lexicon Induction (BLI) is a core task in multilingual NLP that still, to a large extent, relies on calculating cross-lingual word representations. Inspired by the global paradigm shift in NLP towards Large Language Models (LLMs),…

Computation and Language · Computer Science 2024-02-27 Yaoyiran Li , Anna Korhonen , Ivan Vulić

Large Language Models as Universal Predictors? An Empirical Study on Small Tabular Datasets

Large Language Models (LLMs), originally developed for natural language processing (NLP), have demonstrated the potential to generalize across modalities and domains. With their in-context learning (ICL) capabilities, LLMs can perform…

Artificial Intelligence · Computer Science 2025-08-26 Nikolaos Pavlidis , Vasilis Perifanis , Symeon Symeonidis , Pavlos S. Efraimidis