English
Related papers

Related papers: Building Efficient Universal Classifiers with Natu…

200 papers

While discriminative neural network classifiers are generally preferred, recent work has shown advantages of generative classifiers in term of data efficiency and robustness. In this paper, we focus on natural language inference (NLI). We…

Computation and Language · Computer Science 2020-10-09 Xiaoan Ding , Tianyu Liu , Baobao Chang , Zhifang Sui , Kevin Gimpel

Generative AI offers a simple, prompt-based alternative to fine-tuning smaller BERT-style LLMs for text classification tasks. This promises to eliminate the need for manually labeled training data and task-specific model training. However,…

Computation and Language · Computer Science 2024-08-19 Martin Juan José Bucher , Marco Martini

Retrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science 2023-12-05 Zhiqiang Wang , Yiran Pang , Yanbin Lin

Large language models (LLMs) have demonstrated remarkable success in NLP tasks. However, there is a paucity of studies that attempt to evaluate their performances on social media-based health-related natural language processing tasks, which…

Computation and Language · Computer Science 2024-03-29 Yuting Guo , Anthony Ovadje , Mohammed Ali Al-Garadi , Abeed Sarker

While encoder-only models such as BERT and ModernBERT are ubiquitous in real-world NLP applications, their conventional reliance on task-specific classification heads can limit their applicability compared to decoder-based large language…

Computation and Language · Computer Science 2025-02-11 Benjamin Clavié , Nathan Cooper , Benjamin Warner

Native Language Identification (NLI) - the task of identifying the native language (L1) of a person based on their writing in the second language (L2) - has applications in forensics, marketing, and second language acquisition.…

Computation and Language · Computer Science 2025-01-22 Yee Man Ng , Ilia Markov

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective…

Computation and Language · Computer Science 2018-05-24 Jeremy Howard , Sebastian Ruder

Large Language Models revolutionized NLP and showed dramatic performance improvements across several tasks. In this paper, we investigated the role of such language models in text classification and how they compare with other approaches…

Computation and Language · Computer Science 2025-02-21 Sowmya Vajjala , Shwetali Shimangaud

Pretrained language models (PLMs) have demonstrated remarkable performance in various natural language processing tasks: Unidirectional PLMs (e.g., GPT) are well known for their superior text generation capabilities; bidirectional PLMs…

Computation and Language · Computer Science 2022-10-13 Yu Meng , Jiaxin Huang , Yu Zhang , Jiawei Han

We investigate the use of Natural Language Inference (NLI) in automating requirements engineering tasks. In particular, we focus on three tasks: requirements classification, identification of requirements specification defects, and…

Software Engineering · Computer Science 2024-05-09 Mohamad Fazelnia , Viktoria Koscinski , Spencer Herzog , Mehdi Mirakhorli

Instruction-tuned Large Language Models (LLMs) have exhibited impressive language understanding and the capacity to generate responses that follow specific prompts. However, due to the computational demands associated with training these…

Computation and Language · Computer Science 2024-03-26 Yida Mu , Ben P. Wu , William Thorne , Ambrose Robinson , Nikolaos Aletras , Carolina Scarton , Kalina Bontcheva , Xingyi Song

Recent foundational language models have shown state-of-the-art performance in many NLP tasks in zero- and few-shot settings. An advantage of these models over more standard approaches based on fine-tuning is the ability to understand…

Computation and Language · Computer Science 2024-04-16 Aleksandra Edwards , Jose Camacho-Collados

A hallmark of modern large language models (LLMs) is their impressive general zero-shot and few-shot abilities, often elicited through in-context learning (ICL) via prompting. However, while highly coveted and being the most general,…

Computation and Language · Computer Science 2023-10-24 Xingchen Wan , Ruoxi Sun , Hootan Nakhost , Hanjun Dai , Julian Martin Eisenschlos , Sercan O. Arik , Tomas Pfister

Large language models (LLMs) are very proficient text generators. We leverage this capability of LLMs to generate task-specific data via zero-shot prompting and promote cross-lingual transfer for low-resource target languages. Given…

Computation and Language · Computer Science 2024-07-16 Barah Fazili , Ashish Sunil Agrawal , Preethi Jyothi

This work investigates the use of natural language to enable zero-shot model adaptation to new tasks. We use text and metadata from social commenting platforms as a source for a simple pretraining task. We then provide the language model…

Computation and Language · Computer Science 2019-12-24 Raul Puri , Bryan Catanzaro

We propose a general method to break down a main complex task into a set of intermediary easier sub-tasks, which are formulated in natural language as binary questions related to the final target task. Our method allows for representing…

Computation and Language · Computer Science 2024-02-02 Felipe Urrutia , Cristian Buc , Valentin Barriere

With the development of large language models (LLMs), zero-shot learning has attracted much attention for various NLP tasks. Different from prior works that generate training data with billion-scale natural language generation (NLG) models,…

Computation and Language · Computer Science 2023-05-19 Yue Yu , Yuchen Zhuang , Rongzhi Zhang , Yu Meng , Jiaming Shen , Chao Zhang

Large language models (LLMs) have been effectively used for many computer vision tasks, including image classification. In this paper, we present a simple yet effective approach for zero-shot image classification using multimodal LLMs.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Abdelrahman Abdelhamed , Mahmoud Afifi , Alec Go

Bilingual Lexicon Induction (BLI) is a core task in multilingual NLP that still, to a large extent, relies on calculating cross-lingual word representations. Inspired by the global paradigm shift in NLP towards Large Language Models (LLMs),…

Computation and Language · Computer Science 2024-02-27 Yaoyiran Li , Anna Korhonen , Ivan Vulić

Large Language Models (LLMs), originally developed for natural language processing (NLP), have demonstrated the potential to generalize across modalities and domains. With their in-context learning (ICL) capabilities, LLMs can perform…

Artificial Intelligence · Computer Science 2025-08-26 Nikolaos Pavlidis , Vasilis Perifanis , Symeon Symeonidis , Pavlos S. Efraimidis
‹ Prev 1 2 3 10 Next ›