Related papers: Select, Label, Evaluate: Active Testing in NLP

Active Learning for NLP with Large Language Models

Human annotation of training samples is expensive, laborious, and sometimes challenging, especially for Natural Language Processing (NLP) tasks. To reduce the labeling cost and enhance the sample efficiency, Active Learning (AL) technique…

Computation and Language · Computer Science 2024-01-17 Xuesong Wang

Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems

Data annotated by humans is a source of knowledge by describing the peculiarities of the problem and therefore fueling the decision process of the trained model. Unfortunately, the annotation process for subjective natural language…

Computation and Language · Computer Science 2023-12-14 Kamil Kanclerz , Julita Bielaniewicz , Marcin Gruza , Jan Kocon , Stanisław Woźniak , Przemysław Kazienko

Cold-start Active Learning through Self-supervised Language Modeling

Active learning strives to reduce annotation costs by choosing the most critical examples to label. Typically, the active learning strategy is contingent on the classification model. For instance, uncertainty sampling depends on poorly…

Computation and Language · Computer Science 2020-10-26 Michelle Yuan , Hsuan-Tien Lin , Jordan Boyd-Graber

Active Learning for Natural Language Generation

The field of Natural Language Generation (NLG) suffers from a severe shortage of labeled data due to the extremely expensive and time-consuming process involved in manual annotation. A natural approach for coping with this problem is active…

Computation and Language · Computer Science 2023-10-18 Yotam Perlitz , Ariel Gera , Michal Shmueli-Scheuer , Dafna Sheinwald , Noam Slonim , Liat Ein-Dor

Reassessing Active Learning Adoption in Contemporary NLP: A Community Survey

Supervised learning relies on data annotation which usually is time-consuming and therefore expensive. A longstanding strategy to reduce annotation costs is active learning, an iterative process, in which a human annotates only data…

Computation and Language · Computer Science 2026-02-03 Julia Romberg , Christopher Schröder , Julius Gonsior , Katrin Tomanek , Fredrik Olsson

Active Testing: Sample-Efficient Model Evaluation

We introduce a new framework for sample-efficient model evaluation that we call active testing. While approaches like active learning reduce the number of labels needed for model training, existing literature largely ignores the cost of…

Machine Learning · Statistics 2021-06-15 Jannik Kossen , Sebastian Farquhar , Yarin Gal , Tom Rainforth

On the Limitations of Simulating Active Learning

Active learning (AL) is a human-and-model-in-the-loop paradigm that iteratively selects informative unlabeled data for human annotation, aiming to improve over random sampling. However, performing AL experiments with human annotations…

Machine Learning · Computer Science 2023-05-24 Katerina Margatina , Nikolaos Aletras

Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation

In the context of text classification, the financial burden of annotation exercises for creating training data is a critical issue. Active learning techniques, particularly those rooted in uncertainty sampling, offer a cost-effective…

Computation and Language · Computer Science 2024-06-19 Hamidreza Rouzegar , Masoud Makrehchi

Survey of Active Learning Hyperparameters: Insights from a Large-Scale Experimental Grid

Annotating data is a time-consuming and costly task, but it is inherently required for supervised machine learning. Active Learning (AL) is an established method that minimizes human labeling effort by iteratively selecting the most…

Machine Learning · Computer Science 2025-06-05 Julius Gonsior , Tim Rieß , Anja Reusch , Claudio Hartmann , Maik Thiele , Wolfgang Lehner

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

Supervised finetuning (SFT) on instruction datasets has played a crucial role in achieving the remarkable zero-shot generalization capabilities observed in modern large language models (LLMs). However, the annotation efforts required to…

Computation and Language · Computer Science 2024-07-09 Gantavya Bhatt , Yifang Chen , Arnav M. Das , Jifan Zhang , Sang T. Truong , Stephen Mussmann , Yinglun Zhu , Jeffrey Bilmes , Simon S. Du , Kevin Jamieson , Jordan T. Ash , Robert D. Nowak

Effortless Active Labeling for Long-Term Test-Time Adaptation

Long-term test-time adaptation (TTA) is a challenging task due to error accumulation. Recent approaches tackle this issue by actively labeling a small proportion of samples in each batch, yet the annotation burden quickly grows as the batch…

Computer Vision and Pattern Recognition · Computer Science 2025-03-20 Guowei Wang , Changxing Ding

Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and…

Computation and Language · Computer Science 2021-04-06 Rishi Hazra , Parag Dutta , Shubham Gupta , Mohammed Abdul Qaathir , Ambedkar Dukkipati

Annotator-Centric Active Learning for Subjective NLP Tasks

Active Learning (AL) addresses the high costs of collecting human annotations by strategically annotating the most informative samples. However, for subjective NLP tasks, incorporating a wide range of perspectives in the annotation process…

Computation and Language · Computer Science 2024-10-24 Michiel van der Meer , Neele Falk , Pradeep K. Murukannaiah , Enrico Liscio

Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and…

Machine Learning · Computer Science 2021-04-08 Rishi Hazra , Parag Dutta , Shubham Gupta , Mohammed Abdul Qaathir , Ambedkar Dukkipati

Active Testing of Large Language Models via Approximate Neyman Allocation

Large language models (LLMs) require reliable evaluation from pre-training to test-time scaling, making evaluation a recurring rather than one-off cost. As model scales grow and target tasks increasingly demand expert annotators, both the…

Artificial Intelligence · Computer Science 2026-05-20 Zeli Liu , Jiancheng Zhang , Cong Liu , Yinglun Zhu

FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

Collecting high-quality labeled data for model training is notoriously time-consuming and labor-intensive for various NLP tasks. While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context…

Computation and Language · Computer Science 2023-11-28 Ruixuan Xiao , Yiwen Dong , Junbo Zhao , Runze Wu , Minmin Lin , Gang Chen , Haobo Wang

ALE: A Simulation-Based Active Learning Evaluation Framework for the Parameter-Driven Comparison of Query Strategies for NLP

Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual, and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data…

Computation and Language · Computer Science 2023-08-08 Philipp Kohl , Nils Freyer , Yoka Krämer , Henri Werth , Steffen Wolf , Bodo Kraft , Matthias Meinecke , Albert Zündorf

To Label or Not to Label: PALM -- A Predictive Model for Evaluating Sample Efficiency in Active Learning Models

Active learning (AL) seeks to reduce annotation costs by selecting the most informative samples for labeling, making it particularly valuable in resource-constrained settings. However, traditional evaluation methods, which focus solely on…

Machine Learning · Computer Science 2025-07-22 Julia Machnio , Mads Nielsen , Mostafa Mehdipour Ghazi

Revisiting Sample Size Determination in Natural Language Understanding

Knowing exactly how many data points need to be labeled to achieve a certain model performance is a hugely beneficial step towards reducing the overall budgets for annotation. It pertains to both active learning and traditional data…

Computation and Language · Computer Science 2023-07-04 Ernie Chang , Muhammad Hassan Rashid , Pin-Jie Lin , Changsheng Zhao , Vera Demberg , Yangyang Shi , Vikas Chandra

Hands-On Tutorial: Labeling with LLM and Human-in-the-Loop

Training and deploying machine learning models relies on a large amount of human-annotated data. As human labeling becomes increasingly expensive and time-consuming, recent research has developed multiple strategies to speed up annotation…

Computation and Language · Computer Science 2025-01-28 Ekaterina Artemova , Akim Tsvigun , Dominik Schlechtweg , Natalia Fedorova , Konstantin Chernyshev , Sergei Tilga , Boris Obmoroshev