Related papers: Active Model Selection for Large Language Models

Large Language Model Selection with Limited Annotations

Choosing a Large Language Model (LLM) for a given task requires comparing many strong candidates, yet standard evaluation relies on costly annotations over fixed evaluation sets. To address this challenge, we develop SELECT-LLM, the first…

Computation and Language · Computer Science 2026-05-26 Yavuz Durmazkeser , Patrik Okanovic , Andreas Kirsch , Torsten Hoefler , Nezihe Merve Gürel

Enhancing Text Classification through LLM-Driven Active Learning and Human Annotation

In the context of text classification, the financial burden of annotation exercises for creating training data is a critical issue. Active learning techniques, particularly those rooted in uncertainty sampling, offer a cost-effective…

Computation and Language · Computer Science 2024-06-19 Hamidreza Rouzegar , Masoud Makrehchi

From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling

Although the annotation paradigm based on Large Language Models (LLMs) has made significant breakthroughs in recent years, its actual deployment still has two core bottlenecks: first, the cost of calling commercial APIs in large-scale…

Computation and Language · Computer Science 2025-06-23 Yao Lu , Zhaiyuan Ji , Jiawei Du , Yu Shanqing , Qi Xuan , Tianyi Zhou

Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection

Large Language Models (LLMs) can adapt to new tasks via in-context learning (ICL). ICL is efficient as it does not require any parameter updates to the trained LLM, but only few annotated examples as input for the LLM. In this work, we…

Computation and Language · Computer Science 2023-11-01 Costas Mavromatis , Balasubramaniam Srinivasan , Zhengyuan Shen , Jiani Zhang , Huzefa Rangwala , Christos Faloutsos , George Karypis

Active Learning for NLP with Large Language Models

Human annotation of training samples is expensive, laborious, and sometimes challenging, especially for Natural Language Processing (NLP) tasks. To reduce the labeling cost and enhance the sample efficiency, Active Learning (AL) technique…

Computation and Language · Computer Science 2024-01-17 Xuesong Wang

SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models

Large language models (LLMs) have been widely adopted due to their remarkable performance across various applications, driving the accelerated development of a large number of diverse models. However, these individual LLMs show limitations…

Computation and Language · Computer Science 2025-06-13 Kaushal Kumar Maurya , KV Aditya Srivatsa , Ekaterina Kochmar

Large Language Models as Annotators: Enhancing Generalization of NLP Models at Minimal Cost

State-of-the-art supervised NLP models achieve high accuracy but are also susceptible to failures on inputs from low-data regimes, such as domains that are not represented in training data. As an approximation to collecting ground-truth…

Computation and Language · Computer Science 2023-06-29 Parikshit Bansal , Amit Sharma

FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

Collecting high-quality labeled data for model training is notoriously time-consuming and labor-intensive for various NLP tasks. While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context…

Computation and Language · Computer Science 2023-11-28 Ruixuan Xiao , Yiwen Dong , Junbo Zhao , Runze Wu , Minmin Lin , Gang Chen , Haobo Wang

LLMs in the Loop: Leveraging Large Language Model Annotations for Active Learning in Low-Resource Languages

Low-resource languages face significant barriers in AI development due to limited linguistic resources and expertise for data labeling, rendering them rare and costly. The scarcity of data and the absence of preexisting tools exacerbate…

Computation and Language · Computer Science 2024-06-25 Nataliia Kholodna , Sahib Julka , Mohammad Khodadadi , Muhammed Nurullah Gumus , Michael Granitzer

From Selection to Generation: A Survey of LLM-based Active Learning

Active Learning (AL) has been a powerful paradigm for improving model efficiency and performance by selecting the most informative data points for labeling and training. In recent active learning frameworks, Large Language Models (LLMs)…

Machine Learning · Computer Science 2025-06-03 Yu Xia , Subhojyoti Mukherjee , Zhouhang Xie , Junda Wu , Xintong Li , Ryan Aponte , Hanjia Lyu , Joe Barrow , Hongjie Chen , Franck Dernoncourt , Branislav Kveton , Tong Yu , Ruiyi Zhang , Jiuxiang Gu , Nesreen K. Ahmed , Yu Wang , Xiang Chen , Hanieh Deilamsalehy , Sungchul Kim , Zhengmian Hu , Yue Zhao , Nedim Lipka , Seunghyun Yoon , Ting-Hao Kenneth Huang , Zichao Wang , Puneet Mathur , Soumyabrata Pal , Koyel Mukherjee , Zhehao Zhang , Namyong Park , Thien Huu Nguyen , Jiebo Luo , Ryan A. Rossi , Julian McAuley

Cost-Aware Model Orchestration for LLM-based Systems

As modern artificial intelligence (AI) systems become more advanced and capable, they can leverage a wide range of tools and models to perform complex tasks. The task of orchestrating these models is increasingly performed by Large Language…

Artificial Intelligence · Computer Science 2026-04-20 Daria Smirnova , Hamid Nasiri , Marta Adamska , Zhengxin Yu , Peter Garraghan

OptLLM: Optimal Assignment of Queries to Large Language Models

Large Language Models (LLMs) have garnered considerable attention owing to their remarkable capabilities, leading to an increasing number of companies offering LLMs as services. Different LLMs achieve different performance at different…

Software Engineering · Computer Science 2024-05-27 Yueyue Liu , Hongyu Zhang , Yuantian Miao , Van-Hoang Le , Zhiqiang Li

AcTracer: Active Testing of Large Language Model via Multi-Stage Sampling

Performance evaluation plays a crucial role in the development life cycle of large language models (LLMs). It estimates the model's capability, elucidates behavior characteristics, and facilitates the identification of potential issues and…

Software Engineering · Computer Science 2025-06-12 Yuheng Huang , Jiayang Song , Qiang Hu , Felix Juefei-Xu , Lei Ma

The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection

High annotation costs from hiring or crowdsourcing complicate the creation of large, high-quality datasets needed for training reliable text classifiers. Recent research suggests using Large Language Models (LLMs) to automate the annotation…

Computation and Language · Computer Science 2025-01-27 Tomas Horych , Christoph Mandl , Terry Ruas , Andre Greiner-Petter , Bela Gipp , Akiko Aizawa , Timo Spinde

Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data

Machine learning-based classifiers have been used for text classification, such as sentiment analysis, news classification, and toxic comment classification. However, supervised machine learning models often require large amounts of labeled…

Computation and Language · Computer Science 2025-05-06 Yejian Zhang , Shingo Takada

Large Language Models as Financial Data Annotators: A Study on Effectiveness and Efficiency

Collecting labeled datasets in finance is challenging due to scarcity of domain experts and higher cost of employing them. While Large Language Models (LLMs) have demonstrated remarkable performance in data annotation tasks on general…

Computation and Language · Computer Science 2024-03-28 Toyin Aguda , Suchetha Siddagangappa , Elena Kochkina , Simerjot Kaur , Dongsheng Wang , Charese Smiley , Sameena Shah

Annotating Data for Fine-Tuning a Neural Ranker? Current Active Learning Strategies are not Better than Random Selection

Search methods based on Pretrained Language Models (PLM) have demonstrated great effectiveness gains compared to statistical and early neural ranking models. However, fine-tuning PLM-based rankers requires a great amount of annotated…

Information Retrieval · Computer Science 2023-09-13 Sophia Althammer , Guido Zuccon , Sebastian Hofstätter , Suzan Verberne , Allan Hanbury

Evaluating Large Language Models as Expert Annotators

Textual data annotation, the process of labeling or tagging text with relevant information, is typically costly, time-consuming, and labor-intensive. While large language models (LLMs) have demonstrated their potential as direct…

Computation and Language · Computer Science 2025-08-12 Yu-Min Tseng , Wei-Lin Chen , Chung-Chi Chen , Hsin-Hsi Chen

Next Generation Active Learning: Mixture of LLMs in the Loop

With the rapid advancement and strong generalization capabilities of large language models (LLMs), they have been increasingly incorporated into the active learning pipelines as annotators to reduce annotation costs. However, considering…

Machine Learning · Computer Science 2026-01-23 Yuanyuan Qi , Xiaohao Yang , Jueqing Lu , Guoxiang Guo , Joanne Enticott , Gang Liu , Lan Du

Efficient Sequential Decision Making with Large Language Models

This paper focuses on extending the success of large language models (LLMs) to sequential decision making. Existing efforts either (i) re-train or finetune LLMs for decision making, or (ii) design prompts for pretrained LLMs. The former…

Machine Learning · Computer Science 2025-06-17 Dingyang Chen , Qi Zhang , Yinglun Zhu