English
Related papers

Related papers: Active Model Selection for Large Language Models

200 papers

Choosing a Large Language Model (LLM) for a given task requires comparing many strong candidates, yet standard evaluation relies on costly annotations over fixed evaluation sets. To address this challenge, we develop SELECT-LLM, the first…

Computation and Language · Computer Science 2026-05-26 Yavuz Durmazkeser , Patrik Okanovic , Andreas Kirsch , Torsten Hoefler , Nezihe Merve Gürel

In the context of text classification, the financial burden of annotation exercises for creating training data is a critical issue. Active learning techniques, particularly those rooted in uncertainty sampling, offer a cost-effective…

Computation and Language · Computer Science 2024-06-19 Hamidreza Rouzegar , Masoud Makrehchi

Although the annotation paradigm based on Large Language Models (LLMs) has made significant breakthroughs in recent years, its actual deployment still has two core bottlenecks: first, the cost of calling commercial APIs in large-scale…

Computation and Language · Computer Science 2025-06-23 Yao Lu , Zhaiyuan Ji , Jiawei Du , Yu Shanqing , Qi Xuan , Tianyi Zhou

Large Language Models (LLMs) can adapt to new tasks via in-context learning (ICL). ICL is efficient as it does not require any parameter updates to the trained LLM, but only few annotated examples as input for the LLM. In this work, we…

Human annotation of training samples is expensive, laborious, and sometimes challenging, especially for Natural Language Processing (NLP) tasks. To reduce the labeling cost and enhance the sample efficiency, Active Learning (AL) technique…

Computation and Language · Computer Science 2024-01-17 Xuesong Wang

Large language models (LLMs) have been widely adopted due to their remarkable performance across various applications, driving the accelerated development of a large number of diverse models. However, these individual LLMs show limitations…

Computation and Language · Computer Science 2025-06-13 Kaushal Kumar Maurya , KV Aditya Srivatsa , Ekaterina Kochmar

State-of-the-art supervised NLP models achieve high accuracy but are also susceptible to failures on inputs from low-data regimes, such as domains that are not represented in training data. As an approximation to collecting ground-truth…

Computation and Language · Computer Science 2023-06-29 Parikshit Bansal , Amit Sharma

Collecting high-quality labeled data for model training is notoriously time-consuming and labor-intensive for various NLP tasks. While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context…

Computation and Language · Computer Science 2023-11-28 Ruixuan Xiao , Yiwen Dong , Junbo Zhao , Runze Wu , Minmin Lin , Gang Chen , Haobo Wang

Low-resource languages face significant barriers in AI development due to limited linguistic resources and expertise for data labeling, rendering them rare and costly. The scarcity of data and the absence of preexisting tools exacerbate…

Computation and Language · Computer Science 2024-06-25 Nataliia Kholodna , Sahib Julka , Mohammad Khodadadi , Muhammed Nurullah Gumus , Michael Granitzer

Active Learning (AL) has been a powerful paradigm for improving model efficiency and performance by selecting the most informative data points for labeling and training. In recent active learning frameworks, Large Language Models (LLMs)…

As modern artificial intelligence (AI) systems become more advanced and capable, they can leverage a wide range of tools and models to perform complex tasks. The task of orchestrating these models is increasingly performed by Large Language…

Artificial Intelligence · Computer Science 2026-04-20 Daria Smirnova , Hamid Nasiri , Marta Adamska , Zhengxin Yu , Peter Garraghan

Large Language Models (LLMs) have garnered considerable attention owing to their remarkable capabilities, leading to an increasing number of companies offering LLMs as services. Different LLMs achieve different performance at different…

Software Engineering · Computer Science 2024-05-27 Yueyue Liu , Hongyu Zhang , Yuantian Miao , Van-Hoang Le , Zhiqiang Li

Performance evaluation plays a crucial role in the development life cycle of large language models (LLMs). It estimates the model's capability, elucidates behavior characteristics, and facilitates the identification of potential issues and…

Software Engineering · Computer Science 2025-06-12 Yuheng Huang , Jiayang Song , Qiang Hu , Felix Juefei-Xu , Lei Ma

High annotation costs from hiring or crowdsourcing complicate the creation of large, high-quality datasets needed for training reliable text classifiers. Recent research suggests using Large Language Models (LLMs) to automate the annotation…

Computation and Language · Computer Science 2025-01-27 Tomas Horych , Christoph Mandl , Terry Ruas , Andre Greiner-Petter , Bela Gipp , Akiko Aizawa , Timo Spinde

Machine learning-based classifiers have been used for text classification, such as sentiment analysis, news classification, and toxic comment classification. However, supervised machine learning models often require large amounts of labeled…

Computation and Language · Computer Science 2025-05-06 Yejian Zhang , Shingo Takada

Collecting labeled datasets in finance is challenging due to scarcity of domain experts and higher cost of employing them. While Large Language Models (LLMs) have demonstrated remarkable performance in data annotation tasks on general…

Computation and Language · Computer Science 2024-03-28 Toyin Aguda , Suchetha Siddagangappa , Elena Kochkina , Simerjot Kaur , Dongsheng Wang , Charese Smiley , Sameena Shah

Search methods based on Pretrained Language Models (PLM) have demonstrated great effectiveness gains compared to statistical and early neural ranking models. However, fine-tuning PLM-based rankers requires a great amount of annotated…

Information Retrieval · Computer Science 2023-09-13 Sophia Althammer , Guido Zuccon , Sebastian Hofstätter , Suzan Verberne , Allan Hanbury

Textual data annotation, the process of labeling or tagging text with relevant information, is typically costly, time-consuming, and labor-intensive. While large language models (LLMs) have demonstrated their potential as direct…

Computation and Language · Computer Science 2025-08-12 Yu-Min Tseng , Wei-Lin Chen , Chung-Chi Chen , Hsin-Hsi Chen

With the rapid advancement and strong generalization capabilities of large language models (LLMs), they have been increasingly incorporated into the active learning pipelines as annotators to reduce annotation costs. However, considering…

Machine Learning · Computer Science 2026-01-23 Yuanyuan Qi , Xiaohao Yang , Jueqing Lu , Guoxiang Guo , Joanne Enticott , Gang Liu , Lan Du

This paper focuses on extending the success of large language models (LLMs) to sequential decision making. Existing efforts either (i) re-train or finetune LLMs for decision making, or (ii) design prompts for pretrained LLMs. The former…

Machine Learning · Computer Science 2025-06-17 Dingyang Chen , Qi Zhang , Yinglun Zhu
‹ Prev 1 2 3 10 Next ›