Related papers: Average-Case Active Learning with Costs

Submodular Learning and Covering with Response-Dependent Costs

We consider interactive learning and covering problems, in a setting where actions may incur different costs, depending on the response to the action. We propose a natural greedy algorithm for response-dependent costs. We bound the…

Machine Learning · Computer Science 2018-11-21 Sivan Sabato

Greedy Active Learning Algorithm for Logistic Regression Models

We study a logistic model-based active learning procedure for binary classification problems, in which we adopt a batch subject selection strategy with a modified sequential experimental design method. Moreover, accompanying the proposed…

Machine Learning · Statistics 2018-02-02 Hsiang-Ling Hsu , Yuan-Chin Ivan Chang , Ray-Bing Chen

Cost-Accuracy Aware Adaptive Labeling for Active Learning

Conventional active learning algorithms assume a single labeler that produces noiseless label at a given, fixed cost, and aim to achieve the best generalization performance for given classifier under a budget constraint. However, in many…

Machine Learning · Computer Science 2021-05-25 Ruijiang Gao , Maytal Saar-tsechansky

Bayesian Active Learning With Abstention Feedbacks

We study pool-based active learning with abstention feedbacks where a labeler can abstain from labeling a queried example with some unknown abstention rate. This is an important problem with many useful applications. We take a Bayesian…

Machine Learning · Computer Science 2021-01-01 Cuong V. Nguyen , Lam Si Tung Ho , Huan Xu , Vu Dinh , Binh Nguyen

Fast Rates in Pool-Based Batch Active Learning

We consider a batch active learning scenario where the learner adaptively issues batches of points to a labeling oracle. Sampling labels in batches is highly desirable in practice due to the smaller number of interactive rounds with the…

Machine Learning · Computer Science 2022-06-14 Claudio Gentile , Zhilei Wang , Tong Zhang

The Cost of Replicability in Active Learning

Active learning aims to reduce the number of labeled data points required by machine learning algorithms by selectively querying labels from initially unlabeled data. Ensuring replicability, where an algorithm produces consistent outcomes…

Machine Learning · Computer Science 2026-03-24 Rupkatha Hira , Dominik Kau , Jessica Sorrell

Batch Active Learning at Scale

The ability to train complex and highly effective models often requires an abundance of training data, which can easily become a bottleneck in cost, time, and computational resources. Batch active learning, which adaptively issues batched…

Machine Learning · Computer Science 2021-08-02 Gui Citovsky , Giulia DeSalvo , Claudio Gentile , Lazaros Karydas , Anand Rajagopalan , Afshin Rostamizadeh , Sanjiv Kumar

Bayesian Pool-based Active Learning With Abstention Feedbacks

We study pool-based active learning with abstention feedbacks, where a labeler can abstain from labeling a queried example with some unknown abstention rate. This is an important problem with many useful applications. We take a Bayesian…

Machine Learning · Statistics 2021-01-05 Cuong V. Nguyen , Lam Si Tung Ho , Huan Xu , Vu Dinh , Binh Nguyen

Batch Active Preference-Based Learning of Reward Functions

Data generation and labeling are usually an expensive part of learning for robotics. While active learning methods are commonly used to tackle the former problem, preference-based learning is a concept that attempts to solve the latter by…

Machine Learning · Computer Science 2018-10-11 Erdem Bıyık , Dorsa Sadigh

Bayesian Batch Active Learning as Sparse Subset Approximation

Leveraging the wealth of unlabeled data produced in recent years provides great potential for improving supervised models. When the cost of acquiring labels is high, probabilistic active learning methods can be used to greedily select the…

Machine Learning · Statistics 2021-02-09 Robert Pinsler , Jonathan Gordon , Eric Nalisnick , José Miguel Hernández-Lobato

Auditing: Active Learning with Outcome-Dependent Query Costs

We propose a learning setting in which unlabeled data is free, and the cost of a label depends on its value, which is not known in advance. We study binary classification in an extreme case, where the algorithm only pays for negative…

Machine Learning · Computer Science 2015-07-14 Sivan Sabato , Anand D. Sarwate , Nathan Srebro

Query Learning with Exponential Query Costs

In query learning, the goal is to identify an unknown object while minimizing the number of "yes" or "no" questions (queries) posed about that object. A well-studied algorithm for query learning is known as generalized binary search (GBS).…

Machine Learning · Statistics 2010-02-23 Gowtham Bellala , Suresh Bhavnani , Clayton Scott

Active Learning for Cost-Sensitive Classification

We design an active learning algorithm for cost-sensitive multiclass classification: problems where different errors have different costs. Our algorithm, COAL, makes predictions by regressing to each label's cost and predicting the…

Machine Learning · Computer Science 2021-10-13 Akshay Krishnamurthy , Alekh Agarwal , Tzu-Kuo Huang , Hal Daume , John Langford

Cost-aware Pre-training for Multiclass Cost-sensitive Deep Learning

Deep learning has been one of the most prominent machine learning techniques nowadays, being the state-of-the-art on a broad range of applications where automatic feature extraction is needed. Many such applications also demand varying…

Machine Learning · Computer Science 2016-05-25 Yu-An Chung , Hsuan-Tien Lin , Shao-Wen Yang

Active Robust Learning

In many practical applications of learning algorithms, unlabeled data is cheap and abundant whereas labeled data is expensive. Active learning algorithms developed to achieve better performance with lower cost. Usually Representativeness…

Machine Learning · Computer Science 2016-08-26 Hossein Ghafarian , Hadi Sadoghi Yazdi

Illustrating the Suitability of Greedy and Dynamic Algorithms Using The Economics Concept of "Opportunity Cost"

Students of Computer Science often wonder when, exactly, one can apply a greedy algorithm to a problem, and when one must use the more complicated and time-consuming techniques of dynamic programming. This paper argues that the existing…

Data Structures and Algorithms · Computer Science 2020-11-20 Eugene Callahan , Robert Murphy , Anas Elghafari

Nearly Optimal Active Preference Learning and Its Application to LLM Alignment

Aligning large language models (LLMs) depends on high-quality datasets of human preference labels, which are costly to collect. Although active learning has been studied to improve sample efficiency relative to passive collection, many…

Machine Learning · Computer Science 2026-02-03 Yao Zhao , Kwang-Sung Jun

Improve Cost Efficiency of Active Learning over Noisy Dataset

Active learning is a learning strategy whereby the machine learning algorithm actively identifies and labels data points to optimize its learning. This strategy is particularly effective in domains where an abundance of unlabeled data…

Machine Learning · Computer Science 2024-03-05 Zan-Kai Chong , Hiroyuki Ohsaki , Bryan Ng

When does Active Learning Work?

Active Learning (AL) methods seek to improve classifier performance when labels are expensive or scarce. We consider two central questions: Where does AL work? How much does it help? To address these questions, a comprehensive experimental…

Machine Learning · Statistics 2014-08-07 Lewis Evans , Niall M. Adams , Christoforos Anagnostopoulos

Cost-Based Budget Active Learning for Deep Learning

Majorly classical Active Learning (AL) approach usually uses statistical theory such as entropy and margin to measure instance utility, however it fails to capture the data distribution information contained in the unlabeled data. This can…

Machine Learning · Computer Science 2020-12-10 Patrick K. Gikunda , Nicolas Jouandeau