Related papers: Cost-minimising strategies for data labelling : op…

Hitting the Target: Stopping Active Learning at the Cost-Based Optimum

Active learning allows machine learning models to be trained using fewer labels while retaining similar performance to traditional supervised learning. An active learner selects the most informative data points, requests their labels, and…

Machine Learning · Computer Science 2023-11-22 Zac Pullar-Strecker, Katharina Dost, Eibe Frank, Jörg Wicker

Active Learning: Problem Settings and Recent Developments

In supervised learning, acquiring labeled training data for a predictive model can be very costly, but acquiring a large amount of unlabeled data is often quite easy. Active learning is a method of obtaining predictive models with high…

Machine Learning · Computer Science 2020-12-17 Hideitsu Hino

Consistency-based Semi-supervised Active Learning: Towards Minimizing Labeling Cost

Active learning (AL) combines data labeling and model training to minimize the labeling cost by prioritizing the selection of high value data that can best improve model performance. In pool-based active learning, accessible unlabeled data…

Machine Learning · Computer Science 2020-07-21 Mingfei Gao, Zizhao Zhang, Guo Yu, Sercan O. Arik, Larry S. Davis, Tomas Pfister

Reducing Label Effort: Self-Supervised meets Active Learning

Active learning is a paradigm aimed at reducing the annotation effort by training the model on actively selected informative and/or representative samples. Another paradigm to reduce the annotation effort is self-training that learns from a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Javad Zolfaghari Bengar, Joost van de Weijer, Bartlomiej Twardowski, Bogdan Raducanu

Active clustering for labeling training data

Gathering training data is a key step of any supervised learning task, and it is both critical and expensive. Critical, because the quantity and quality of the training data has a high impact on the performance of the learned function.…

Data Structures and Algorithms · Computer Science 2021-10-28 Quentin Lutz, Élie de Panafieu, Alex Scott, Maya Stein

Semi-supervised Batch Active Learning via Bilevel Optimization

Active learning is an effective technique for reducing the labeling cost by improving data efficiency. In this work, we propose a novel batch acquisition strategy for active learning in the setting where the model training is performed in a…

Machine Learning · Computer Science 2020-10-20 Zalán Borsos, Marco Tagliasacchi, Andreas Krause

Cost-Accuracy Aware Adaptive Labeling for Active Learning

Conventional active learning algorithms assume a single labeler that produces noiseless labels at a given, fixed cost, and aim to achieve the best generalization performance for a given classifier under a budget constraint. However, in many…

Machine Learning · Computer Science 2021-05-25 Ruijiang Gao, Maytal Saar-tsechansky

Not All Labels Are Equal: Rationalizing The Labeling Costs for Training Object Detection

Deep neural networks have reached high accuracy on object detection but their success hinges on large amounts of labeled data. To reduce the labels dependency, various active learning strategies have been proposed, typically based on the…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Ismail Elezi, Zhiding Yu, Anima Anandkumar, Laura Leal-Taixe, Jose M. Alvarez

Active Testing: Sample-Efficient Model Evaluation

We introduce a new framework for sample-efficient model evaluation that we call active testing. While approaches like active learning reduce the number of labels needed for model training, existing literature largely ignores the cost of…

Machine Learning · Statistics 2021-06-15 Jannik Kossen, Sebastian Farquhar, Yarin Gal, Tom Rainforth

Active learning for data streams: a survey

Online active learning is a paradigm in machine learning that aims to select the most informative data points to label from a data stream. The problem of minimizing the cost associated with collecting labeled observations has gained a lot…

Machine Learning · Statistics 2023-12-01 Davide Cacciarelli, Murat Kulahci

Rényi Entropy Bounds on the Active Learning Cost-Performance Tradeoff

Semi-supervised classification, one of the most prominent fields in machine learning, studies how to combine the statistical knowledge of the often abundant unlabeled data with the often limited labeled data in order to maximize overall…

Information Theory · Computer Science 2020-02-07 Vahid Jamali, Antonia Tulino, Jaime Llorca, Elza Erkip

Active Learning Via Sequential Design and Uncertainty Sampling

Classification is an important task in many fields including biomedical research and machine learning. Traditionally, a classification rule is constructed based on a bunch of labeled data. Recently, due to technological innovation and…

Methodology · Statistics 2014-06-19 Jing Wang, Eunsik Park, Yuan-chin Ivan Chang

Semi-supervised Active Regression

Labelled data often comes at a high cost as it may require recruiting human labelers or running costly experiments. At the same time, in many practical scenarios, one already has access to a partially labelled, potentially biased dataset…

Machine Learning · Computer Science 2021-06-15 Fnu Devvrit, Nived Rajaraman, Pranjal Awasthi

Improve Cost Efficiency of Active Learning over Noisy Dataset

Active learning is a learning strategy whereby the machine learning algorithm actively identifies and labels data points to optimize its learning. This strategy is particularly effective in domains where an abundance of unlabeled data…

Machine Learning · Computer Science 2024-03-05 Zan-Kai Chong, Hiroyuki Ohsaki, Bryan Ng

Targeted Active Learning for Bayesian Decision-Making

Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the accuracy is not the end goal when the results are used for…

Machine Learning · Statistics 2021-10-22 Louis Filstroff, Iiris Sundin, Petrus Mikkola, Aleksei Tiulpin, Juuso Kylmäoja, Samuel Kaski

Limitations of Assessing Active Learning Performance at Runtime

Classification algorithms aim to predict an unknown label (e.g., a quality class) for a new instance (e.g., a product). Therefore, training samples (instances and labels) are used to deduce classification hypotheses. Often, it is relatively…

Machine Learning · Computer Science 2019-01-30 Daniel Kottke, Jim Schellinger, Denis Huseljic, Bernhard Sick

Deep Bayesian Active Semi-Supervised Learning

In many applications the process of generating label information is expensive and time consuming. We present a new method that combines active and semi-supervised deep learning to achieve high generalization performance from a deep…

Machine Learning · Computer Science 2018-03-06 Matthias Rottmann, Karsten Kahl, Hanno Gottschalk

Compute-Efficient Active Learning

Active learning, a powerful paradigm in machine learning, aims at reducing labeling costs by selecting the most informative samples from an unlabeled dataset. However, the traditional active learning process often demands extensive…

Machine Learning · Computer Science 2024-01-17 Gábor Németh, Tamás Matuszka

Active Learning For Contextual Linear Optimization: A Margin-Based Approach

We develop the first active learning method for contextual linear optimization. Specifically, we introduce a label acquisition algorithm that sequentially decides whether to request the "labels" of feature samples from an unlabeled data…

Machine Learning · Computer Science 2025-01-31 Mo Liu, Paul Grigas, Heyuan Liu, Zuo-Jun Max Shen

How to Purchase Labels? A Cost-Effective Approach Using Active Learning Markets

We introduce and analyse active learning markets as a way to purchase labels, in situations where analysts aim to acquire additional data to improve model fitting, or to better train models for predictive analytics applications. This comes…

Machine Learning · Computer Science 2026-02-11 Xiwen Huang, Pierre Pinson