Related papers: Bounded Memory Active Learning through Enriched Qu…

The Power of Comparisons for Actively Learning Linear Classifiers

In the world of big data, large but costly to label datasets dominate many fields. Active learning, a semi-supervised alternative to the standard PAC-learning model, was introduced to explore whether adaptive labeling could learn concepts…

Machine Learning · Computer Science 2020-06-02 Max Hopkins, Daniel M. Kane, Shachar Lovett

Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers

Active learning is the iterative construction of a classification model through targeted labeling, enabling significant labeling cost savings. As most research on active learning has been carried out before transformer-based language models…

Computation and Language · Computer Science 2022-03-22 Christopher Schröder, Andreas Niekler, Martin Potthast

Active and Adaptive Sequential learning

A framework is introduced for actively and adaptively solving a sequence of machine learning problems, which are changing in a bounded manner from one time step to the next. An algorithm is developed that actively queries the labels of the…

Machine Learning · Computer Science 2018-05-31 Yuheng Bu, Jiaxun Lu, Venugopal V. Veeravalli

Active Learning: Problem Settings and Recent Developments

In supervised learning, acquiring labeled training data for a predictive model can be very costly, but acquiring a large amount of unlabeled data is often quite easy. Active learning is a method of obtaining predictive models with high…

Machine Learning · Computer Science 2020-12-17 Hideitsu Hino

Query-augmented Active Metric Learning

In this paper we propose an active metric learning method for clustering with pairwise constraints. The proposed method actively queries the label of informative instance pairs, while estimating underlying metrics by incorporating unlabeled…

Machine Learning · Statistics 2021-11-10 Yujia Deng, Yubai Yuan, Haoda Fu, Annie Qu

Active metric learning and classification using similarity queries

Active learning is commonly used to train label-efficient models by adaptively selecting the most informative queries. However, most active learning strategies are designed to either learn a representation of the data (e.g., embedding or…

Machine Learning · Computer Science 2022-02-07 Namrata Nadagouda, Austin Xu, Mark A. Davenport

Noise-tolerant, Reliable Active Classification with Comparison Queries

With the explosion of massive, widely available unlabeled data in the past years, finding label and time efficient, robust learning algorithms has become ever more important in theory and in practice. We study the paradigm of active…

Machine Learning · Computer Science 2020-01-17 Max Hopkins, Daniel Kane, Shachar Lovett, Gaurav Mahajan

Many active learning and search approaches are intractable for large-scale industrial settings with billions of unlabeled examples. Existing approaches search globally for the optimal examples to label, scaling linearly or even…

Machine Learning · Computer Science 2021-07-23 Cody Coleman, Edward Chou, Julian Katz-Samuels, Sean Culatana, Peter Bailis, Alexander C. Berg, Robert Nowak, Roshan Sumbaly, Matei Zaharia, I. Zeki Yalniz

MaxiMin Active Learning in Overparameterized Model Classes

Generating labeled training datasets has become a major bottleneck in Machine Learning (ML) pipelines. Active ML aims to address this issue by designing learning algorithms that automatically and adaptively select the most informative…

Machine Learning · Computer Science 2020-04-29 Mina Karzand, Robert D. Nowak

Exploiting Diversity of Unlabeled Data for Label-Efficient Semi-Supervised Active Learning

The availability of large labeled datasets is the key component for the success of deep learning. However, annotating labels on large datasets is generally time-consuming and expensive. Active learning is a research area that addresses the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Felix Buchert, Nassir Navab, Seong Tae Kim

Active Learning from Positive and Unlabeled Data

During recent years, active learning has evolved into a popular paradigm for utilizing user's feedback to improve accuracy of learning algorithms. Active learning works by selecting the most informative sample among unlabeled data and…

Machine Learning · Computer Science 2016-11-17 Alireza Ghasemi, Hamid R. Rabiee, Mohsen Fadaee, Mohammad T. Manzuri, Mohammad H. Rohban

Improve Cost Efficiency of Active Learning over Noisy Dataset

Active learning is a learning strategy whereby the machine learning algorithm actively identifies and labels data points to optimize its learning. This strategy is particularly effective in domains where an abundance of unlabeled data…

Machine Learning · Computer Science 2024-03-05 Zan-Kai Chong, Hiroyuki Ohsaki, Bryan Ng

Active Learning Via Sequential Design and Uncertainty Sampling

Classification is an important task in many fields including biomedical research and machine learning. Traditionally, a classification rule is constructed based on a bunch of labeled data. Recently, due to technological innovation and…

Methodology · Statistics 2014-06-19 Jing Wang, Eunsik Park, Yuan-chin Ivan Chang

When Imbalance Comes Twice: Active Learning under Simulated Class Imbalance and Label Shift in Binary Semantic Segmentation

The aim of Active Learning is to select the most informative samples from an unlabelled set of data. This is useful in cases where the amount of data is large and labelling is expensive, such as in machine vision or medical imaging. Two…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Julien Combes, Alexandre Derville, Jean-François Coeurjolly

Boosting Active Learning for Speech Recognition with Noisy Pseudo-labeled Samples

The cost of annotating transcriptions for large speech corpora becomes a bottleneck to maximally enjoy the potential capacity of deep neural network-based automatic speech recognition models. In this paper, we present a new training…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-06 Jihwan Bang, Heesu Kim, YoungJoon Yoo, Jung-Woo Ha

Active Covering

We analyze the problem of active covering, where the learner is given an unlabeled dataset and can sequentially label query examples. The objective is to query the labels of all of the positive examples in the fewest number of total label queries.…

Machine Learning · Computer Science 2021-06-07 Heinrich Jiang, Afshin Rostamizadeh

Budget-constrained Active Learning to Effectively De-censor Survival Data

Standard supervised learners attempt to learn a model from a labeled dataset. Given a small set of labeled instances, and a pool of unlabeled instances, a budgeted learner can use its given budget to pay to acquire the labels of some…

Machine Learning · Computer Science 2025-10-15 Ali Parsaee, Bei Jiang, Zachary Friggstad, Russell Greiner

Active Learning for the Optimal Design of Multinomial Classification in Physics

Optimal design for model training is a critical topic in machine learning. Active Learning aims at obtaining improved models by querying samples with maximum uncertainty according to the estimation model for artificially labeling; this has…

Quantum Physics · Physics 2021-12-15 Yongcheng Ding, José D. Martín-Guerrero, Yujing Song, Rafael Magdalena-Benedito, Xi Chen

Deep Active Learning with Contrastive Learning Under Realistic Data Pool Assumptions

Active learning aims to identify the most informative data from an unlabeled data pool that enables a model to reach the desired accuracy rapidly. This benefits especially deep neural networks which generally require a huge number of…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Jihyo Kim, Jeonghyeon Kim, Sangheum Hwang

Active Robust Learning

In many practical applications of learning algorithms, unlabeled data is cheap and abundant whereas labeled data is expensive. Active learning algorithms were developed to achieve better performance with lower cost. Usually Representativeness…

Machine Learning · Computer Science 2016-08-26 Hossein Ghafarian, Hadi Sadoghi Yazdi