Related papers: Active Learning under Label Shift

When Imbalance Comes Twice: Active Learning under Simulated Class Imbalance and Label Shift in Binary Semantic Segmentation

The aim of Active Learning is to select the most informative samples from an unlabelled set of data. This is useful in cases where the amount of data is large and labelling is expensive, such as in machine vision or medical imaging. Two…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Julien Combes , Alexandre Derville , Jean-François Coeurjolly

Class-Balanced Active Learning for Image Classification

Active learning aims to reduce the labeling effort that is required to train algorithms by learning an acquisition function selecting the most relevant data for which a label should be requested from a large unlabeled data pool. Active…

Computer Vision and Pattern Recognition · Computer Science 2021-10-12 Javad Zolfaghari Bengar , Joost van de Weijer , Laura Lopez Fuentes , Bogdan Raducanu

When Deep Learners Change Their Mind: Learning Dynamics for Active Learning

Active learning aims to select samples to be annotated that yield the largest performance improvement for the learning algorithm. Many methods approach this problem by measuring the informativeness of samples and do this based on the…

Machine Learning · Computer Science 2021-08-02 Javad Zolfaghari Bengar , Bogdan Raducanu , Joost van de Weijer

Deep Active Learning for Sequence Labeling Based on Diversity and Uncertainty in Gradient

Recently, several studies have investigated active learning (AL) for natural language processing tasks to alleviate data dependency. However, for query selection, most of these studies mainly rely on uncertainty-based sampling, which…

Computation and Language · Computer Science 2020-11-30 Yekyung Kim

Importance Weighted Active Learning

We present a practical and statistically consistent scheme for actively learning binary classifiers under general loss functions. Our algorithm uses importance weighting to correct sampling bias, and by controlling the variance, we are able…

Machine Learning · Computer Science 2009-05-20 Alina Beygelzimer , Sanjoy Dasgupta , John Langford

Minority Class Oriented Active Learning for Imbalanced Datasets

Active learning aims to optimize the dataset annotation process when resources are constrained. Most existing methods are designed for balanced datasets. Their practical applicability is limited by the fact that a majority of real-life…

Machine Learning · Computer Science 2022-02-02 Umang Aggarwal , Adrian Popescu , Céline Hudelot

Consistency-based Semi-supervised Active Learning: Towards Minimizing Labeling Cost

Active learning (AL) combines data labeling and model training to minimize the labeling cost by prioritizing the selection of high value data that can best improve model performance. In pool-based active learning, accessible unlabeled data…

Machine Learning · Computer Science 2020-07-21 Mingfei Gao , Zizhao Zhang , Guo Yu , Sercan O. Arik , Larry S. Davis , Tomas Pfister

Active Learning with Importance Sampling

We consider an active learning setting where the algorithm has access to a large pool of unlabeled data and a small pool of labeled data. In each iteration, the algorithm chooses few unlabeled data points and obtains their labels from an…

Machine Learning · Computer Science 2019-10-11 Muni Sreenivas Pydi , Vishnu Suresh Lokhande

Multi-Label Bayesian Active Learning with Inter-Label Relationships

The primary challenge of multi-label active learning, differing it from multi-class active learning, lies in assessing the informativeness of an indefinite number of labels while also accounting for the inherited label correlation. Existing…

Machine Learning · Computer Science 2025-09-05 Yuanyuan Qi , Jueqing Lu , Xiaohao Yang , Joanne Enticott , Lan Du

Regularized Learning for Domain Adaptation under Label Shifts

We propose Regularized Learning under Label shifts (RLLS), a principled and a practical domain-adaptation algorithm to correct for shifts in the label distribution between a source and a target domain. We first estimate importance weights…

Machine Learning · Computer Science 2020-08-10 Kamyar Azizzadenesheli , Anqi Liu , Fanny Yang , Animashree Anandkumar

Learning to Sample: an Active Learning Framework

Meta-learning algorithms for active learning are emerging as a promising paradigm for learning the ``best'' active learning strategy. However, current learning-based active learning approaches still require sufficient training data so as to…

Machine Learning · Computer Science 2019-09-10 Jingyu Shao , Qing Wang , Fangbing Liu

Combating Label Distribution Shift for Active Domain Adaptation

We consider the problem of active domain adaptation (ADA) to unlabeled target data, of which subset is actively selected and labeled given a budget constraint. Inspired by recent analysis on a critical issue from label distribution mismatch…

Machine Learning · Computer Science 2022-08-16 Sehyun Hwang , Sohyun Lee , Sungyeon Kim , Jungseul Ok , Suha Kwak

Improve Cost Efficiency of Active Learning over Noisy Dataset

Active learning is a learning strategy whereby the machine learning algorithm actively identifies and labels data points to optimize its learning. This strategy is particularly effective in domains where an abundance of unlabeled data…

Machine Learning · Computer Science 2024-03-05 Zan-Kai Chong , Hiroyuki Ohsaki , Bryan Ng

Deep Active Learning with Contrastive Learning Under Realistic Data Pool Assumptions

Active learning aims to identify the most informative data from an unlabeled data pool that enables a model to reach the desired accuracy rapidly. This benefits especially deep neural networks which generally require a huge number of…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Jihyo Kim , Jeonghyeon Kim , Sangheum Hwang

To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation

Active learning (AL) techniques reduce labeling costs for training neural machine translation (NMT) models by selecting smaller representative subsets from unlabeled data for annotation. Diversity sampling techniques select heterogeneous…

Computation and Language · Computer Science 2024-12-19 Abdul Hameed Azeemi , Ihsan Ayyub Qazi , Agha Ali Raza

Model-Change Active Learning in Graph-Based Semi-Supervised Learning

Active learning in semi-supervised classification involves introducing additional labels for unlabelled data to improve the accuracy of the underlying classifier. A challenge is to identify which points to label to best improve performance…

Machine Learning · Statistics 2024-11-18 Kevin Miller , Andrea L. Bertozzi

BAL: Balancing Diversity and Novelty for Active Learning

The objective of Active Learning is to strategically label a subset of the dataset to maximize performance within a predetermined labeling budget. In this study, we harness features acquired through self-supervised learning. We introduce a…

Machine Learning · Computer Science 2023-12-27 Jingyao Li , Pengguang Chen , Shaozuo Yu , Shu Liu , Jiaya Jia

Deep Active Learning with Manifold-preserving Trajectory Sampling

Active learning (AL) is for optimizing the selection of unlabeled data for annotation (labeling), aiming to enhance model performance while minimizing labeling effort. The key question in AL is which unlabeled data should be selected for…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Yingrui Ji , Vijaya Sindhoori Kaza , Nishanth Artham , Tianyang Wang

Theory-inspired Label Shift Adaptation via Aligned Distribution Mixture

As a prominent challenge in addressing real-world issues within a dynamic environment, label shift, which refers to the learning setting where the source (training) and target (testing) label distributions do not match, has recently…

Machine Learning · Computer Science 2024-11-06 Ruidong Fan , Xiao Ouyang , Hong Tao , Yuhua Qian , Chenping Hou

Frugal Reinforcement-based Active Learning

Most of the existing learning models, particularly deep neural networks, are reliant on large datasets whose hand-labeling is expensive and time demanding. A current trend is to make the learning of these models frugal and less dependent on…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 Sebastien Deschamps , Hichem Sahbi