Related papers: Data Shapley Valuation for Efficient Batch Active …

Dual Active Sampling on Batch-Incremental Active Learning

Recently, Convolutional Neural Networks (CNNs) have shown unprecedented success in the field of computer vision, especially on challenging image classification tasks by relying on a universal approach, i.e., training a deep model on a…

Computer Vision and Pattern Recognition · Computer Science 2019-05-23 Johan Phan , Massimiliano Ruocco , Francesco Scibilia

Semi-Supervised Active Learning with Temporal Output Discrepancy

While deep learning succeeds in a wide range of tasks, it highly depends on the massive collection of annotated data which is expensive and time-consuming. To lower the cost of data annotation, active learning has been proposed to…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Siyu Huang , Tianyang Wang , Haoyi Xiong , Jun Huan , Dejing Dou

Energy-Based Open-Set Active Learning for Object Classification

Active learning (AL) has emerged as a crucial methodology for minimizing labeling costs in deep learning by selecting the most valuable samples from a pool of unlabeled data for annotation. Traditional AL operates under a closed-set…

Machine Learning · Computer Science 2026-04-23 Zongyao Lyu , William J. Beksi

Identifying Wrongly Predicted Samples: A Method for Active Learning

State-of-the-art machine learning models require access to significant amount of annotated data in order to achieve the desired level of performance. While unlabelled data can be largely available and even abundant, annotation process can…

Machine Learning · Computer Science 2020-10-15 Rahaf Aljundi , Nikolay Chumerin , Daniel Olmeda Reino

Temporal Output Discrepancy for Loss Estimation-based Active Learning

While deep learning succeeds in a wide range of tasks, it highly depends on the massive collection of annotated data which is expensive and time-consuming. To lower the cost of data annotation, active learning has been proposed to…

Computer Vision and Pattern Recognition · Computer Science 2022-12-22 Siyu Huang , Tianyang Wang , Haoyi Xiong , Bihan Wen , Jun Huan , Dejing Dou

Exploiting Diversity of Unlabeled Data for Label-Efficient Semi-Supervised Active Learning

The availability of large labeled datasets is the key component for the success of deep learning. However, annotating labels on large datasets is generally time-consuming and expensive. Active learning is a research area that addresses the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Felix Buchert , Nassir Navab , Seong Tae Kim

Deep Active Learning with Manifold-preserving Trajectory Sampling

Active learning (AL) is for optimizing the selection of unlabeled data for annotation (labeling), aiming to enhance model performance while minimizing labeling effort. The key question in AL is which unlabeled data should be selected for…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Yingrui Ji , Vijaya Sindhoori Kaza , Nishanth Artham , Tianyang Wang

Robust Assignment of Labels for Active Learning with Sparse and Noisy Annotations

Supervised classification algorithms are used to solve a growing number of real-life problems around the globe. Their performance is strictly connected with the quality of labels used in training. Unfortunately, acquiring good-quality…

Machine Learning · Computer Science 2024-07-08 Daniel Kałuża , Andrzej Janusz , Dominik Ślęzak

Inconsistency-Based Data-Centric Active Open-Set Annotation

Active learning is a commonly used approach that reduces the labeling effort required to train deep neural networks. However, the effectiveness of current active learning methods is limited by their closed-world assumptions, which assume…

Machine Learning · Computer Science 2024-01-11 Ruiyu Mao , Ouyang Xu , Yunhui Guo

Boosting Active Learning via Improving Test Performance

Central to active learning (AL) is what data should be selected for annotation. Existing works attempt to select highly uncertain or informative data for annotation. Nevertheless, it remains unclear how selected data impacts the test…

Machine Learning · Computer Science 2022-01-25 Tianyang Wang , Xingjian Li , Pengkun Yang , Guosheng Hu , Xiangrui Zeng , Siyu Huang , Cheng-Zhong Xu , Min Xu

CHASe: Client Heterogeneity-Aware Data Selection for Effective Federated Active Learning

Active learning (AL) reduces human annotation costs for machine learning systems by strategically selecting the most informative unlabeled data for annotation, but performing it individually may still be insufficient due to restricted data…

Machine Learning · Computer Science 2025-04-25 Jun Zhang , Jue Wang , Huan Li , Zhongle Xie , Ke Chen , Lidan Shou

Boosting Active Learning for Speech Recognition with Noisy Pseudo-labeled Samples

The cost of annotating transcriptions for large speech corpora becomes a bottleneck to maximally enjoy the potential capacity of deep neural network-based automatic speech recognition models. In this paper, we present a new training…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-06 Jihwan Bang , Heesu Kim , YoungJoon Yoo , Jung-Woo Ha

Language Model-Driven Data Pruning Enables Efficient Active Learning

Active learning (AL) optimizes data labeling efficiency by selecting the most informative instances for annotation. A key component in this procedure is an acquisition function that guides the selection process and identifies the suitable…

Machine Learning · Computer Science 2024-10-08 Abdul Hameed Azeemi , Ihsan Ayyub Qazi , Agha Ali Raza

Active Learning at the ImageNet Scale

Active learning (AL) algorithms aim to identify an optimal subset of data for annotation, such that deep neural networks (DNN) can achieve better performance when trained on this labeled subset. AL is especially impactful in industrial…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Zeyad Ali Sami Emam , Hong-Min Chu , Ping-Yeh Chiang , Wojciech Czaja , Richard Leapman , Micah Goldblum , Tom Goldstein

Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning

Data Shapley has recently been proposed as a principled framework to quantify the contribution of individual datum in machine learning. It can effectively identify helpful or harmful data points for a learning algorithm. In this paper, we…

Machine Learning · Computer Science 2022-01-20 Yongchan Kwon , James Zou

Label-Efficient Point Cloud Semantic Segmentation: An Active Learning Approach

Deep learning models are the state-of-the-art methods for semantic point cloud segmentation, the success of which relies on the availability of large-scale annotated datasets. However, it can be extremely time-consuming and prohibitively…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Xian Shi , Xun Xu , Ke Chen , Lile Cai , Chuan Sheng Foo , Kui Jia

An Analysis of Active Learning Algorithms using Real-World Crowd-sourced Text Annotations

Active learning algorithms automatically identify the most informative samples from large amounts of unlabeled data and tremendously reduce human annotation effort in inducing a machine learning model. In a conventional active learning…

Machine Learning · Computer Science 2026-04-28 Varun Totakura , Ankita Singh , Yushun Dong , Shayok Chakraborty

Zero-shot Active Learning Using Self Supervised Learning

Deep learning algorithms are often said to be data hungry. The performance of such algorithms generally improve as more and more annotated data is fed into the model. While collecting unlabelled data is easier (as they can be scraped easily…

Machine Learning · Computer Science 2024-01-04 Abhishek Sinha , Shreya Singh

Fast-DataShapley: Neural Modeling for Training Data Valuation

The value and copyright of training data are crucial in the artificial intelligence industry. Service platforms should protect data providers' legitimate rights and fairly reward them for their contributions. Shapley value, a potent tool…

Machine Learning · Computer Science 2025-11-21 Haifeng Sun , Yu Xiong , Runze Wu , Xinyu Cai , Changjie Fan , Lan Zhang , Xiang-Yang Li

Active Scene Learning

Sketch recognition allows natural and efficient interaction in pen-based interfaces. A key obstacle to building accurate sketch recognizers has been the difficulty of creating large amounts of annotated training data. Several authors have…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Erelcan Yanik , Tevfik Metin Sezgin