Related papers: Multi-Label Active Learning from Crowds

An Analysis of Active Learning Algorithms using Real-World Crowd-sourced Text Annotations

Active learning algorithms automatically identify the most informative samples from large amounts of unlabeled data and tremendously reduce human annotation effort in inducing a machine learning model. In a conventional active learning…

Machine Learning · Computer Science 2026-04-28 Varun Totakura , Ankita Singh , Yushun Dong , Shayok Chakraborty

Active Multi-Label Crowd Consensus

Crowdsourcing is an economic and efficient strategy aimed at collecting annotations of data through an online platform. Crowd workers with different expertise are paid for their service, and the task requester usually has a limited budget.…

Machine Learning · Computer Science 2019-11-11 Jinzheng Tu , Guoxian Yu , Carlotta Domeniconi , Jun Wang , Xiangliang Zhang

Multi-Label Annotation Aggregation in Crowdsourcing

As a means of human-based computation, crowdsourcing has been widely used to annotate large-scale unlabeled datasets. One of the obvious challenges is how to aggregate these possibly noisy labels provided by a set of heterogeneous…

Machine Learning · Computer Science 2020-10-20 Xuan Wei , Daniel Dajun Zeng , Junming Yin

Topic Model Based Multi-Label Classification from the Crowd

Multi-label classification is a common supervised machine learning problem where each instance is associated with multiple classes. The key challenge in this problem is learning the correlations between the classes. An additional challenge…

Machine Learning · Computer Science 2016-04-05 Divya Padmanabhan , Satyanath Bhat , Shirish Shevade , Y. Narahari

An Active Learning Approach for Jointly Estimating Worker Performance and Annotation Reliability with Crowdsourced Data

Crowdsourcing platforms offer a practical solution to the problem of affordably annotating large datasets for training supervised classifiers. Unfortunately, poor worker performance frequently threatens to compromise annotation reliability,…

Machine Learning · Computer Science 2014-01-17 Liyue Zhao , Yu Zhang , Gita Sukthankar

Efficiency of active learning for the allocation of workers on crowdsourced classification tasks

Crowdsourcing has been successfully employed in the past as an effective and cheap way to execute classification tasks and has therefore attracted the attention of the research community. However, we still lack a theoretical understanding…

Human-Computer Interaction · Computer Science 2016-10-20 Edoardo Manino , Long Tran-Thanh , Nicholas R. Jennings

Active Learning for Crowd-Sourced Databases

Crowd-sourcing has become a popular means of acquiring labeled data for a wide variety of tasks where humans are more accurate than computers, e.g., labeling images, matching objects, or analyzing sentiment. However, relying solely on the…

Machine Learning · Computer Science 2014-12-23 Barzan Mozafari , Purnamrita Sarkar , Michael J. Franklin , Michael I. Jordan , Samuel Madden

Efficient PAC Learning from the Crowd

In recent years crowdsourcing has become the method of choice for gathering labeled training data for learning algorithms. Standard approaches to crowdsourcing view the process of acquiring labeled data separately from the process of…

Machine Learning · Computer Science 2017-04-17 Pranjal Awasthi , Avrim Blum , Nika Haghtalab , Yishay Mansour

ActiveLab: Active Learning with Re-Labeling by Multiple Annotators

In real-world data labeling applications, annotators often provide imperfect labels. It is thus common to employ multiple annotators to label data with some overlap between their examples. We study active learning in such settings, aiming…

Machine Learning · Computer Science 2024-07-29 Hui Wen Goh , Jonas Mueller

Candidate Labeling for Crowd Learning

Crowdsourcing has become very popular among the machine learning community as a way to obtain labels that allow a ground truth to be estimated for a given dataset. In most of the approaches that use crowdsourced labels, annotators are asked…

Machine Learning · Statistics 2018-08-09 Iker Beñaran-Muñoz , Jerónimo Hernández-González , Aritz Pérez

Label Selection Approach to Learning from Crowds

Supervised learning, especially supervised deep learning, requires large amounts of labeled data. One approach to collect large amounts of labeled data is by using a crowdsourcing platform where numerous workers perform the annotation…

Machine Learning · Computer Science 2023-08-22 Kosuke Yoshimura , Hisashi Kashima

Active Learning from Crowd in Document Screening

In this paper, we explore how to efficiently combine crowdsourcing and machine intelligence for the problem of document screening, where we need to screen documents with a set of machine-learning filters. Specifically, we focus on building…

Information Retrieval · Computer Science 2020-12-07 Evgeny Krivosheev , Burcu Sayin , Alessandro Bozzon , Zoltán Szlávik

PAL : Pretext-based Active Learning

The goal of pool-based active learning is to judiciously select a fixed-sized subset of unlabeled samples from a pool to query an oracle for their labels, in order to maximize the accuracy of a supervised learner. However, the unsaid…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Shubhang Bhatnagar , Sachin Goyal , Darshan Tank , Amit Sethi

CrowdAL: Towards a Blockchain-empowered Active Learning System in Crowd Data Labeling

Active Learning (AL) is a machine learning technique where the model selectively queries the most informative data points for labeling by human experts. Integrating AL with crowdsourcing leverages crowd diversity to enhance data labeling…

Cryptography and Security · Computer Science 2025-03-04 Shaojie Hou , Yuandou Wang , Zhiming Zhao

Attention-Aware Answers of the Crowd

Crowdsourcing is a relatively economic and efficient solution to collect annotations from the crowd through online platforms. Answers collected from workers with different expertise may be noisy and unreliable, and the quality of annotated…

Machine Learning · Computer Science 2020-01-08 Jingzheng Tu , Guoxian Yu , Jun Wang , Carlotta Domeniconi , Xiangliang Zhang

ACTOR: Active Learning with Annotator-specific Classification Heads to Embrace Human Label Variation

Label aggregation such as majority voting is commonly used to resolve annotator disagreement in dataset creation. However, this may disregard minority values and opinions. Recent studies indicate that learning from individual annotations…

Computation and Language · Computer Science 2023-10-24 Xinpeng Wang , Barbara Plank

Beyond Labels: Information-Efficient Human-in-the-Loop Learning using Ranking and Selection Queries

Integrating human expertise into machine learning systems often reduces the role of experts to labeling oracles, a paradigm that limits the amount of information exchanged and fails to capture the nuances of human judgment. We address this…

Human-Computer Interaction · Computer Science 2026-02-18 Belén Martín-Urcelay , Yoonsang Lee , Matthieu R. Bloch , Christopher J. Rozell

Active learning for data streams: a survey

Online active learning is a paradigm in machine learning that aims to select the most informative data points to label from a data stream. The problem of minimizing the cost associated with collecting labeled observations has gained a lot…

Machine Learning · Statistics 2023-12-01 Davide Cacciarelli , Murat Kulahci

Active Learning in Incomplete Label Multiple Instance Multiple Label Learning

In multiple instance multiple label learning, each sample, a bag, consists of multiple instances. To alleviate labeling complexity, each sample is associated with a set of bag-level labels leaving instances within the bag unlabeled. This…

Machine Learning · Computer Science 2021-07-28 Tam Nguyen , Raviv Raich

Towards Computationally Feasible Deep Active Learning

Active learning (AL) is a prominent technique for reducing the annotation effort required for training machine learning models. Deep learning offers a solution for several essential obstacles to deploying AL in practice but introduces many…

Computation and Language · Computer Science 2022-05-10 Akim Tsvigun , Artem Shelmanov , Gleb Kuzmin , Leonid Sanochkin , Daniil Larionov , Gleb Gusev , Manvel Avetisian , Leonid Zhukov