Related papers: Evaluating Classifiers Without Expert Labels

Global Multiclass Classification and Dataset Construction via Heterogeneous Local Experts

In the domains of dataset construction and crowdsourcing, a notable challenge is to aggregate labels from a heterogeneous set of labelers, each of whom is potentially an expert in some subset of tasks (and less reliable in others). To…

Machine Learning · Computer Science 2021-01-07 Surin Ahn , Ayfer Ozgur , Mert Pilanci

Supervised Collective Classification for Crowdsourcing

Crowdsourcing utilizes the wisdom of crowds for collective classification via information (e.g., labels of an item) provided by labelers. Current crowdsourcing algorithms are mainly unsupervised methods that are unaware of the quality of…

Social and Information Networks · Computer Science 2016-11-15 Pin-Yu Chen , Chia-Wei Lien , Fu-Jen Chu , Pai-Shun Ting , Shin-Ming Cheng

Estimating the Accuracies of Multiple Classifiers Without Labeled Data

In various situations one is given only the predictions of multiple classifiers over a large unlabeled test data. This scenario raises the following questions: Without any labeled data and without any a-priori knowledge about the…

Machine Learning · Statistics 2014-10-31 Ariel Jaffe , Boaz Nadler , Yuval Kluger

Streaming algorithms for evaluating noisy judges on unlabeled data -- binary classification

The evaluation of noisy binary classifiers on unlabeled data is treated as a streaming task: given a data sketch of the decisions by an ensemble, estimate the true prevalence of the labels as well as each classifier's accuracy on them. Two…

Machine Learning · Statistics 2023-09-11 Andrés Corrada-Emmanuel

Leveraging Human Feedback to Scale Educational Datasets: Combining Crowdworkers and Comparative Judgement

Machine Learning models have many potentially beneficial applications in education settings, but a key barrier to their development is securing enough data to train these models. Labelling educational data has traditionally relied on highly…

Computation and Language · Computer Science 2023-11-10 Owen Henkel , Libby Hills

Unsupervised Label Refinement Improves Dataless Text Classification

Dataless text classification is capable of classifying documents into previously unseen labels by assigning a score to any document paired with a label description. While promising, it crucially relies on accurate descriptions of the label…

Computation and Language · Computer Science 2020-12-09 Zewei Chu , Karl Stratos , Kevin Gimpel

Toward a Robust Crowd-labeling Framework using Expert Evaluation and Pairwise Comparison

Crowd-labeling emerged from the need to label large-scale and complex data, a tedious, expensive, and time-consuming task. One of the main challenges in the crowd-labeling task is to control for or determine in advance the proportion of…

Human-Computer Interaction · Computer Science 2016-07-11 Faiza Khan Khattak , Ansaf Salleb-Aouissi

Aggregating Soft Labels from Crowd Annotations Improves Uncertainty Estimation Under Distribution Shift

Selecting an effective training signal for machine learning tasks is difficult: expert annotations are expensive, and crowd-sourced annotations may not be reliable. Recent work has demonstrated that learning from a distribution over labels…

Computation and Language · Computer Science 2025-04-23 Dustin Wright , Isabelle Augenstein

How many labelers do you have? A closer look at gold-standard labels

The construction of most supervised learning datasets revolves around collecting multiple labels for each instance, then aggregating the labels to form a type of "gold-standard". We question the wisdom of this pipeline by developing a…

Statistics Theory · Mathematics 2024-06-06 Chen Cheng , Hilal Asi , John Duchi

Crowd-Certain: Label Aggregation in Crowdsourced and Ensemble Learning Classification

Crowdsourcing systems have been used to accumulate massive amounts of labeled data for applications such as computer vision and natural language processing. However, because crowdsourced labeling is inherently dynamic and uncertain,…

Machine Learning · Computer Science 2023-10-26 Mohammad S. Majdi , Jeffrey J. Rodriguez

Active clustering for labeling training data

Gathering training data is a key step of any supervised learning task, and it is both critical and expensive. Critical, because the quantity and quality of the training data has a high impact on the performance of the learned function.…

Data Structures and Algorithms · Computer Science 2021-10-28 Quentin Lutz , Élie de Panafieu , Alex Scott , Maya Stein

Classifier Risk Estimation under Limited Labeling Resources

In this paper we propose strategies for estimating performance of a classifier when labels cannot be obtained for the whole test set. The number of test instances which can be labeled is very small compared to the whole test data size. The…

Machine Learning · Computer Science 2018-02-21 Anurag Kumar , Bhiksha Raj

Efficient PAC Learning from the Crowd

In recent years crowdsourcing has become the method of choice for gathering labeled training data for learning algorithms. Standard approaches to crowdsourcing view the process of acquiring labeled data separately from the process of…

Machine Learning · Computer Science 2017-04-17 Pranjal Awasthi , Avrim Blum , Nika Haghtalab , Yishay Mansour

Ballpark Crowdsourcing: The Wisdom of Rough Group Comparisons

Crowdsourcing has become a popular method for collecting labeled training data. However, in many practical scenarios traditional labeling can be difficult for crowdworkers (for example, if the data is high-dimensional or unintuitive, or the…

Machine Learning · Statistics 2017-12-14 Tom Hope , Dafna Shahaf

Crowd Labeling: a survey

Recently, there has been a burst in the number of research projects on human computation via crowdsourcing. Multiple choice (or labeling) questions could be referred to as a common type of problem which is solved by this approach. As an…

Artificial Intelligence · Computer Science 2014-09-04 Jafar Muhammadi , Hamid Reza Rabiee , Abbas Hosseini

Unsupervised Ranking and Aggregation of Label Descriptions for Zero-Shot Classifiers

Zero-shot text classifiers based on label descriptions embed an input text and a set of labels into the same space: measures such as cosine similarity can then be used to select the most similar label description to the input text as the…

Computation and Language · Computer Science 2022-05-25 Angelo Basile , Marc Franco-Salvador , Paolo Rosso

OpinionRank: Extracting Ground Truth Labels from Unreliable Expert Opinions with Graph-Based Spectral Ranking

As larger and more comprehensive datasets become standard in contemporary machine learning, it becomes increasingly more difficult to obtain reliable, trustworthy label information with which to train sophisticated models. To address this…

Machine Learning · Computer Science 2021-06-08 Glenn Dawson , Robi Polikar

Ranking and combining multiple predictors without labeled data

In a broad range of classification and decision making problems, one is given the advice or predictions of several classifiers, of unknown reliability, over multiple questions or queries. This scenario is different from the standard…

Machine Learning · Statistics 2014-02-07 Fabio Parisi , Francesco Strino , Boaz Nadler , Yuval Kluger

CrowdGrader: Crowdsourcing the Evaluation of Homework Assignments

Crowdsourcing offers a practical method for ranking and scoring large amounts of items. To investigate the algorithms and incentives that can be used in crowdsourcing quality evaluations, we built CrowdGrader, a tool that lets students…

Social and Information Networks · Computer Science 2013-08-27 Luca de Alfaro , Michael Shavlovsky

End-to-End Learning from Noisy Crowd to Supervised Machine Learning Models

Labeling real-world datasets is time consuming but indispensable for supervised machine learning models. A common solution is to distribute the labeling task across a large number of non-expert workers via crowd-sourcing. Due to the varying…

Machine Learning · Computer Science 2020-11-16 Taraneh Younesian , Chi Hong , Amirmasoud Ghiassi , Robert Birke , Lydia Y. Chen