Related papers: Learn2Agree: Fitting with Multiple Annotators with…

Learning from Multiple Expert Annotators for Enhancing Anomaly Detection in Medical Image Analysis

Building an accurate computer-aided diagnosis system based on data-driven approaches requires a large amount of high-quality labeled data. In medical imaging analysis, multiple expert annotators often produce subjective estimates about…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Khiem H. Le , Tuan V. Tran , Hieu H. Pham , Hieu T. Nguyen , Tung T. Le , Ha Q. Nguyen

Learning from Imperfect Annotations

Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective,…

Machine Learning · Computer Science 2020-04-08 Emmanouil Antonios Platanios , Maruan Al-Shedivat , Eric Xing , Tom Mitchell

Calibrating Probabilistic Object Detectors with Annotator Disagreement

High degrees of disagreement among annotators can exist for ambiguous objects, e.g. in medical images, underscoring the challenges of establishing ground truth annotations in object detection tasks. Despite this, all existing object…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Zhi Qin Tan , Owen Addison , Yunpeng Li

Optimizing Resources for On-the-Fly Label Estimation with Multiple Unknown Medical Experts

Accurate ground truth estimation in medical screening programs often relies on coalitions of experts and peer second opinions. Algorithms that efficiently aggregate noisy annotations can enhance screening workflows, particularly when data…

Machine Learning · Computer Science 2025-10-07 Tim Bary , Tiffanie Godelaine , Axel Abels , Benoît Macq

Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling

Sequence labeling is a fundamental framework for various natural language processing problems. Its performance is largely influenced by the annotation quality and quantity in supervised learning scenarios, and obtaining ground truth labels…

Computation and Language · Computer Science 2020-04-17 Ouyu Lan , Xiao Huang , Bill Yuchen Lin , He Jiang , Liyuan Liu , Xiang Ren

Dealing with Disagreements: Looking Beyond the Majority Vote in Subjective Annotations

Majority voting and averaging are common approaches employed to resolve annotator disagreements and derive single ground truth labels from multiple annotations. However, annotators may systematically disagree with one another, often…

Computation and Language · Computer Science 2021-10-13 Aida Mostafazadeh Davani , Mark Díaz , Vinodkumar Prabhakaran

Multi-annotator Deep Learning: A Probabilistic Framework for Classification

Solving complex classification tasks using deep neural networks typically requires large amounts of annotated data. However, corresponding class labels are noisy when provided by error-prone annotators, e.g., crowdworkers. Training standard…

Machine Learning · Computer Science 2023-10-25 Marek Herde , Denis Huseljic , Bernhard Sick

DESC: Domain Adaptation for Depth Estimation via Semantic Consistency

Accurate real depth annotations are difficult to acquire, needing the use of special devices such as a LiDAR sensor. Self-supervised methods try to overcome this problem by processing video or stereo sequences, which may not always be…

Computer Vision and Pattern Recognition · Computer Science 2020-09-04 Adrian Lopez-Rodriguez , Krystian Mikolajczyk

D-LEMA: Deep Learning Ensembles from Multiple Annotations -- Application to Skin Lesion Segmentation

Medical image segmentation annotations suffer from inter- and intra-observer variations even among experts due to intrinsic differences in human annotators and ambiguous boundaries. Leveraging a collection of annotators' opinions for an…

Image and Video Processing · Electrical Eng. & Systems 2021-05-04 Zahra Mirikharaji , Kumar Abhishek , Saeed Izadi , Ghassan Hamarneh

Disjoint Contrastive Regression Learning for Multi-Sourced Annotations

Large-scale datasets are important for the development of deep learning models. Such datasets usually require a heavy workload of annotations, which are extremely time-consuming and expensive. To accelerate the annotation procedure,…

Machine Learning · Computer Science 2024-03-13 Xiaoqian Ruan , Gaoang Wang

Learning From Noisy Labels By Regularized Estimation Of Annotator Confusion

The predictive performance of supervised learning algorithms depends on the quality of labels. In a typical label collection process, multiple annotators provide subjective noisy estimates of the "truth" under the influence of their varying…

Machine Learning · Computer Science 2019-06-18 Ryutaro Tanno , Ardavan Saeedi , Swami Sankaranarayanan , Daniel C. Alexander , Nathan Silberman

Learning from Multiple Annotators by Incorporating Instance Features

Learning from multiple annotators aims to induce a high-quality classifier from training instances, where each of them is associated with a set of possibly noisy labels provided by multiple annotators under the influence of their varying…

Machine Learning · Computer Science 2021-06-30 Jingzheng Li , Hailong Sun , Jiyi Li , Zhijun Chen , Renshuai Tao , Yufei Ge

Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations

Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels…

Computation and Language · Computer Science 2024-03-08 Abhishek Anand , Negar Mokhberian , Prathyusha Naresh Kumar , Anweasha Saha , Zihao He , Ashwin Rao , Fred Morstatter , Kristina Lerman

Beyond Agreement: Rethinking Ground Truth in Educational AI Annotation

Humans can be notoriously imperfect evaluators. They are often biased, unreliable, and unfit to define "ground truth." Yet, given the surging need to produce large amounts of training data in educational applications using AI, traditional…

Artificial Intelligence · Computer Science 2025-08-04 Danielle R. Thomas , Conrad Borchers , Kenneth R. Koedinger

Multi-Perspective Stance Detection

Subjective NLP tasks usually rely on human annotations provided by multiple annotators, whose judgments may vary due to their diverse backgrounds and life experiences. Traditional methods often aggregate multiple annotations into a single…

Computation and Language · Computer Science 2025-10-17 Benedetta Muscato , Praveen Bushipaka , Gizem Gezici , Lucia Passaro , Fosca Giannotti

Multi-rater Prompting for Ambiguous Medical Image Segmentation

Multi-rater annotations commonly occur when medical images are independently annotated by multiple experts (raters). In this paper, we tackle two challenges arisen in multi-rater annotations for medical image segmentation (called ambiguous…

Computer Vision and Pattern Recognition · Computer Science 2024-08-26 Jinhong Wang , Yi Cheng , Jintai Chen , Hongxia Xu , Danny Chen , Jian Wu

Adjusting the Ground Truth Annotations for Connectivity-Based Learning to Delineate

Deep learning-based approaches to delineating 3D structure depend on accurate annotations to train the networks. Yet, in practice, people, no matter how conscientious, have trouble precisely delineating in 3D and on a large scale, in part…

Computer Vision and Pattern Recognition · Computer Science 2022-12-27 Doruk Oner , Leonardo Citraro , Mateusz Koziński , Pascal Fua

The Consensus Trap: Dissecting Subjectivity and the "Ground Truth" Illusion in Data Annotation

In machine learning, "ground truth" refers to the assumed correct labels used to train and evaluate models. However, the foundational "ground truth" paradigm rests on a positivistic fallacy that treats human disagreement as technical noise…

Artificial Intelligence · Computer Science 2026-04-28 Sheza Munir , Benjamin Mah , Krisha Kalsi , Shivani Kapania , Julian Posada , Edith Law , Ding Wang , Syed Ishtiaque Ahmed

What is the ground truth? Reliability of multi-annotator data for audio tagging

Crowdsourcing has become a common approach for annotating large amounts of data. It has the advantage of harnessing a large workforce to produce large amounts of data in a short time, but comes with the disadvantage of employing non-expert…

Audio and Speech Processing · Electrical Eng. & Systems 2021-04-12 Irene Martin-Morato , Annamaria Mesaros

Phase Consistent Ecological Domain Adaptation

We introduce two criteria to regularize the optimization involved in learning a classifier in a domain where no annotated data are available, leveraging annotated data in a different domain, a problem known as unsupervised domain…

Computer Vision and Pattern Recognition · Computer Science 2020-04-13 Yanchao Yang , Dong Lao , Ganesh Sundaramoorthi , Stefano Soatto