Related papers: Identifying Incorrect Annotations in Multi-Label C…

Multi-Label Learning from Single Positive Labels

Predicting all applicable labels for a given image is known as multi-label classification. Compared to the standard multi-class case (where each image has only one label), it is considerably more challenging to annotate training data for…

Computer Vision and Pattern Recognition · Computer Science 2021-10-25 Elijah Cole , Oisin Mac Aodha , Titouan Lorieul , Pietro Perona , Dan Morris , Nebojsa Jojic

Modeling Multiple Annotator Expertise in the Semi-Supervised Learning Scenario

Learning algorithms normally assume that there is at most one annotation or label per data point. However, in some scenarios, such as medical diagnosis and on-line collaboration,multiple annotations may be available. In either case,…

Machine Learning · Computer Science 2012-03-19 Yan Yan , Romer Rosales , Glenn Fung , Jennifer Dy

Multi-label Classification with Partial Annotations using Class-aware Selective Loss

Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Emanuel Ben-Baruch , Tal Ridnik , Itamar Friedman , Avi Ben-Cohen , Nadav Zamir , Asaf Noy , Lihi Zelnik-Manor

Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels

Multi-label classification is a widely encountered problem in daily life, where an instance can be associated with multiple classes. In theory, this is a supervised learning method that requires a large amount of labeling. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-02 XIn Zhang , Yuqi Song , Fei Zuo , Xiaofeng Wang

Learning a Deep ConvNet for Multi-label Classification with Partial Labels

Deep ConvNets have shown great performance for single-label image classification (e.g. ImageNet), but it is necessary to move beyond the single-label classification task because pictures of everyday life are inherently multi-label.…

Computer Vision and Pattern Recognition · Computer Science 2019-02-27 Thibaut Durand , Nazanin Mehrasa , Greg Mori

An Effective Approach for Multi-label Classification with Missing Labels

Compared with multi-class classification, multi-label classification that contains more than one class is more suitable in real life scenarios. Obtaining fully labeled high-quality datasets for multi-label classification problems, however,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-26 Xin Zhang , Rabab Abdelfattah , Yuqi Song , Xiaofeng Wang

Estimating label quality and errors in semantic segmentation data via any model

The labor-intensive annotation process of semantic segmentation datasets is often prone to errors, since humans struggle to label every pixel correctly. We study algorithms to automatically detect such annotation errors, in particular…

Machine Learning · Computer Science 2023-07-12 Vedang Lad , Jonas Mueller

ActiveLab: Active Learning with Re-Labeling by Multiple Annotators

In real-world data labeling applications, annotators often provide imperfect labels. It is thus common to employ multiple annotators to label data with some overlap between their examples. We study active learning in such settings, aiming…

Machine Learning · Computer Science 2024-07-29 Hui Wen Goh , Jonas Mueller

ObjectLab: Automated Diagnosis of Mislabeled Images in Object Detection Data

Despite powering sensitive systems like autonomous vehicles, object detection remains fairly brittle in part due to annotation errors that plague most real-world training datasets. We propose ObjectLab, a straightforward algorithm to detect…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Ulyana Tkachenko , Aditya Thyagarajan , Jonas Mueller

Identifying Mislabeled Instances in Classification Datasets

A key requirement for supervised machine learning is labeled training data, which is created by annotating unlabeled data with the appropriate class. Because this process can in many cases not be done by machines, labeling needs to be…

Machine Learning · Computer Science 2019-12-12 Nicolas Michael Müller , Karla Markert

Rethinking Consistent Multi-Label Classification Under Inexact Supervision

Partial multi-label learning and complementary multi-label learning are two popular weakly supervised multi-label classification paradigms that aim to alleviate the high annotation costs of collecting precisely annotated multi-label data.…

Machine Learning · Computer Science 2026-02-26 Wei Wang , Tianhao Ma , Ming-Kun Xie , Gang Niu , Masashi Sugiyama

Learning from Concealed Labels

Annotating data for sensitive labels (e.g., disease, smoking) poses a potential threats to individual privacy in many real-world scenarios. To cope with this problem, we propose a novel setting to protect privacy of each instance, namely…

Machine Learning · Computer Science 2024-12-04 Zhongnian Li , Meng Wei , Peng Ying , Tongfeng Sun , Xinzheng Xu

Reliable Mislabel Detection for Video Capsule Endoscopy Data

The classification performance of deep neural networks relies strongly on access to large, accurately annotated datasets. In medical imaging, however, obtaining such datasets is particularly challenging since annotations must be provided by…

Computer Vision and Pattern Recognition · Computer Science 2026-02-09 Julia Werner , Julius Oexle , Oliver Bause , Maxime Le Floch , Franz Brinkmann , Hannah Tolle , Jochen Hampe , Oliver Bringmann

Detecting Label Errors in Token Classification Data

Mislabeled examples are a common issue in real-world data, particularly for tasks like token classification where many labels must be chosen on a fine-grained basis. Here we consider the task of finding sentences that contain label errors…

Computation and Language · Computer Science 2022-10-24 Wei-Chen Wang , Jonas Mueller

Meta-Learning for Multi-Label Few-Shot Classification

Even with the luxury of having abundant data, multi-label classification is widely known to be a challenging task to address. This work targets the problem of multi-label meta-learning, where a model learns to predict multiple labels within…

Computer Vision and Pattern Recognition · Computer Science 2021-10-27 Christian Simon , Piotr Koniusz , Mehrtash Harandi

Learning with Different Amounts of Annotation: From Zero to Many Labels

Training NLP systems typically assumes access to annotated data that has a single human label per example. Given imperfect labeling from annotators and inherent ambiguity of language, we hypothesize that single label is not sufficient to…

Computation and Language · Computer Science 2021-09-14 Shujian Zhang , Chengyue Gong , Eunsol Choi

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets

Data is the engine of modern computer vision, which necessitates collecting large-scale datasets. This is expensive, and guaranteeing the quality of the labels is a major challenge. In this paper, we investigate efficient annotation…

Computer Vision and Pattern Recognition · Computer Science 2021-04-27 Yuan-Hong Liao , Amlan Kar , Sanja Fidler

Robust Assignment of Labels for Active Learning with Sparse and Noisy Annotations

Supervised classification algorithms are used to solve a growing number of real-life problems around the globe. Their performance is strictly connected with the quality of labels used in training. Unfortunately, acquiring good-quality…

Machine Learning · Computer Science 2024-07-08 Daniel Kałuża , Andrzej Janusz , Dominik Ślęzak

In real-world applications, as data availability increases, obtaining labeled data for machine learning (ML) projects remains challenging due to the high costs and intensive efforts required for data annotation. Many ML projects,…

Machine Learning · Computer Science 2024-12-24 Ismail Hakki Karaman , Gulser Koksal , Levent Eriskin , Salih Salihoglu

Is one annotation enough? A data-centric image classification benchmark for noisy and ambiguous label estimation

High-quality data is necessary for modern machine learning. However, the acquisition of such data is difficult due to noisy and ambiguous annotations of humans. The aggregation of such annotations to determine the label of an image leads to…

Computer Vision and Pattern Recognition · Computer Science 2022-11-07 Lars Schmarje , Vasco Grossmann , Claudius Zelenka , Sabine Dippel , Rainer Kiko , Mariusz Oszust , Matti Pastell , Jenny Stracke , Anna Valros , Nina Volkmann , Reinhard Koch