Related papers: Towards Good Practices for Efficiently Annotating …

Is one annotation enough? A data-centric image classification benchmark for noisy and ambiguous label estimation

High-quality data is necessary for modern machine learning. However, the acquisition of such data is difficult due to noisy and ambiguous annotations of humans. The aggregation of such annotations to determine the label of an image leads to…

Computer Vision and Pattern Recognition · Computer Science · 2022-11-07 · Lars Schmarje, Vasco Grossmann, Claudius Zelenka, Sabine Dippel, Rainer Kiko, Mariusz Oszust, Matti Pastell, Jenny Stracke, Anna Valros, Nina Volkmann, Reinhard Koch

Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection

Data collection from manual labeling provides domain-specific and task-aligned supervision for data-driven approaches, and a critical mass of well-annotated resources is required to achieve reasonable performance in natural language…

Computation and Language · Computer Science · 2023-11-09 · Zhengyuan Liu, Hai Leong Chieu, Nancy F. Chen

Label Assistant: A Workflow for Assisted Data Annotation in Image Segmentation Tasks

Recent research in the field of computer vision strongly focuses on deep learning architectures to tackle image processing problems. Deep neural networks are often considered in complex image processing scenarios since traditional computer…

Computer Vision and Pattern Recognition · Computer Science · 2021-11-30 · Marcel P. Schilling, Luca Rettenberger, Friedrich Münke, Haijun Cui, Anna A. Popova, Pavel A. Levkin, Ralf Mikut, Markus Reischl

A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks

Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A strategy to improve label quality is to ask multiple annotators to label the…

Machine Learning · Computer Science · 2023-12-22 · Alexander Braylan, Madalyn Marabella, Omar Alonso, Matthew Lease

Much Ado About Time: Exhaustive Annotation of Temporal Data

Large-scale annotated datasets allow AI systems to learn from and build upon the knowledge of the crowd. Many crowdsourcing techniques have been developed for collecting image annotations. These techniques often implicitly rely on the fact…

Human-Computer Interaction · Computer Science · 2016-10-07 · Gunnar A. Sigurdsson, Olga Russakovsky, Ali Farhadi, Ivan Laptev, Abhinav Gupta

A Data Management Approach for Dataset Selection Using Human Computation

As the number of applications that use machine learning algorithms increases, the need for labeled data useful for training such algorithms intensifies. Getting labels typically involves employing humans to do the annotation, which directly…

Machine Learning · Computer Science · 2013-07-16 · Alexandros Ntoulas, Omar Alonso, Vasilis Kandylas

Rethinking Crowdsourcing Annotation: Partial Annotation with Salient Labels for Multi-Label Image Classification

Annotated images are required for both supervised model training and evaluation in image classification. Manually annotating images is arduous and expensive, especially for multi-labeled images. A recent trend for conducting such laboursome…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-07 · Jianzhe Lin, Tianze Yu, Z. Jane Wang

Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation

Image annotation aims to annotate a given image with a variable number of class labels corresponding to diverse visual concepts. In this paper, we address two main issues in large-scale image annotation: 1) how to learn a rich feature…

Computer Vision and Pattern Recognition · Computer Science · 2018-10-22 · Yulei Niu, Zhiwu Lu, Ji-Rong Wen, Tao Xiang, Shih-Fu Chang

A Survey on Machine Learning Techniques for Auto Labeling of Video, Audio, and Text Data

Machine learning has been utilized to perform tasks in many different domains such as classification, object detection, image segmentation and natural language analysis. Data labeling has always been one of the most important tasks in…

Machine Learning · Computer Science · 2021-09-09 · Shikun Zhang, Omid Jafari, Parth Nagarkar

Self-supervised Semi-supervised Learning for Data Labeling and Quality Evaluation

As the adoption of deep learning techniques in industrial applications grows with increasing speed and scale, successful deployment of deep learning models often hinges on the availability, volume, and quality of annotated data. In this…

Computer Vision and Pattern Recognition · Computer Science · 2021-11-23 · Haoping Bai, Meng Cao, Ping Huang, Jiulong Shan

Robust Assignment of Labels for Active Learning with Sparse and Noisy Annotations

Supervised classification algorithms are used to solve a growing number of real-life problems around the globe. Their performance is strictly connected with the quality of labels used in training. Unfortunately, acquiring good-quality…

Machine Learning · Computer Science · 2024-07-08 · Daniel Kałuża, Andrzej Janusz, Dominik Ślęzak

Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation

Traditional image annotation tasks rely heavily on human effort for object selection and label assignment, making the process time-consuming and prone to decreased efficiency as annotators experience fatigue after extensive work. This paper…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-17 · He Zhang, Xinyi Fu, John M. Carroll

Learning a Deep ConvNet for Multi-label Classification with Partial Labels

Deep ConvNets have shown great performance for single-label image classification (e.g. ImageNet), but it is necessary to move beyond the single-label classification task because pictures of everyday life are inherently multi-label.…

Computer Vision and Pattern Recognition · Computer Science · 2019-02-27 · Thibaut Durand, Nazanin Mehrasa, Greg Mori

Learning from Imperfect Annotations

Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective,…

Machine Learning · Computer Science · 2020-04-08 · Emmanouil Antonios Platanios, Maruan Al-Shedivat, Eric Xing, Tom Mitchell

No Need to Sacrifice Data Quality for Quantity: Crowd-Informed Machine Annotation for Cost-Effective Understanding of Visual Data

Labeling visual data is expensive and time-consuming. Crowdsourcing systems promise to enable highly parallelizable annotations through the participation of monetarily or otherwise motivated workers, but even this approach has its limits.…

Human-Computer Interaction · Computer Science · 2024-09-04 · Christopher Klugmann, Rafid Mahmood, Guruprasad Hegde, Amit Kale, Daniel Kondermann

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

Deep neural networks (DNNs) have demonstrated exceptional performance across various image segmentation tasks. However, the process of preparing datasets for training segmentation DNNs is both labor-intensive and costly, as it typically…

Computer Vision and Pattern Recognition · Computer Science · 2025-01-24 · Yixin Zhang, Shen Zhao, Hanxue Gu, Maciej A. Mazurowski

Active Learning for NLP with Large Language Models

Human annotation of training samples is expensive, laborious, and sometimes challenging, especially for Natural Language Processing (NLP) tasks. To reduce the labeling cost and enhance the sample efficiency, Active Learning (AL) technique…

Computation and Language · Computer Science · 2024-01-17 · Xuesong Wang

Active label cleaning for improved dataset quality under resource constraints

Imperfections in data annotation, known as label noise, are detrimental to the training of machine learning models and have an often-overlooked confounding effect on the assessment of model performance. Nevertheless, employing experts to…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-25 · Melanie Bernhardt, Daniel C. Castro, Ryutaro Tanno, Anton Schwaighofer, Kerem C. Tezcan, Miguel Monteiro, Shruthi Bannur, Matthew Lungren, Aditya Nori, Ben Glocker, Javier Alvarez-Valle, Ozan Oktay

Sample selection for efficient image annotation

Supervised object detection has been proven to be successful in many benchmark datasets achieving human-level performances. However, acquiring a large amount of labeled image samples for supervised detection training is tedious,…

Computer Vision and Pattern Recognition · Computer Science · 2021-05-12 · Bishwo Adhikari, Esa Rahtu, Heikki Huttunen

Unlocking ImageNet's Multi-Object Nature: Automated Large-Scale Multilabel Annotation

The original ImageNet benchmark enforces a single-label assumption, despite many images depicting multiple objects. This leads to label noise and limits the richness of the learning signal. Multi-label annotations more accurately reflect…

Computer Vision and Pattern Recognition · Computer Science · 2026-03-09 · Junyu Chen, Md Yousuf Harun, Christopher Kanan