Related papers: Multi-granular Software Annotation using File-leve…

AutoFL: A Tool for Automatic Multi-granular Labelling of Software Repositories

Software comprehension, particularly of new code bases, is time-consuming for developers, especially in large projects with multiple functionalities spanning various domains. One strategy to reduce this effort involves annotating files with…

Software Engineering · Computer Science · 2025-04-28 · Cezar Sas, Andrea Capiluppi

Image Annotation using Multi-Layer Sparse Coding

Automatic annotation of images with descriptive words is a challenging problem with vast applications in the areas of image search and retrieval. This problem can be viewed as a label-assignment problem by a classifier dealing with a very…

Computer Vision and Pattern Recognition · Computer Science · 2017-05-09 · Amara Tariq, Hassan Foroosh

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets

Data is the engine of modern computer vision, which necessitates collecting large-scale datasets. This is expensive, and guaranteeing the quality of the labels is a major challenge. In this paper, we investigate efficient annotation…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-27 · Yuan-Hong Liao, Amlan Kar, Sanja Fidler

SciAnnotate: A Tool for Integrating Weak Labeling Sources for Sequence Labeling

Weak labeling is a popular weak supervision strategy for Named Entity Recognition (NER) tasks, with the goal of reducing the necessity for hand-crafted annotations. Although there are numerous remarkable annotation tools for NER labeling,…

Computation and Language · Computer Science · 2025-07-04 · Mengyang Liu, Haozheng Luo, Leonard Thong, Yinghao Li, Chao Zhang, Le Song

How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model

Deep neural networks (DNNs) have demonstrated exceptional performance across various image segmentation tasks. However, the process of preparing datasets for training segmentation DNNs is both labor-intensive and costly, as it typically…

Computer Vision and Pattern Recognition · Computer Science · 2025-01-24 · Yixin Zhang, Shen Zhao, Hanxue Gu, Maciej A. Mazurowski

Weak-Annotation of HAR Datasets using Vision Foundation Models

As wearable-based data annotation remains, to date, a tedious, time-consuming task requiring researchers to dedicate substantial time, benchmark datasets within the field of Human Activity Recognition lack richness and size compared to…

Human-Computer Interaction · Computer Science · 2024-08-12 · Marius Bock, Kristof Van Laerhoven, Michael Moeller

A Unified Active Learning Framework for Annotating Graph Data with Application to Software Source Code Performance Prediction

Most machine learning and data analytics applications, including performance engineering in software systems, require a large number of annotations and labelled data, which might not be available in advance. Acquiring annotations often…

Software Engineering · Computer Science · 2023-09-21 · Peter Samoaa, Linus Aronsson, Antonio Longa, Philipp Leitner, Morteza Haghir Chehreghani

Full or Weak annotations? An adaptive strategy for budget-constrained annotation campaigns

Annotating new datasets for machine learning tasks is tedious, time-consuming, and costly. For segmentation applications, the burden is particularly high as manual delineations of relevant image content are often extremely expensive or can…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-22 · Javier Gamazo Tejero, Martin S. Zinkernagel, Sebastian Wolf, Raphael Sznitman, Pablo Márquez Neila

A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks

Human annotations are vital to supervised learning, yet annotators often disagree on the correct label, especially as annotation tasks increase in complexity. A strategy to improve label quality is to ask multiple annotators to label the…

Machine Learning · Computer Science · 2023-12-22 · Alexander Braylan, Madalyn Marabella, Omar Alonso, Matthew Lease

Label Assistant: A Workflow for Assisted Data Annotation in Image Segmentation Tasks

Recent research in the field of computer vision strongly focuses on deep learning architectures to tackle image processing problems. Deep neural networks are often considered in complex image processing scenarios since traditional computer…

Computer Vision and Pattern Recognition · Computer Science · 2021-11-30 · Marcel P. Schilling, Luca Rettenberger, Friedrich Münke, Haijun Cui, Anna A. Popova, Pavel A. Levkin, Ralf Mikut, Markus Reischl

WeaNF: Weak Supervision with Normalizing Flows

A popular approach to decrease the need for costly manual annotation of large data sets is weak supervision, which introduces problems of noisy labels, coverage and bias. Methods for overcoming these problems have either relied on…

Computation and Language · Computer Science · 2022-05-03 · Andreas Stephan, Benjamin Roth

Feedback-driven object detection and iterative model improvement

Automated object detection has become increasingly valuable across diverse applications, yet efficient, high-quality annotation remains a persistent challenge. In this paper, we present the development and evaluation of a platform designed…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-28 · Sönke Tenckhoff, Mario Koddenbrock, Erik Rodner

Learning from Imperfect Annotations

Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective,…

Machine Learning · Computer Science · 2020-04-08 · Emmanouil Antonios Platanios, Maruan Al-Shedivat, Eric Xing, Tom Mitchell

Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data

Existing text classification methods mainly focus on a fixed label set, whereas many real-world applications require extending to new fine-grained classes as the number of samples per label increases. To accommodate such requirements, we…

Computation and Language · Computer Science · 2021-09-23 · Dheeraj Mekala, Varun Gangal, Jingbo Shang

The Weak Supervision Landscape

Many ways of annotating a dataset for machine learning classification tasks that go beyond the usual class labels exist in practice. These are of interest as they can simplify or facilitate the collection of annotations, while not greatly…

Machine Learning · Computer Science · 2022-03-31 · Rafael Poyiadzi, Daniel Bacaicoa-Barber, Jesus Cid-Sueiro, Miquel Perello-Nieto, Peter Flach, Raul Santos-Rodriguez

Multi-label Classification with Partial Annotations using Class-aware Selective Loss

Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-22 · Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning

Rich high-quality annotated data is critical for semantic segmentation learning, yet acquiring dense and pixel-wise ground-truth is both labor- and time-consuming. Coarse annotations (e.g., scribbles, coarse polygons) offer an economical…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-29 · Yadan Luo, Ziwei Wang, Zi Huang, Yang Yang, Cong Zhao

Rethinking Crowdsourcing Annotation: Partial Annotation with Salient Labels for Multi-Label Image Classification

Annotated images are required for both supervised model training and evaluation in image classification. Manually annotating images is arduous and expensive, especially for multi-labeled images. A recent trend for conducting such laboursome…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-07 · Jianzhe Lin, Tianze Yu, Z. Jane Wang

Block Annotation: Better Image Annotation for Semantic Segmentation with Sub-Image Decomposition

Image datasets with high-quality pixel-level annotations are valuable for semantic segmentation: labelling every pixel in an image ensures that rare classes and small objects are annotated. However, full-image annotations are expensive,…

Computer Vision and Pattern Recognition · Computer Science · 2020-02-19 · Hubert Lin, Paul Upchurch, Kavita Bala

Employing Weak Annotations for Medical Image Analysis Problems

To efficiently establish training databases for machine learning methods, collaborative and crowdsourcing platforms have been investigated to collectively tackle the annotation effort. However, when this concept is ported to the medical…

Computer Vision and Pattern Recognition · Computer Science · 2017-08-22 · Martin Rajchl, Lisa M. Koch, Christian Ledig, Jonathan Passerat-Palmbach, Kazunari Misawa, Kensaku Mori, Daniel Rueckert