Related papers: Task Programming: Learning Data Efficient Behavior…

TADA: Efficient Task-Agnostic Domain Adaptation for Transformers

Intermediate training of pre-trained transformer-based language models on domain-specific data leads to substantial gains for downstream tasks. To increase efficiency and prevent catastrophic forgetting alleviated from full domain-adaptive…

Computation and Language · Computer Science 2023-05-23 Chia-Chien Hung , Lukas Lange , Jannik Strötgen

Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems

Data annotated by humans is a source of knowledge by describing the peculiarities of the problem and therefore fueling the decision process of the trained model. Unfortunately, the annotation process for subjective natural language…

Computation and Language · Computer Science 2023-12-14 Kamil Kanclerz , Julita Bielaniewicz , Marcin Gruza , Jan Kocon , Stanisław Woźniak , Przemysław Kazienko

Labeling Where Adapting Fails: Cross-Domain Semantic Segmentation with Point Supervision via Active Selection

Training models dedicated to semantic segmentation requires a large amount of pixel-wise annotated data. Due to their costly nature, these annotations might not be available for the task at hand. To alleviate this problem, unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Fei Pan , Francois Rameau , Junsik Kim , In So Kweon

Analysis of Automatic Annotation Suggestions for Hard Discourse-Level Tasks in Expert Domains

Many complex discourse-level tasks can aid domain experts in their work but require costly expert annotations for data creation. To speed up and ease annotations, we investigate the viability of automatically generated annotation…

Computation and Language · Computer Science 2019-06-07 Claudia Schulz , Christian M. Meyer , Jan Kiesewetter , Michael Sailer , Elisabeth Bauer , Martin R. Fischer , Frank Fischer , Iryna Gurevych

A Unified Active Learning Framework for Annotating Graph Data with Application to Software Source Code Performance Prediction

Most machine learning and data analytics applications, including performance engineering in software systems, require a large number of annotations and labelled data, which might not be available in advance. Acquiring annotations often…

Software Engineering · Computer Science 2023-09-21 Peter Samoaa , Linus Aronsson , Antonio Longa , Philipp Leitner , Morteza Haghir Chehreghani

Action Segmentation with Mixed Temporal Domain Adaptation

The main progress for action segmentation comes from densely-annotated data for fully-supervised learning. Since manual annotation for frame-level actions is time-consuming and challenging, we propose to exploit auxiliary unlabeled videos,…

Computer Vision and Pattern Recognition · Computer Science 2021-04-19 Min-Hung Chen , Baopu Li , Yingze Bao , Ghassan AlRegib

Iterative Pseudo-Labeling with Deep Feature Annotation and Confidence-Based Sampling

Training deep neural networks is challenging when large and annotated datasets are unavailable. Extensive manual annotation of data samples is time-consuming, expensive, and error-prone, notably when it needs to be done by experts. To…

Machine Learning · Computer Science 2021-09-08 Barbara C Benato , Alexandru C Telea , Alexandre X Falcão

ESA: Annotation-Efficient Active Learning for Semantic Segmentation

Active learning enhances annotation efficiency by selecting the most revealing samples for labeling, thereby reducing reliance on extensive human input. Previous methods in semantic segmentation have centered on individual pixels or small…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Jinchao Ge , Zeyu Zhang , Minh Hieu Phan , Bowen Zhang , Akide Liu , Yang Zhao , Shuwen Zhao

AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions

This paper addresses text recognition for domains with limited manual annotations by a simple self-training strategy. Our approach should reduce human annotation effort when target domain data is plentiful, such as when transcribing a…

Computer Vision and Pattern Recognition · Computer Science 2022-01-26 Martin Kišš , Karel Beneš , Michal Hradiš

Automated Behavioral Analysis Using Instance Segmentation

Animal behavior analysis plays a crucial role in various fields, such as life science and biomedical research. However, the scarcity of available data and the high cost associated with obtaining a large number of labeled datasets pose…

Computer Vision and Pattern Recognition · Computer Science 2023-12-14 Chen Yang , Jeremy Forest , Matthew Einhorn , Thomas A. Cleland

Reducing Label Effort: Self-Supervised meets Active Learning

Active learning is a paradigm aimed at reducing the annotation effort by training the model on actively selected informative and/or representative samples. Another paradigm to reduce the annotation effort is self-training that learns from a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Javad Zolfaghari Bengar , Joost van de Weijer , Bartlomiej Twardowski , Bogdan Raducanu

Practical Annotation Strategies for Question Answering Datasets

Annotating datasets for question answering (QA) tasks is very costly, as it requires intensive manual labor and often domain-specific knowledge. Yet strategies for annotating QA datasets in a cost-effective manner are scarce. To provide a…

Computation and Language · Computer Science 2020-03-09 Bernhard Kratzwald , Xiang Yue , Huan Sun , Stefan Feuerriegel

Incorporating Human Domain Knowledge in 3D LiDAR-based Semantic Segmentation

This work studies semantic segmentation using 3D LiDAR data. Popular deep learning methods applied for this task require a large number of manual annotations to train the parameters. We propose a new method that makes full use of the…

Robotics · Computer Science 2019-05-24 Jilin Mei , Huijing Zhao

A Survey on Machine Learning Techniques for Auto Labeling of Video, Audio, and Text Data

Machine learning has been utilized to perform tasks in many different domains such as classification, object detection, image segmentation and natural language analysis. Data labeling has always been one of the most important tasks in…

Machine Learning · Computer Science 2021-09-09 Shikun Zhang , Omid Jafari , Parth Nagarkar

Deep Bayesian Self-Training

Supervised Deep Learning has been highly successful in recent years, achieving state-of-the-art results in most tasks. However, with the ongoing uptake of such methods in industrial applications, the requirement for large amounts of…

Computer Vision and Pattern Recognition · Computer Science 2019-07-18 Fabio De Sousa Ribeiro , Francesco Caliva , Mark Swainson , Kjartan Gudmundsson , Georgios Leontidis , Stefanos Kollias

Active Learning for Domain Adaptation: An Energy-Based Approach

Unsupervised domain adaptation has recently emerged as an effective paradigm for generalizing deep neural networks to new target domains. However, there is still enormous potential to be tapped to reach the fully supervised performance. In…

Machine Learning · Computer Science 2022-03-10 Binhui Xie , Longhui Yuan , Shuang Li , Chi Harold Liu , Xinjing Cheng , Guoren Wang

Interpreting Expert Annotation Differences in Animal Behavior

Hand-annotated data can vary due to factors such as subjective differences, intra-rater variability, and differing annotator expertise. We study annotations from different experts who labelled the same behavior classes on a set of animal…

Machine Learning · Computer Science 2021-06-14 Megan Tjandrasuwita , Jennifer J. Sun , Ann Kennedy , Swarat Chaudhuri , Yisong Yue

Beyond Task-Driven Features for Object Detection

Task-driven features learned by modern object detectors optimize end task loss yet often capture shortcut correlations that fail to reflect underlying annotation structure. Such representations limit transfer, interpretability, and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Meilun Zhou , Alina Zare

Effortless Active Labeling for Long-Term Test-Time Adaptation

Long-term test-time adaptation (TTA) is a challenging task due to error accumulation. Recent approaches tackle this issue by actively labeling a small proportion of samples in each batch, yet the annotation burden quickly grows as the batch…

Computer Vision and Pattern Recognition · Computer Science 2025-03-20 Guowei Wang , Changxing Ding

Label Efficient Learning of Transferable Representations across Domains and Tasks

We propose a framework that learns a representation transferable across different domains and tasks in a label efficient manner. Our approach battles domain shift with a domain adversarial loss, and generalizes the embedding to novel task…

Machine Learning · Statistics 2017-12-04 Zelun Luo , Yuliang Zou , Judy Hoffman , Li Fei-Fei