Related papers: Info-Coevolution: An Efficient Framework for Data …

Compute-Efficient Active Learning

Active learning, a powerful paradigm in machine learning, aims at reducing labeling costs by selecting the most informative samples from an unlabeled dataset. However, the traditional active learning process often demands extensive…

Machine Learning · Computer Science 2024-01-17 Gábor Németh, Tamás Matuszka

Optimizing Active Learning for Low Annotation Budgets

When we cannot assume a large amount of annotated data, active learning is a good strategy. It consists in learning a model on a small amount of annotated data (annotation budget) and in choosing the best set of points to annotate in…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Umang Aggarwal, Adrian Popescu, Céline Hudelot

Selective Annotation Makes Language Models Better Few-Shot Learners

Many recent approaches to natural language tasks are built on the remarkable abilities of large language models. Large language models can perform in-context learning, where they learn a new task from a few task demonstrations, without any…

Computation and Language · Computer Science 2022-09-07 Hongjin Su, Jungo Kasai, Chen Henry Wu, Weijia Shi, Tianlu Wang, Jiayi Xin, Rui Zhang, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu

Assisted Text Annotation Using Active Learning to Achieve High Quality with Little Effort

Large amounts of annotated data have become more important than ever, especially since the rise of deep learning techniques. However, manual annotations are costly. We propose a tool that enables researchers to create large, high-quality,…

Digital Libraries · Computer Science 2021-12-23 Franziska Weeber, Felix Hamborg, Karsten Donnay, Bela Gipp

From Random to Informed Data Selection: A Diversity-Based Approach to Optimize Human Annotation and Few-Shot Learning

A major challenge in Natural Language Processing is obtaining annotated data for supervised learning. An option is the use of crowdsourcing platforms for data annotation. However, crowdsourcing introduces issues related to the annotator's…

Computation and Language · Computer Science 2024-01-25 Alexandre Alcoforado, Thomas Palmeira Ferraz, Lucas Hideki Okamura, Israel Campos Fama, Arnold Moya Lavado, Bárbara Dias Bueno, Bruno Veloso, Anna Helena Reali Costa

Practical Annotation Strategies for Question Answering Datasets

Annotating datasets for question answering (QA) tasks is very costly, as it requires intensive manual labor and often domain-specific knowledge. Yet strategies for annotating QA datasets in a cost-effective manner are scarce. To provide a…

Computation and Language · Computer Science 2020-03-09 Bernhard Kratzwald, Xiang Yue, Huan Sun, Stefan Feuerriegel

A Unified Active Learning Framework for Annotating Graph Data with Application to Software Source Code Performance Prediction

Most machine learning and data analytics applications, including performance engineering in software systems, require a large number of annotations and labelled data, which might not be available in advance. Acquiring annotations often…

Software Engineering · Computer Science 2023-09-21 Peter Samoaa, Linus Aronsson, Antonio Longa, Philipp Leitner, Morteza Haghir Chehreghani

Suggestive Annotation: A Deep Active Learning Framework for Biomedical Image Segmentation

Image segmentation is a fundamental problem in biomedical image analysis. Recent advances in deep learning have achieved promising results on many biomedical image segmentation benchmarks. However, due to large variations in biomedical…

Computer Vision and Pattern Recognition · Computer Science 2017-06-16 Lin Yang, Yizhe Zhang, Jianxu Chen, Siyuan Zhang, Danny Z. Chen

Learning from Imperfect Annotations

Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective,…

Machine Learning · Computer Science 2020-04-08 Emmanouil Antonios Platanios, Maruan Al-Shedivat, Eric Xing, Tom Mitchell

Feedback-driven object detection and iterative model improvement

Automated object detection has become increasingly valuable across diverse applications, yet efficient, high-quality annotation remains a persistent challenge. In this paper, we present the development and evaluation of a platform designed…

Computer Vision and Pattern Recognition · Computer Science 2025-03-28 Sönke Tenckhoff, Mario Koddenbrock, Erik Rodner

Active, Continual Fine Tuning of Convolutional Neural Networks for Reducing Annotation Efforts

The splendid success of convolutional neural networks (CNNs) in computer vision is largely attributable to the availability of massive annotated datasets, such as ImageNet and Places. However, in medical imaging, it is challenging to create…

Machine Learning · Computer Science 2021-04-13 Zongwei Zhou, Jae Y. Shin, Suryakanth R. Gurudu, Michael B. Gotway, Jianming Liang

When Dynamic Data Selection Meets Data Augmentation

Dynamic data selection aims to accelerate training with lossless performance. However, reducing training data inherently limits data diversity, potentially hindering generalization. While data augmentation is widely used to enhance…

Machine Learning · Computer Science 2025-05-13 Suorong Yang, Peng Ye, Furao Shen, Dongzhan Zhou

Efficient Deep Representation Learning by Adaptive Latent Space Sampling

Supervised deep learning requires a large amount of training samples with annotations (e.g. label class for classification task, pixel- or voxel-wised label map for segmentation tasks), which are expensive and time-consuming to obtain.…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Yuanhan Mo, Shuo Wang, Chengliang Dai, Rui Zhou, Zhongzhao Teng, Wenjia Bai, Yike Guo

Annotation Error Detection: Analyzing the Past and Present for a More Coherent Future

Annotated data is an essential ingredient in natural language processing for training and evaluating machine learning models. It is therefore very desirable for the annotations to be of high quality. Recent work, however, has shown that…

Computation and Language · Computer Science 2022-09-27 Jan-Christoph Klie, Bonnie Webber, Iryna Gurevych

Annotation-efficient learning for OCT segmentation

Deep learning has been successfully applied to OCT segmentation. However, for data from different manufacturers and imaging protocols, and for different regions of interest (ROIs), it requires laborious and time-consuming data annotation…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Haoran Zhang, Jianlong Yang, Ce Zheng, Shiqing Zhao, Aili Zhang

Reducing Label Effort: Self-Supervised meets Active Learning

Active learning is a paradigm aimed at reducing the annotation effort by training the model on actively selected informative and/or representative samples. Another paradigm to reduce the annotation effort is self-training that learns from a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Javad Zolfaghari Bengar, Joost van de Weijer, Bartlomiej Twardowski, Bogdan Raducanu

Active Learning for Delineation of Curvilinear Structures

Many recent delineation techniques owe much of their increased effectiveness to path classification algorithms that make it possible to distinguish promising paths from others. The downside of this development is that they require annotated…

Computer Vision and Pattern Recognition · Computer Science 2015-12-03 Agata Mosinska, Raphael Sznitman, Przemysław Głowacki, Pascal Fua

Cross-Model Image Annotation Platform with Active Learning

We have seen significant leapfrog advancement in machine learning in recent decades. The central idea of machine learnability lies in constructing learning algorithms that learn from good data. The availability of more data being made…

Computer Vision and Pattern Recognition · Computer Science 2020-08-07 Ng Hui Xian Lynnette, Henry Ng Siong Hock, Nguwi Yok Yen

Deep Bayesian Self-Training

Supervised Deep Learning has been highly successful in recent years, achieving state-of-the-art results in most tasks. However, with the ongoing uptake of such methods in industrial applications, the requirement for large amounts of…

Computer Vision and Pattern Recognition · Computer Science 2019-07-18 Fabio De Sousa Ribeiro, Francesco Caliva, Mark Swainson, Kjartan Gudmundsson, Georgios Leontidis, Stefanos Kollias

Identifying Wrongly Predicted Samples: A Method for Active Learning

State-of-the-art machine learning models require access to a significant amount of annotated data in order to achieve the desired level of performance. While unlabelled data can be largely available and even abundant, the annotation process can…

Machine Learning · Computer Science 2020-10-15 Rahaf Aljundi, Nikolay Chumerin, Daniel Olmeda Reino