Related papers: A Unified Active Learning Framework for Annotating…

Active Learning of Driving Scenario Trajectories

Annotated driving scenario trajectories are crucial for verification and validation of autonomous vehicles. However, annotation of such trajectories based only on explicit rules (i.e. knowledge-based methods) may be prone to errors, such as…

Machine Learning · Computer Science 2023-01-02 Sanna Jarl , Linus Aronsson , Sadegh Rahrovani , Morteza Haghir Chehreghani

Robust Assignment of Labels for Active Learning with Sparse and Noisy Annotations

Supervised classification algorithms are used to solve a growing number of real-life problems around the globe. Their performance is strictly connected with the quality of labels used in training. Unfortunately, acquiring good-quality…

Machine Learning · Computer Science 2024-07-08 Daniel Kałuża , Andrzej Janusz , Dominik Ślęzak

Boosting Active Learning via Improving Test Performance

Central to active learning (AL) is what data should be selected for annotation. Existing works attempt to select highly uncertain or informative data for annotation. Nevertheless, it remains unclear how selected data impacts the test…

Machine Learning · Computer Science 2022-01-25 Tianyang Wang , Xingjian Li , Pengkun Yang , Guosheng Hu , Xiangrui Zeng , Siyu Huang , Cheng-Zhong Xu , Min Xu

Reducing Label Effort: Self-Supervised meets Active Learning

Active learning is a paradigm aimed at reducing the annotation effort by training the model on actively selected informative and/or representative samples. Another paradigm to reduce the annotation effort is self-training that learns from a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Javad Zolfaghari Bengar , Joost van de Weijer , Bartlomiej Twardowski , Bogdan Raducanu

A Graph-Based Approach for Active Learning in Regression

Active learning aims to reduce labeling efforts by selectively asking humans to annotate the most important data points from an unlabeled pool and is an example of human-machine interaction. Though active learning has been extensively…

Machine Learning · Computer Science 2020-01-31 Hongjing Zhang , S. S. Ravi , Ian Davidson

Active Testing: An Efficient and Robust Framework for Estimating Accuracy

Much recent work on visual recognition aims to scale up learning to massive, noisily-annotated datasets. We address the problem of scaling- up the evaluation of such models to large-scale datasets with noisy labels. Current protocols for…

Computer Vision and Pattern Recognition · Computer Science 2018-07-03 Phuc Nguyen , Deva Ramanan , Charless Fowlkes

An Adaptive Supervision Framework for Active Learning in Object Detection

Active learning approaches in computer vision generally involve querying strong labels for data. However, previous works have shown that weak supervision can be effective in training models for vision tasks while greatly reducing annotation…

Computer Vision and Pattern Recognition · Computer Science 2019-10-16 Sai Vikas Desai , Akshay L Chandra , Wei Guo , Seishi Ninomiya , Vineeth N Balasubramanian

Identifying Wrongly Predicted Samples: A Method for Active Learning

State-of-the-art machine learning models require access to significant amount of annotated data in order to achieve the desired level of performance. While unlabelled data can be largely available and even abundant, annotation process can…

Machine Learning · Computer Science 2020-10-15 Rahaf Aljundi , Nikolay Chumerin , Daniel Olmeda Reino

Active Scene Learning

Sketch recognition allows natural and efficient interaction in pen-based interfaces. A key obstacle to building accurate sketch recognizers has been the difficulty of creating large amounts of annotated training data. Several authors have…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Erelcan Yanik , Tevfik Metin Sezgin

Optimizing Active Learning for Low Annotation Budgets

When we can not assume a large amount of annotated data , active learning is a good strategy. It consists in learning a model on a small amount of annotated data (annotation budget) and in choosing the best set of points to annotate in…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Umang Aggarwal , Adrian Popescu , Céline Hudelot

Comparative Code Structure Analysis using Deep Learning for Performance Prediction

Performance analysis has always been an afterthought during the application development process, focusing on application correctness first. The learning curve of the existing static and dynamic analysis tools are steep, which requires…

Machine Learning · Computer Science 2021-04-23 Nathan Pinnow , Tarek Ramadan , Tanzima Z. Islam , Chase Phelps , Jayaraman J. Thiagarajan

Precise Learning of Source Code Contextual Semantics via Hierarchical Dependence Structure and Graph Attention Networks

Deep learning is being used extensively in a variety of software engineering tasks, e.g., program classification and defect prediction. Although the technique eliminates the required process of feature engineering, the construction of…

Software Engineering · Computer Science 2021-11-24 Zhehao Zhao , Bo Yang , Ge Li , Huai Liu , Zhi Jin

Data-efficient Active Learning for Structured Prediction with Partial Annotation and Self-Training

In this work we propose a pragmatic method that reduces the annotation cost for structured label spaces using active learning. Our approach leverages partial annotation, which reduces labeling costs for structured outputs by selecting only…

Computation and Language · Computer Science 2023-10-20 Zhisong Zhang , Emma Strubell , Eduard Hovy

Learning Fast Matching Models from Weak Annotations

This paper proposes a novel training scheme for fast matching models in Search Ads, which is motivated by the real challenges in model training. The first challenge stems from the pursuit of high throughput, which prohibits the deployment…

Information Retrieval · Computer Science 2019-04-23 Xue Li , Zhipeng Luo , Hao Sun , Jianjin Zhang , Weihao Han , Xianqi Chu , Liangjie Zhang , Qi Zhang

An Efficient Active Learning Pipeline for Legal Text Classification

Active Learning (AL) is a powerful tool for learning with less labeled data, in particular, for specialized domains, like legal documents, where unlabeled data is abundant, but the annotation requires domain expertise and is thus expensive.…

Computation and Language · Computer Science 2022-11-16 Sepideh Mamooler , Rémi Lebret , Stéphane Massonnet , Karl Aberer

Multi-utility Learning: Structured-output Learning with Multiple Annotation-specific Loss Functions

Structured-output learning is a challenging problem; particularly so because of the difficulty in obtaining large datasets of fully labelled instances for training. In this paper we try to overcome this difficulty by presenting a…

Computer Vision and Pattern Recognition · Computer Science 2014-06-24 Roman Shapovalov , Dmitry Vetrov , Anton Osokin , Pushmeet Kohli

An Analysis of Active Learning Algorithms using Real-World Crowd-sourced Text Annotations

Active learning algorithms automatically identify the most informative samples from large amounts of unlabeled data and tremendously reduce human annotation effort in inducing a machine learning model. In a conventional active learning…

Machine Learning · Computer Science 2026-04-28 Varun Totakura , Ankita Singh , Yushun Dong , Shayok Chakraborty

Deep Active Learning with Budget Annotation

Digital data collected over the decades and data currently being produced with use of information technology is vastly the unlabeled data or data without description. The unlabeled data is relatively easy to acquire but expensive to label…

Machine Learning · Computer Science 2022-08-02 Kinyua Gikunda

ALE: A Simulation-Based Active Learning Evaluation Framework for the Parameter-Driven Comparison of Query Strategies for NLP

Supervised machine learning and deep learning require a large amount of labeled data, which data scientists obtain in a manual, and time-consuming annotation process. To mitigate this challenge, Active Learning (AL) proposes promising data…

Computation and Language · Computer Science 2023-08-08 Philipp Kohl , Nils Freyer , Yoka Krämer , Henri Werth , Steffen Wolf , Bodo Kraft , Matthias Meinecke , Albert Zündorf

LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation

Successfully training a deep neural network demands a huge corpus of labeled data. However, each label only provides limited information to learn from and collecting the requisite number of labels involves massive human effort. In this…

Computation and Language · Computer Science 2020-04-17 Dong-Ho Lee , Rahul Khanna , Bill Yuchen Lin , Jamin Chen , Seyeon Lee , Qinyuan Ye , Elizabeth Boschee , Leonardo Neves , Xiang Ren