Related papers: Deep Bayesian Self-Training

Bayesian Methods for Semi-supervised Text Annotation

Human annotations are an important source of information in the development of natural language understanding approaches. Because annotators under productivity pressure can assign different labels to a given text, the quality of produced…

Computation and Language · Computer Science 2020-10-29 Kristian Miok, Gregor Pirs, Marko Robnik-Sikonja

Bayesian Active Learning for Sim-to-Real Robotic Perception

While learning from synthetic training data has recently gained increased attention, in real-world robotic applications, there are still performance deficiencies due to the so-called Sim-to-Real gap. In practice, this gap is hard to…

Robotics · Computer Science 2022-08-02 Jianxiang Feng, Jongseok Lee, Maximilian Durner, Rudolph Triebel

Iterative Pseudo-Labeling with Deep Feature Annotation and Confidence-Based Sampling

Training deep neural networks is challenging when large and annotated datasets are unavailable. Extensive manual annotation of data samples is time-consuming, expensive, and error-prone, notably when it needs to be done by experts. To…

Machine Learning · Computer Science 2021-09-08 Barbara C Benato, Alexandru C Telea, Alexandre X Falcão

A Bayesian Data Augmentation Approach for Learning Deep Models

Data augmentation is an essential part of the training process applied to deep learning models. The motivation is that a robust training process for deep learning models depends on large annotated datasets, which are expensive to be…

Computer Vision and Pattern Recognition · Computer Science 2017-10-31 Toan Tran, Trung Pham, Gustavo Carneiro, Lyle Palmer, Ian Reid

Deep Bayesian Active Learning, A Brief Survey on Recent Advances

Active learning frameworks offer efficient data annotation without notable accuracy degradation. In other words, active learning starts training the model with a small amount of labeled data while exploring the space of unlabeled data in…

Machine Learning · Computer Science 2022-04-22 Salman Mohamadi, Hamidreza Amindavar

Combining Probabilistic Logic and Deep Learning for Self-Supervised Learning

Deep learning has proven effective for various application tasks, but its applicability is limited by the reliance on annotated examples. Self-supervised learning has emerged as a promising direction to alleviate the supervision bottleneck,…

Machine Learning · Computer Science 2021-07-28 Hoifung Poon, Hai Wang, Hunter Lang

Robust Assignment of Labels for Active Learning with Sparse and Noisy Annotations

Supervised classification algorithms are used to solve a growing number of real-life problems around the globe. Their performance is strictly connected with the quality of labels used in training. Unfortunately, acquiring good-quality…

Machine Learning · Computer Science 2024-07-08 Daniel Kałuża, Andrzej Janusz, Dominik Ślęzak

Uncertainty-aware Self-training for Text Classification with Few Labels

The recent success of large-scale pre-trained language models crucially hinges on fine-tuning them on large amounts of labeled data for the downstream task, which is typically expensive to acquire. In this work, we study self-training as one of…

Computation and Language · Computer Science 2020-06-30 Subhabrata Mukherjee, Ahmed Hassan Awadallah

Auto-Annotation Quality Prediction for Semi-Supervised Learning with Ensembles

Auto-annotation by ensemble of models is an efficient method of learning on unlabeled data. Wrong or inaccurate annotations generated by the ensemble may lead to performance degradation of the trained model. To deal with this problem we…

Computer Vision and Pattern Recognition · Computer Science 2024-03-14 Dror Simon, Miriam Farber, Roman Goldenberg

Active Learning for Sequence Tagging with Deep Pre-trained Models and Bayesian Uncertainty Estimates

Annotating training data for sequence tagging of texts is usually very time-consuming. Recent advances in transfer learning for natural language processing in conjunction with active learning open the possibility to significantly reduce the…

Computation and Language · Computer Science 2021-02-19 Artem Shelmanov, Dmitri Puzyrev, Lyubov Kupriyanova, Denis Belyakov, Daniil Larionov, Nikita Khromov, Olga Kozlova, Ekaterina Artemova, Dmitry V. Dylov, Alexander Panchenko

Learning from Imperfect Annotations

Many machine learning systems today are trained on large amounts of human-annotated data. Data annotation tasks that require a high level of competency make data acquisition expensive, while the resulting labels are often subjective,…

Machine Learning · Computer Science 2020-04-08 Emmanouil Antonios Platanios, Maruan Al-Shedivat, Eric Xing, Tom Mitchell

Self-supervised Semi-supervised Learning for Data Labeling and Quality Evaluation

As the adoption of deep learning techniques in industrial applications grows with increasing speed and scale, successful deployment of deep learning models often hinges on the availability, volume, and quality of annotated data. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Haoping Bai, Meng Cao, Ping Huang, Jiulong Shan

Addressing Training Bias via Automated Image Annotation

Building accurate DNN models requires training on large labeled, context-specific datasets, especially those matching the target scenario. We believe advances in wireless localization, working in unison with cameras, can produce automated…

Computer Vision and Pattern Recognition · Computer Science 2018-10-11 Zhujun Xiao, Yanzi Zhu, Yuxin Chen, Ben Y. Zhao, Junchen Jiang, Haitao Zheng

Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection

Data collection from manual labeling provides domain-specific and task-aligned supervision for data-driven approaches, and a critical mass of well-annotated resources is required to achieve reasonable performance in natural language…

Computation and Language · Computer Science 2023-11-09 Zhengyuan Liu, Hai Leong Chieu, Nancy F. Chen

DST: Data Selection and joint Training for Learning with Noisy Labels

Training a deep neural network heavily relies on a large amount of training data with accurate annotations. To alleviate this problem, various methods have been proposed to annotate the data automatically. However, automatically generating…

Computer Vision and Pattern Recognition · Computer Science 2021-03-02 Yi Wei, Xue Mei, Xin Liu, Pengxiang Xu

Model Debiasing by Learnable Data Augmentation

Deep Neural Networks are well known for efficiently fitting training data, yet experiencing poor generalization capabilities whenever some kind of bias dominates over the actual task labels, resulting in models learning "shortcuts". In…

Machine Learning · Computer Science 2024-08-12 Pietro Morerio, Ruggero Ragonesi, Vittorio Murino

Annotation Error Detection: Analyzing the Past and Present for a More Coherent Future

Annotated data is an essential ingredient in natural language processing for training and evaluating machine learning models. It is therefore very desirable for the annotations to be of high quality. Recent work, however, has shown that…

Computation and Language · Computer Science 2022-09-27 Jan-Christoph Klie, Bonnie Webber, Iryna Gurevych

Deep Active Learning with Budget Annotation

Digital data collected over the decades, and data currently being produced with the use of information technology, is largely unlabeled or lacks description. Unlabeled data is relatively easy to acquire but expensive to label…

Machine Learning · Computer Science 2022-08-02 Kinyua Gikunda

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets

Data is the engine of modern computer vision, which necessitates collecting large-scale datasets. This is expensive, and guaranteeing the quality of the labels is a major challenge. In this paper, we investigate efficient annotation…

Computer Vision and Pattern Recognition · Computer Science 2021-04-27 Yuan-Hong Liao, Amlan Kar, Sanja Fidler

Efficient Deep Representation Learning by Adaptive Latent Space Sampling

Supervised deep learning requires a large number of annotated training samples (e.g. class labels for classification tasks, pixel- or voxel-wise label maps for segmentation tasks), which are expensive and time-consuming to obtain.…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Yuanhan Mo, Shuo Wang, Chengliang Dai, Rui Zhou, Zhongzhao Teng, Wenjia Bai, Yike Guo