Related papers: Deep Active Learning with Augmentation-based Consi…

Regularising for invariance to data augmentation improves supervised learning

Data augmentation is used in machine learning to make the classifier invariant to label-preserving transformations. Usually this invariance is only encouraged implicitly by including a single augmented input during training. However,…

Machine Learning · Computer Science 2022-03-08 Aleksander Botev , Matthias Bauer , Soham De

Augmentation-induced Consistency Regularization for Classification

Deep neural networks have become popular in many supervised learning tasks, but they may suffer from overfitting when the training dataset is limited. To mitigate this, many researchers use data augmentation, which is a widely used and…

Machine Learning · Computer Science 2022-05-27 Jianhan Wu , Shijing Si , Jianzong Wang , Jing Xiao

Boosting Active Learning for Speech Recognition with Noisy Pseudo-labeled Samples

The cost of annotating transcriptions for large speech corpora becomes a bottleneck to maximally enjoy the potential capacity of deep neural network-based automatic speech recognition models. In this paper, we present a new training…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-06 Jihwan Bang , Heesu Kim , YoungJoon Yoo , Jung-Woo Ha

Active Learning Using Uncertainty Information

Many active learning methods belong to the retraining-based approaches, which select one unlabeled instance, add it to the training set with its possible labels, retrain the classification model, and evaluate the criteria that we base our…

Machine Learning · Statistics 2017-03-01 Yazhou Yang , Marco Loog

Data Augmentation Revisited: Rethinking the Distribution Gap between Clean and Augmented Data

Data augmentation has been widely applied as an effective methodology to improve generalization in particular when training deep neural networks. Recently, researchers proposed a few intensive data augmentation techniques, which indeed…

Machine Learning · Computer Science 2019-11-22 Zhuoxun He , Lingxi Xie , Xin Chen , Ya Zhang , Yanfeng Wang , Qi Tian

Unsupervised Data Augmentation for Consistency Training

Semi-supervised learning lately has shown much promise in improving deep learning models when labeled data is scarce. Common among recent approaches is the use of consistency training on a large amount of unlabeled data to constrain model…

Machine Learning · Computer Science 2020-11-06 Qizhe Xie , Zihang Dai , Eduard Hovy , Minh-Thang Luong , Quoc V. Le

The Effects of Regularization and Data Augmentation are Class Dependent

Regularization is a fundamental technique to prevent over-fitting and to improve generalization performances by constraining a model's complexity. Current Deep Networks heavily rely on regularizers such as Data-Augmentation (DA) or…

Machine Learning · Computer Science 2022-04-12 Randall Balestriero , Leon Bottou , Yann LeCun

Model Debiasing by Learnable Data Augmentation

Deep Neural Networks are well known for efficiently fitting training data, yet experiencing poor generalization capabilities whenever some kind of bias dominates over the actual task labels, resulting in models learning "shortcuts". In…

Machine Learning · Computer Science 2024-08-12 Pietro Morerio , Ruggero Ragonesi , Vittorio Murino

A Study on the Impact of Data Augmentation for Training Convolutional Neural Networks in the Presence of Noisy Labels

Label noise is common in large real-world datasets, and its presence harms the training process of deep neural networks. Although several works have focused on the training strategies to address this problem, there are few studies that…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Emeson Santana , Gustavo Carneiro , Filipe R. Cordeiro

MixPUL: Consistency-based Augmentation for Positive and Unlabeled Learning

Learning from positive and unlabeled data (PU learning) is prevalent in practical applications where only a couple of examples are positively labeled. Previous PU learning studies typically rely on existing samples such that the data…

Machine Learning · Computer Science 2020-04-21 Tong Wei , Feng Shi , Hai Wang , Wei-Wei Tu. Yu-Feng Li

Sample Efficiency of Data Augmentation Consistency Regularization

Data augmentation is popular in the training of large neural networks; currently, however, there is no clear theoretical comparison between different algorithmic choices on how to use augmented data. In this paper, we take a step in this…

Machine Learning · Computer Science 2022-06-17 Shuo Yang , Yijun Dong , Rachel Ward , Inderjit S. Dhillon , Sujay Sanghavi , Qi Lei

Revisiting Consistency Regularization for Semi-Supervised Learning

Consistency regularization is one of the most widely-used techniques for semi-supervised learning (SSL). Generally, the aim is to train a model that is invariant to various data augmentations. In this paper, we revisit this idea and find…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Yue Fan , Anna Kukleva , Bernt Schiele

Data augmentation instead of explicit regularization

Contrary to most machine learning models, modern deep artificial neural networks typically include multiple components that contribute to regularization. Despite the fact that some (explicit) regularization techniques, such as weight decay…

Computer Vision and Pattern Recognition · Computer Science 2020-11-13 Alex Hernández-García , Peter König

Affinity and Diversity: Quantifying Mechanisms of Data Augmentation

Though data augmentation has become a standard component of deep neural network training, the underlying mechanism behind the effectiveness of these techniques remains poorly understood. In practice, augmentation policies are often chosen…

Machine Learning · Computer Science 2020-06-08 Raphael Gontijo-Lopes , Sylvia J. Smullin , Ekin D. Cubuk , Ethan Dyer

Uncertainty-Aware Consistency Regularization for Cross-Domain Semantic Segmentation

Unsupervised domain adaptation (UDA) aims to adapt existing models of the source domain to a new target domain with only unlabeled data. Most existing methods suffer from noticeable negative transfer resulting from either the error-prone…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Qianyu Zhou , Zhengyang Feng , Qiqi Gu , Guangliang Cheng , Xuequan Lu , Jianping Shi , Lizhuang Ma

Improving Model Generalization by Agreement of Learned Representations from Data Augmentation

Data augmentation reduces the generalization error by forcing a model to learn invariant representations given different transformations of the input image. In computer vision, on top of the standard image processing functions, data…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Rowel Atienza

Robustness through Data Augmentation Loss Consistency

While deep learning through empirical risk minimization (ERM) has succeeded at achieving human-level performance at a variety of complex tasks, ERM is not robust to distribution shifts or adversarial attacks. Synthetic data augmentation…

Machine Learning · Computer Science 2023-01-25 Tianjian Huang , Shaunak Halbe , Chinnadhurai Sankar , Pooyan Amini , Satwik Kottur , Alborz Geramifard , Meisam Razaviyayn , Ahmad Beirami

Understanding the Detrimental Class-level Effects of Data Augmentation

Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Polina Kirichenko , Mark Ibrahim , Randall Balestriero , Diane Bouchacourt , Ramakrishna Vedantam , Hamed Firooz , Andrew Gordon Wilson

A Mathematics Framework of Artificial Shifted Population Risk and Its Further Understanding Related to Consistency Regularization

Data augmentation is an important technique in training deep neural networks as it enhances their ability to generalize and remain robust. While data augmentation is commonly used to expand the sample size and act as a consistency…

Machine Learning · Computer Science 2025-02-18 Xiliang Yang , Shenyang Deng , Shicong Liu , Yuanchi Suo , Wing. W. Y NG , Jianjun Zhang

RandAugment: Practical automated data augmentation with a reduced search space

Recent work has shown that data augmentation has the potential to significantly improve the generalization of deep learning models. Recently, automated augmentation strategies have led to state-of-the-art results in image classification and…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Ekin D. Cubuk , Barret Zoph , Jonathon Shlens , Quoc V. Le