Related papers: Regularising for invariance to data augmentation i…

Deep Active Learning with Augmentation-based Consistency Estimation

In active learning, the focus is mainly on the selection strategy of unlabeled data for enhancing the generalization capability of the next learning cycle. For this, various uncertainty measurement methods have been proposed. On the other…

Computer Vision and Pattern Recognition · Computer Science 2020-11-06 SeulGi Hong, Heonjin Ha, Junmo Kim, Min-Kook Choi

Implicit Regularization in Deep Learning

In an attempt to better understand generalization in deep learning, we study several possible explanations. We show that implicit regularization induced by the optimization method is playing a key role in generalization and success of deep…

Machine Learning · Computer Science 2017-09-11 Behnam Neyshabur

Data Augmentation and Regularization for Learning Group Equivariance

In many machine learning tasks, known symmetries can be used as an inductive bias to improve model performance. In this paper, we consider learning group equivariance through training with data augmentation. We summarize results from a…

Machine Learning · Statistics 2025-02-11 Oskar Nordenfors, Axel Flinth

Understanding Learning Invariance in Deep Linear Networks

Equivariant and invariant machine learning models exploit symmetries and structural patterns in data to improve sample efficiency. While empirical studies suggest that data-driven methods such as regularization and data augmentation can…

Machine Learning · Statistics 2025-06-17 Hao Duan, Guido Montúfar

Revisiting Explicit Regularization in Neural Networks for Well-Calibrated Predictive Uncertainty

From the statistical learning perspective, complexity control via explicit regularization is a necessity for improving the generalization of over-parameterized models. However, the impressive generalization performance of neural networks…

Machine Learning · Computer Science 2021-02-09 Taejong Joo, Uijung Chung

Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations

Data augmentation is one of the most popular techniques for improving the robustness of neural networks. In addition to directly training the model with original samples and augmented samples, a torrent of methods regularizing the distance…

Machine Learning · Computer Science 2020-11-30 Haohan Wang, Zeyi Huang, Xindi Wu, Eric P. Xing

Data Augmentation Revisited: Rethinking the Distribution Gap between Clean and Augmented Data

Data augmentation has been widely applied as an effective methodology to improve generalization in particular when training deep neural networks. Recently, researchers proposed a few intensive data augmentation techniques, which indeed…

Machine Learning · Computer Science 2019-11-22 Zhuoxun He, Lingxi Xie, Xin Chen, Ya Zhang, Yanfeng Wang, Qi Tian

Soft Augmentation for Image Classification

Modern neural networks are over-parameterized and thus rely on strong regularization such as data augmentation and weight decay to reduce overfitting and improve generalization. The dominant form of data augmentation applies invariant…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Yang Liu, Shen Yan, Laura Leal-Taixé, James Hays, Deva Ramanan

Data augmentation instead of explicit regularization

Contrary to most machine learning models, modern deep artificial neural networks typically include multiple components that contribute to regularization. Despite the fact that some (explicit) regularization techniques, such as weight decay…

Computer Vision and Pattern Recognition · Computer Science 2020-11-13 Alex Hernández-García, Peter König

Automatic Data Augmentation via Invariance-Constrained Learning

Underlying data structures, such as symmetries or invariances to transformations, are often exploited to improve the solution of learning tasks. However, embedding these properties in models or learning algorithms can be challenging and…

Machine Learning · Computer Science 2023-09-19 Ignacio Hounie, Luiz F. O. Chamon, Alejandro Ribeiro

The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective

Data augmentation (DA) is a powerful workhorse for bolstering performance in modern machine learning. Specific augmentations like translations and scaling in computer vision are traditionally believed to improve generalization by generating…

Machine Learning · Computer Science 2024-02-29 Chi-Heng Lin, Chiraag Kaushik, Eva L. Dyer, Vidya Muthukumar

On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity

Most complex machine learning and modelling techniques are prone to over-fitting and may subsequently generalise poorly to future data. Artificial neural networks are no different in this regard and, despite having a level of implicit…

Machine Learning · Statistics 2022-05-26 Vincent Szolnoky, Viktor Andersson, Balazs Kulcsar, Rebecka Jörnsten

Learning robust visual representations using data augmentation invariance

Deep convolutional neural networks trained for image object categorization have shown remarkable similarities with representations found across the primate ventral visual stream. Yet, artificial and biological networks still exhibit…

Computer Vision and Pattern Recognition · Computer Science 2019-06-12 Alex Hernández-García, Peter König, Tim C. Kietzmann

Sample Efficiency of Data Augmentation Consistency Regularization

Data augmentation is popular in the training of large neural networks; currently, however, there is no clear theoretical comparison between different algorithmic choices on how to use augmented data. In this paper, we take a step in this…

Machine Learning · Computer Science 2022-06-17 Shuo Yang, Yijun Dong, Rachel Ward, Inderjit S. Dhillon, Sujay Sanghavi, Qi Lei

Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization

Overfitting is one of the most critical challenges in deep neural networks, and there are various types of regularization methods to improve generalization performance. Injecting noises to hidden units during training, e.g., dropout, is…

Machine Learning · Computer Science 2017-11-10 Hyeonwoo Noh, Tackgeun You, Jonghwan Mun, Bohyung Han

On the Benefits of Invariance in Neural Networks

Many real world data analysis problems exhibit invariant structure, and models that take advantage of this structure have shown impressive empirical performance, particularly in deep learning. While the literature contains a variety of…

Machine Learning · Computer Science 2020-05-04 Clare Lyle, Mark van der Wilk, Marta Kwiatkowska, Yarin Gal, Benjamin Bloem-Reddy

Generative Hints

Data augmentation is widely used in vision to introduce variation and mitigate overfitting, by enabling models to learn invariant properties. However, augmentation only indirectly captures these properties and does not explicitly constrain…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Andy Dimnaku, Abdullah Yusuf Kavranoglu, Yaser Abu-Mostafa

Learning to Transform for Generalizable Instance-wise Invariance

Computer vision research has long aimed to build systems that are robust to spatial transformations found in natural data. Traditionally, this is done using data augmentation or hard-coding invariances into the architecture. However, too…

Computer Vision and Pattern Recognition · Computer Science 2024-02-19 Utkarsh Singhal, Carlos Esteves, Ameesh Makadia, Stella X. Yu

Augmentation-induced Consistency Regularization for Classification

Deep neural networks have become popular in many supervised learning tasks, but they may suffer from overfitting when the training dataset is limited. To mitigate this, many researchers use data augmentation, which is a widely used and…

Machine Learning · Computer Science 2022-05-27 Jianhan Wu, Shijing Si, Jianzong Wang, Jing Xiao

Regularizing Adversarial Imitation Learning Using Causal Invariance

Imitation learning methods are used to infer a policy in a Markov decision process from a dataset of expert demonstrations by minimizing a divergence measure between the empirical state occupancy measures of the expert and the policy. The…

Machine Learning · Computer Science 2023-08-21 Ivan Ovinnikov, Joachim M. Buhmann