Related papers: Negative Data Augmentation

LatentAugment: Data Augmentation via Guided Manipulation of GAN's Latent Space

Data Augmentation (DA) is a technique to increase the quantity and diversity of the training data, and by that alleviate overfitting and improve generalisation. However, standard DA produces synthetic data for augmentation with limited…

Computer Vision and Pattern Recognition · Computer Science 2023-07-24 Lorenzo Tronchin, Minh H. Vu, Paolo Soda, Tommy Löfstedt

Data Augmentation via Structured Adversarial Perturbations

Data augmentation is a major component of many machine learning methods with state-of-the-art performance. Common augmentation strategies work by drawing random samples from a space of transformations. Unfortunately, such sampling…

Machine Learning · Computer Science 2020-11-06 Calvin Luo, Hossein Mobahi, Samy Bengio

Exploring Bias in GAN-based Data Augmentation for Small Samples

For machine learning tasks, a lack of sufficient samples means the trained model has low confidence in approximating the ground-truth function. Only recently, after generative adversarial networks (GANs) were proposed, have we seen the hope of…

Machine Learning · Computer Science 2019-05-22 Mengxiao Hu, Jinlong Li

Sampling Strategies for GAN Synthetic Data

Generative Adversarial Networks (GANs) have been used widely to generate large volumes of synthetic data. This data is being utilized for augmenting with real examples in order to train deep Convolutional Neural Networks (CNNs). Studies…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Binod Bhattarai, Seungryul Baek, Rumeysa Bodur, Tae-Kyun Kim

DADA: Deep Adversarial Data Augmentation for Extremely Low Data Regime Classification

Deep learning has revolutionized the performance of classification, but meanwhile demands sufficient labeled data for training. Given insufficient data, while many techniques have been developed to help combat overfitting, the challenge…

Computer Vision and Pattern Recognition · Computer Science 2018-09-05 Xiaofeng Zhang, Zhangyang Wang, Dong Liu, Qing Ling

Data Augmentation Approaches in Natural Language Processing: A Survey

As an effective strategy, data augmentation (DA) alleviates data scarcity scenarios where deep learning techniques may fail. It is widely applied in computer vision, was then introduced to natural language processing, and achieves improvements in…

Computation and Language · Computer Science 2022-06-28 Bohan Li, Yutai Hou, Wanxiang Che

Do We Need All the Synthetic Data? Targeted Image Augmentation via Diffusion Models

Synthetically augmenting training datasets with diffusion models has become an effective strategy for improving the generalization of image classifiers. However, existing approaches typically increase dataset size by 10-30x and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Dang Nguyen, Jiping Li, Jinghao Zheng, Baharan Mirzasoleiman

Toward Understanding Generative Data Augmentation

Generative data augmentation, which scales datasets by obtaining fake labeled examples from a trained conditional generative model, boosts classification performance in various learning tasks including (semi-)supervised learning, few-shot…

Machine Learning · Computer Science 2023-05-30 Chenyu Zheng, Guoqiang Wu, Chongxuan Li

Rethinking Data Augmentation for Low-Resource Neural Machine Translation: A Multi-Task Learning Approach

In the context of neural machine translation, data augmentation (DA) techniques may be used for generating additional training samples when the available parallel data are scarce. Many DA approaches aim at expanding the support of the…

Computation and Language · Computer Science 2021-09-09 Víctor M. Sánchez-Cartagena, Miquel Esplà-Gomis, Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez

Turning Waste into Wealth: Leveraging Low-Quality Samples for Enhancing Continuous Conditional Generative Adversarial Networks

Continuous Conditional Generative Adversarial Networks (CcGANs) enable generative modeling conditional on continuous scalar variables (termed regression labels). However, they can produce subpar fake images due to limited training data.…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Xin Ding, Yongwei Wang, Zuheng Xu

Data Augmentation Revisited: Rethinking the Distribution Gap between Clean and Augmented Data

Data augmentation has been widely applied as an effective methodology to improve generalization in particular when training deep neural networks. Recently, researchers proposed a few intensive data augmentation techniques, which indeed…

Machine Learning · Computer Science 2019-11-22 Zhuoxun He, Lingxi Xie, Xin Chen, Ya Zhang, Yanfeng Wang, Qi Tian

Adversarial Learning of General Transformations for Data Augmentation

Data augmentation (DA) is fundamental against overfitting in large convolutional neural networks, especially with a limited training dataset. In images, DA is usually based on heuristic transformations, like geometric or color…

Computer Vision and Pattern Recognition · Computer Science 2019-09-24 Saypraseuth Mounsaveng, David Vazquez, Ismail Ben Ayed, Marco Pedersoli

On Data Augmentation for GAN Training

Recent successes in Generative Adversarial Networks (GAN) have affirmed the importance of using more data in GAN training. Yet it is expensive to collect data in many domains such as medical applications. Data Augmentation (DA) has been…

Computer Vision and Pattern Recognition · Computer Science 2021-02-24 Ngoc-Trung Tran, Viet-Hung Tran, Ngoc-Bao Nguyen, Trung-Kien Nguyen, Ngai-Man Cheung

Generative Adversarial Networks for Data Augmentation

One way to expand the available dataset for training AI models in the medical field is through the use of Generative Adversarial Networks (GANs) for data augmentation. GANs work by employing a generator network to create new data samples…

Artificial Intelligence · Computer Science 2023-06-09 Angona Biswas, MD Abdullah Al Nasim, Al Imran, Anika Tabassum Sejuty, Fabliha Fairooz, Sai Puppala, Sajedul Talukder

Augmentation-Aware Self-Supervision for Data-Efficient GAN Training

Training generative adversarial networks (GANs) with limited data is challenging because the discriminator is prone to overfitting. Previously proposed differentiable augmentation demonstrates improved data efficiency of training GANs.…

Machine Learning · Computer Science 2023-12-29 Liang Hou, Qi Cao, Yige Yuan, Songtao Zhao, Chongyang Ma, Siyuan Pan, Pengfei Wan, Zhongyuan Wang, Huawei Shen, Xueqi Cheng

DreamDA: Generative Data Augmentation with Diffusion Models

The acquisition of large-scale, high-quality data is a resource-intensive and time-consuming endeavor. Compared to conventional Data Augmentation (DA) techniques (e.g. cropping and rotation), exploiting prevailing diffusion models for data…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yunxiang Fu, Chaoqi Chen, Yu Qiao, Yizhou Yu

Domain Generalization by Rejecting Extreme Augmentations

Data augmentation is one of the most effective techniques for regularizing deep learning models and improving their recognition performance in a variety of tasks and domains. However, this holds for standard in-domain settings, in which the…

Machine Learning · Computer Science 2025-10-09 Masih Aminbeidokhti, Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Eric Granger, Marco Pedersoli

The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective

Data augmentation (DA) is a powerful workhorse for bolstering performance in modern machine learning. Specific augmentations like translations and scaling in computer vision are traditionally believed to improve generalization by generating…

Machine Learning · Computer Science 2024-02-29 Chi-Heng Lin, Chiraag Kaushik, Eva L. Dyer, Vidya Muthukumar

Learning More with Less: GAN-based Medical Image Augmentation

Convolutional Neural Network (CNN)-based accurate prediction typically requires large-scale annotated training data. In Medical Imaging, however, both obtaining medical data and annotating them by expert physicians are challenging; to…

Computer Vision and Pattern Recognition · Computer Science 2019-05-30 Changhee Han, Kohei Murao, Shin'ichi Satoh, Hideki Nakayama

A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments

Data-Augmentation (DA) is known to improve performance across tasks and datasets. We propose a method to theoretically analyze the effect of DA and study questions such as: how many augmented samples are needed to correctly estimate the…

Machine Learning · Computer Science 2022-02-18 Randall Balestriero, Ishan Misra, Yann LeCun