Related papers: Diversify Your Vision Datasets with Automatic Diff…

Data Augmentation for Image Classification using Generative AI

Scaling laws dictate that the performance of AI models is proportional to the amount of available data. Data augmentation is a promising solution to expanding the dataset size. Traditional approaches focused on augmentation using rotation,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Fazle Rahat , M Shifat Hossain , Md Rubel Ahmed , Sumit Kumar Jha , Rickard Ewetz

A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification Tasks

In recent years, one of the most popular techniques in the computer vision community has been the deep learning technique. As a data-driven technique, deep model requires enormous amounts of accurately labelled training data, which is often…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Zihan Yang , Richard O. Sinnott , James Bailey , Qiuhong Ke

AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation

Convolutional neural networks have been widely applied to medical image segmentation and have achieved considerable performance. However, the performance may be significantly affected by the domain gap between training data (source domain)…

Image and Video Processing · Electrical Eng. & Systems 2022-07-28 Junyan Lyu , Yiqi Zhang , Yijin Huang , Li Lin , Pujin Cheng , Xiaoying Tang

Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models

In recent years, much progress has been made in learning robotic manipulation policies that follow natural language instructions. Such methods typically learn from corpora of robot-language data that was either collected with specific tasks…

Robotics · Computer Science 2023-07-04 Ted Xiao , Harris Chan , Pierre Sermanet , Ayzaan Wahid , Anthony Brohan , Karol Hausman , Sergey Levine , Jonathan Tompson

Effective Data Augmentation With Diffusion Models

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Brandon Trabucco , Kyle Doherty , Max Gurinas , Ruslan Salakhutdinov

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification

Data augmentation aims to enrich training samples for alleviating the overfitting issue in low-resource or class-imbalanced situations. Traditional methods first devise task-specific operations such as Synonym Substitute, then preset the…

Computation and Language · Computer Science 2021-09-03 Shuhuai Ren , Jinchao Zhang , Lei Li , Xu Sun , Jie Zhou

Deep AutoAugment

While recent automated data augmentation methods lead to state-of-the-art results, their design spaces and the derived data augmentation strategies still incorporate strong human priors. In this work, instead of fixing a set of hand-picked…

Computer Vision and Pattern Recognition · Computer Science 2022-03-16 Yu Zheng , Zhi Zhang , Shen Yan , Mi Zhang

DIAGen: Semantically Diverse Image Augmentation with Generative Models for Few-Shot Learning

Simple data augmentation techniques, such as rotations and flips, are widely used to enhance the generalization power of computer vision models. However, these techniques often fail to modify high-level semantic attributes of a class. To…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Tobias Lingenberg , Markus Reuter , Gopika Sudhakaran , Dominik Gojny , Stefan Roth , Simone Schaub-Meyer

Do We Need All the Synthetic Data? Targeted Image Augmentation via Diffusion Models

Synthetically augmenting training datasets with diffusion models has become an effective strategy for improving the generalization of image classifiers. However, existing approaches typically increase dataset size by 10-30x and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Dang Nguyen , Jiping Li , Jinghao Zheng , Baharan Mirzasoleiman

Data Augmentation Approaches in Natural Language Processing: A Survey

As an effective strategy, data augmentation (DA) alleviates data scarcity scenarios where deep learning techniques may fail. It is widely applied in computer vision then introduced to natural language processing and achieves improvements in…

Computation and Language · Computer Science 2022-06-28 Bohan Li , Yutai Hou , Wanxiang Che

DIRA: Dynamic Domain Incremental Regularised Adaptation

Autonomous systems (AS) often use Deep Neural Network (DNN) classifiers to allow them to operate in complex, high-dimensional, non-linear, and dynamically changing environments. Due to the complexity of these environments, DNN classifiers…

Machine Learning · Computer Science 2024-08-16 Abanoub Ghobrial , Xuan Zheng , Darryl Hond , Hamid Asgari , Kerstin Eder

Adversarial Bayesian Augmentation for Single-Source Domain Generalization

Generalizing to unseen image domains is a challenging problem primarily due to the lack of diverse training data, inaccessible target data, and the large domain shift that may exist in many real-world settings. As such data augmentation is…

Computer Vision and Pattern Recognition · Computer Science 2023-10-04 Sheng Cheng , Tejas Gokhale , Yezhou Yang

SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation

In Fine-Grained Visual Classification (FGVC), distinguishing highly similar subcategories remains a formidable challenge, often necessitating datasets with extensive variability. The acquisition and annotation of such FGVC datasets are…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Qiyu Liao , Xin Yuan , Min Xu , Dadong Wang

Automatic Data Augmentation Learning using Bilevel Optimization for Histopathological Images

Training a deep learning model to classify histopathological images is challenging, because of the color and shape variability of the cells and tissues, and the reduced amount of available data, which does not allow proper learning of those…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Saypraseuth Mounsaveng , Issam Laradji , David Vázquez , Marco Perdersoli , Ismail Ben Ayed

Enabling Data Diversity: Efficient Automatic Augmentation via Regularized Adversarial Training

Data augmentation has proved extremely useful by increasing training data variance to alleviate overfitting and improve deep neural networks' generalization performance. In medical image analysis, a well-designed augmentation policy usually…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Yunhe Gao , Zhiqiang Tang , Mu Zhou , Dimitris Metaxas

IA-VLA: Input Augmentation for Vision-Language-Action models in settings with semantically complex tasks

Vision-language-action models (VLAs) have become an increasingly popular approach for addressing robot manipulation problems in recent years. However, such models need to output actions at a rate suitable for robot control, which limits the…

Robotics · Computer Science 2025-09-30 Eric Hannus , Miika Malin , Tran Nguyen Le , Ville Kyrki

DreamDA: Generative Data Augmentation with Diffusion Models

The acquisition of large-scale, high-quality data is a resource-intensive and time-consuming endeavor. Compared to conventional Data Augmentation (DA) techniques (e.g. cropping and rotation), exploiting prevailing diffusion models for data…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yunxiang Fu , Chaoqi Chen , Yu Qiao , Yizhou Yu

Image Augmentation Agent for Weakly Supervised Semantic Segmentation

Weakly-supervised semantic segmentation (WSSS) has achieved remarkable progress using only image-level labels. However, most existing WSSS methods focus on designing new network structures and loss functions to generate more accurate dense…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Wangyu Wu , Xianglin Qiu , Siqi Song , Zhenhong Chen , Xiaowei Huang , Fei Ma , Jimin Xiao

Limited Linguistic Diversity in Embodied AI Datasets

Language plays a critical role in Vision-Language-Action (VLA) models, yet the linguistic characteristics of the datasets used to train and evaluate these systems remain poorly documented. In this work, we present a systematic dataset audit…

Computation and Language · Computer Science 2026-04-29 Selma Wanna , Agnes Luhtaru , Jonathan Salfity , Ryan Barron , Juston Moore , Cynthia Matuszek , Mitch Pryor

DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models

Although instruction tuning is widely used to adjust behavior in Large Language Models (LLMs), extensive empirical evidence and research indicates that it is primarily a process where the model fits to specific task formats, rather than…

Artificial Intelligence · Computer Science 2024-08-21 Yuanhao Zeng , Fei Ren , Xinpeng Zhou , Yihang Wang , Yingxia Shao