Related papers: Data Augmentation: a Combined Inductive-Deductive …

Image Data Augmentation for Deep Learning: A Survey

Deep learning has achieved remarkable results in many computer vision tasks. Deep neural networks typically rely on large amounts of training data to avoid overfitting. However, labeled data for real-world applications may be limited. By…

Computer Vision and Pattern Recognition · Computer Science 2023-11-07 Suorong Yang, Weikang Xiao, Mengchen Zhang, Suhan Guo, Jian Zhao, Furao Shen

Transductive Label Augmentation for Improved Deep Network Learning

A major impediment to the application of deep learning to real-world problems is the scarcity of labeled data. Small training sets are in fact of no use to deep networks as, due to the large number of trainable parameters, they will very…

Computer Vision and Pattern Recognition · Computer Science 2018-05-29 Ismail Elezi, Alessandro Torcinovich, Sebastiano Vascon, Marcello Pelillo

Semantic Augmentation in Images using Language

Deep Learning models are incredibly data-hungry and require very large labeled datasets for supervised learning. As a consequence, these models often suffer from overfitting, limiting their ability to generalize to real-world examples.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Sahiti Yerramilli, Jayant Sravan Tamarapalli, Tanmay Girish Kulkarni, Jonathan Francis, Eric Nyberg

Joining datasets via data augmentation in the label space for neural networks

Most, if not all, modern deep learning systems restrict themselves to a single dataset for neural network training and inference. In this article, we are interested in systematic ways to join datasets that are made of similar purposes.…

Machine Learning · Computer Science 2021-06-18 Jake Zhao, Mingfeng Ou, Linji Xue, Yunkai Cui, Sai Wu, Gang Chen

DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models

Recently, a number of image-mixing-based augmentation techniques have been introduced to improve the generalization of deep neural networks. In these techniques, two or more randomly selected natural images are mixed together to generate an…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Khawar Islam, Muhammad Zaigham Zaheer, Arif Mahmood, Karthik Nandakumar

Effective Data Augmentation With Diffusion Models

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Brandon Trabucco, Kyle Doherty, Max Gurinas, Ruslan Salakhutdinov

The Effectiveness of Data Augmentation in Image Classification using Deep Learning

In this paper, we explore and compare multiple solutions to the problem of data augmentation in image classification. Previous work has demonstrated the effectiveness of data augmentation through simple techniques, such as cropping,…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Luis Perez, Jason Wang

Synthetic Image Data for Deep Learning

Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Jason W. Anderson, Marcin Ziolkowski, Ken Kennedy, Amy W. Apon

Deep Learning Approaches for Data Augmentation in Medical Imaging: A Review

Deep learning has become a popular tool for medical image analysis, but the limited availability of training data remains a major challenge, particularly in the medical field where data acquisition can be costly and subject to privacy…

Image and Video Processing · Electrical Eng. & Systems 2024-06-11 Aghiles Kebaili, Jérôme Lapuyade-Lahorgue, Su Ruan

DreamDA: Generative Data Augmentation with Diffusion Models

The acquisition of large-scale, high-quality data is a resource-intensive and time-consuming endeavor. Compared to conventional Data Augmentation (DA) techniques (e.g. cropping and rotation), exploiting prevailing diffusion models for data…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yunxiang Fu, Chaoqi Chen, Yu Qiao, Yizhou Yu

Annotating Ambiguous Images: General Annotation Strategy for High-Quality Data with Real-World Biomedical Validation

In the field of image classification, existing methods often struggle with biased or ambiguous data, a prevalent issue in real-world scenarios. Current strategies, including semi-supervised learning and class blending, offer partial…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Lars Schmarje, Vasco Grossmann, Claudius Zelenka, Johannes Brünger, Reinhard Koch

SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation

Data augmentation for domain-specific image classification tasks often struggles to simultaneously address diversity, faithfulness, and label clarity of generated data, leading to suboptimal performance in downstream tasks. While existing…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixuan Dong, Fang-Yi Su, Jung-Hsien Chiang

Discriminative Cross-Modal Data Augmentation for Medical Imaging Applications

While deep learning methods have shown great success in medical image analysis, they require a number of medical images to train. Due to data privacy concerns and unavailability of medical annotators, it is oftentimes very difficult to…

Image and Video Processing · Electrical Eng. & Systems 2020-10-08 Yue Yang, Pengtao Xie

Closing the Reality Gap with Unsupervised Sim-to-Real Image Translation

Deep learning approaches have become the standard solution to many problems in computer vision and robotics, but obtaining sufficient training data in high enough quality is challenging, as human labor is error prone, time consuming, and…

Machine Learning · Computer Science 2021-06-16 Jan Blumenkamp, Andreas Baude, Tim Laue

Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images

Despite continued advancement in recent years, deep neural networks still rely on large amounts of training data to avoid overfitting. However, labeled training data for real-world applications such as healthcare is limited and difficult to…

Machine Learning · Computer Science 2023-01-13 Mohamed Akrout, Bálint Gyepesi, Péter Holló, Adrienn Poór, Blága Kincső, Stephen Solis, Katrina Cirone, Jeremy Kawahara, Dekker Slade, Latif Abid, Máté Kovács, István Fazekas

Data augmentation using learned transformations for one-shot medical image segmentation

Image segmentation is an important task in many medical applications. Methods based on convolutional neural networks attain state-of-the-art accuracy; however, they typically rely on supervised training with large labeled datasets. Labeling…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Amy Zhao, Guha Balakrishnan, Frédo Durand, John V. Guttag, Adrian V. Dalca

Image Data Augmentation Approaches: A Comprehensive Survey and Future directions

Deep learning (DL) algorithms have shown significant performance in various computer vision tasks. However, having limited labelled data leads to a network overfitting problem, where network performance is bad on unseen data as compared to…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Teerath Kumar, Alessandra Mileo, Rob Brennan, Malika Bendechache

Deep Image Harmonization by Bridging the Reality Gap

Image harmonization has been significantly advanced with large-scale harmonization datasets. However, the current way to build datasets is still labor-intensive, which adversely affects the extendability of datasets. To address this problem,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-13 Junyan Cao, Wenyan Cong, Li Niu, Jianfu Zhang, Liqing Zhang

Using Synthetic Images to Augment Small Medical Image Datasets

Recent years have witnessed a growing academic and industrial interest in deep learning (DL) for medical imaging. To perform well, DL models require very large labeled datasets. However, most medical imaging datasets are small, with a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Minh H. Vu, Lorenzo Tronchin, Tufve Nyholm, Tommy Löfstedt

Albumentations: fast and flexible image augmentations

Data augmentation is a commonly used technique for increasing both the size and the diversity of labeled training sets by leveraging input transformations that preserve output labels. In computer vision domain, image augmentations have…

Computer Vision and Pattern Recognition · Computer Science 2020-02-27 Alexander Buslaev, Alex Parinov, Eugene Khvedchenya, Vladimir I. Iglovikov, Alexandr A. Kalinin