English
Related papers

Related papers: DreamDA: Generative Data Augmentation with Diffusi…

200 papers

Recently, a number of image-mixing-based augmentation techniques have been introduced to improve the generalization of deep neural networks. In these techniques, two or more randomly selected natural images are mixed together to generate an…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Khawar Islam , Muhammad Zaigham Zaheer , Arif Mahmood , Karthik Nandakumar

Despite continued advancement in recent years, deep neural networks still rely on large amounts of training data to avoid overfitting. However, labeled training data for real-world applications such as healthcare is limited and difficult to…

In this paper, we present an effective data augmentation framework leveraging the Large Language Model (LLM) and Diffusion Model (DM) to tackle the challenges inherent in data-scarce scenarios. Recently, DMs have opened up the possibility…

Computer Vision and Pattern Recognition · Computer Science 2024-09-26 Kyuheon Jung , Yongdeuk Seo , Seongwoo Cho , Jaeyoung Kim , Hyun-seok Min , Sungchul Choi

Data augmentation for domain-specific image classification tasks often struggles to simultaneously address diversity, faithfulness, and label clarity of generated data, leading to suboptimal performance in downstream tasks. While existing…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixuan Dong , Fang-Yi Su , Jung-Hsien Chiang

Data augmentation is widely used to train deep learning models to address data scarcity. However, traditional data augmentation (TDA) typically relies on simple geometric transformation, such as random rotation and rescaling, resulting in…

Computer Vision and Pattern Recognition · Computer Science 2025-08-27 Dekai Zhu , Stefan Gavranovic , Flavien Boussuge , Benjamin Busam , Slobodan Ilic

Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Quang Nguyen , Truong Vu , Anh Tran , Khoi Nguyen

Data augmentation plays a crucial role in deep learning, enhancing the generalization and robustness of learning-based models. Standard approaches involve simple transformations like rotations and flips for generating extra data. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Shichao Dong , Ze Yang , Guosheng Lin

Image data augmentation constitutes a critical methodology in modern computer vision tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; thereby, improving the performance and robustness of…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Panagiotis Alimisis , Ioannis Mademlis , Panagiotis Radoglou-Grammatikis , Panagiotis Sarigiannidis , Georgios Th. Papadopoulos

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Brandon Trabucco , Kyle Doherty , Max Gurinas , Ruslan Salakhutdinov

Diffusion-based data augmentation (DiffDA) has emerged as a promising approach to improving classification performance under data scarcity. However, existing works vary significantly in task configurations, model choices, and experimental…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Zekun Li , Yinghuan Shi , Yang Gao , Dong Xu

Collecting and annotating datasets for pixel-level semantic segmentation tasks are highly labor-intensive. Data augmentation provides a viable solution by enhancing model generalization without additional real-world data collection.…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Huy Che , Dinh-Duy Phan , Duc-Khai Lam

Generating high-quality labeled image datasets is crucial for training accurate and robust machine learning models in the field of computer vision. However, the process of manually labeling real images is often time-consuming and costly. To…

Computer Vision and Pattern Recognition · Computer Science 2023-09-04 Michael Shenoda , Edward Kim

Data Augmentation (DA), i.e., synthesizing faithful and diverse samples to expand the original training set, is a prevalent and effective strategy to improve the performance of various data-scarce tasks. With the powerful image generation…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Yanghao Wang , Long Chen

Data augmentation is a ubiquitous technique for increasing the size of labeled training sets by leveraging task-specific data transformations that preserve class labels. While it is often easy for domain experts to specify individual…

Machine Learning · Statistics 2018-12-10 Alexander J. Ratner , Henry R. Ehrenberg , Zeshan Hussain , Jared Dunnmon , Christopher Ré

Research on unsupervised domain adaptation (UDA) for semantic segmentation of remote sensing images has been extensively conducted. However, research on how to achieve domain adaptation in practical scenarios where source domain data is…

Computer Vision and Pattern Recognition · Computer Science 2025-09-24 Wenjie Liu , Hongmin Liu , Lixin Zhang , Bin Fan

Deep learning-based food image classification enables precise identification of food categories, further facilitating accurate nutritional analysis. However, real-world food images often show a skewed distribution, with some food types…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 GaYeon Koh , Hyun-Jic Oh , Jeonghyun Noh , Won-Ki Jeong

Synthetically augmenting training datasets with diffusion models has become an effective strategy for improving the generalization of image classifiers. However, existing approaches typically increase dataset size by 10-30x and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Dang Nguyen , Jiping Li , Jinghao Zheng , Baharan Mirzasoleiman

Simple data augmentation techniques, such as rotations and flips, are widely used to enhance the generalization power of computer vision models. However, these techniques often fail to modify high-level semantic attributes of a class. To…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Tobias Lingenberg , Markus Reuter , Gopika Sudhakaran , Dominik Gojny , Stefan Roth , Simone Schaub-Meyer

Medical image data is less accessible than in other domains due to privacy and regulatory constraints. In addition, labeling requires costly, time-intensive manual image annotation by clinical experts. To overcome these challenges,…

Image and Video Processing · Electrical Eng. & Systems 2025-07-11 Fangyijie Wang , Kevin Whelan , Félix Balado , Kathleen M. Curran , Guénolé Silvestre

Deep learning has become a popular tool for medical image analysis, but the limited availability of training data remains a major challenge, particularly in the medical field where data acquisition can be costly and subject to privacy…

Image and Video Processing · Electrical Eng. & Systems 2024-06-11 Aghiles Kebaili , Jérôme Lapuyade-Lahorgue , Su Ruan
‹ Prev 1 2 3 10 Next ›