Related papers: Semantic-Guided Generative Image Augmentation Meth…

Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance

Data augmentation is crucial for pixel-wise annotation tasks like semantic segmentation, where labeling requires significant effort and intensive labor. Traditional methods, involving simple transformations such as rotations and flips,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Quang-Huy Che , Duc-Tri Le , Bich-Nga Pham , Duc-Khai Lam , Vinh-Tiep Nguyen

SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation

Data augmentation for domain-specific image classification tasks often struggles to simultaneously address diversity, faithfulness, and label clarity of generated data, leading to suboptimal performance in downstream tasks. While existing…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixuan Dong , Fang-Yi Su , Jung-Hsien Chiang

SGIA: Enhancing Fine-Grained Visual Classification with Sequence Generative Image Augmentation

In Fine-Grained Visual Classification (FGVC), distinguishing highly similar subcategories remains a formidable challenge, often necessitating datasets with extensive variability. The acquisition and annotation of such FGVC datasets are…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Qiyu Liao , Xin Yuan , Min Xu , Dadong Wang

Composite Data Augmentations for Synthetic Image Detection Against Real-World Perturbations

The advent of accessible Generative AI tools enables anyone to create and spread synthetic images on social media, often with the intention to mislead, thus posing a significant threat to online information integrity. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Efthymia Amarantidou , Christos Koutlis , Symeon Papadopoulos , Panagiotis C. Petrantonakis

GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Data augmentation is widely used to enhance generalization in visual classification tasks. However, traditional methods struggle when source and target domains differ, as in domain adaptation, due to their inability to address domain gaps.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Khawar Islam , Muhammad Zaigham Zaheer , Arif Mahmood , Karthik Nandakumar , Naveed Akhtar

GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation

Augmentation by generative modelling yields a promising alternative to the accumulation of surgical data, where ethical, organisational and regulatory aspects must be considered. Yet, the joint synthesis of (image, mask) pairs for…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Yannik Frisch , Christina Bornberg , Moritz Fuchs , Anirban Mukhopadhyay

Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model

Text-to-image (T2I) generative models have recently emerged as a powerful tool, enabling the creation of photo-realistic images and giving rise to a multitude of applications. However, the effective integration of T2I models into…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Zhicai Wang , Longhui Wei , Tan Wang , Heyu Chen , Yanbin Hao , Xiang Wang , Xiangnan He , Qi Tian

Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection

Recent studies emphasize the crucial role of data augmentation in enhancing the performance of object detection models. However,existing methodologies often struggle to effectively harmonize dataset diversity with semantic coordination.To…

Computer Vision and Pattern Recognition · Computer Science 2024-08-07 Sen Nie , Zhuo Wang , Xinxin Wang , Kun He

Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation

Weakly-supervised semantic segmentation (WSSS), which aims to train segmentation models solely using image-level labels, has achieved significant attention. Existing methods primarily focus on generating high-quality pseudo labels using…

Computer Vision and Pattern Recognition · Computer Science 2024-11-27 Wangyu Wu , Tianhong Dai , Xiaowei Huang , Fei Ma , Jimin Xiao

GeNIe: Generative Hard Negative Images Through Diffusion

Data augmentation is crucial in training deep models, preventing them from overfitting to limited data. Recent advances in generative AI, e.g., diffusion models, have enabled more sophisticated augmentation techniques that produce data…

Computer Vision and Pattern Recognition · Computer Science 2024-11-08 Soroush Abbasi Koohpayegani , Anuj Singh , K L Navaneet , Hamed Pirsiavash , Hadi Jamali-Rad

Boosting Generative Image Modeling via Joint Image-Feature Synthesis

Latent diffusion models (LDMs) dominate high-quality image generation, yet integrating representation learning with generative modeling remains a challenge. We introduce a novel generative image modeling framework that seamlessly bridges…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Theodoros Kouzelis , Efstathios Karypidis , Ioannis Kakogeorgiou , Spyros Gidaris , Nikos Komodakis

Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts

Image synthesis approaches, e.g., generative adversarial networks, have been popular as a form of data augmentation in medical image analysis tasks. It is primarily beneficial to overcome the shortage of publicly accessible data and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-05 Shiyi Du , Xiaosong Wang , Yongyi Lu , Yuyin Zhou , Shaoting Zhang , Alan Yuille , Kang Li , Zongwei Zhou

Semantics-Guided Generative Image Compression

Advancements in text-to-image generative AI with large multimodal models are spreading into the field of image compression, creating high-quality representation of images at extremely low bit rates. This work introduces novel components to…

Image and Video Processing · Electrical Eng. & Systems 2025-06-02 Cheng-Lin Wu , Hyomin Choi , Ivan V. Bajić

Advances in Diffusion Models for Image Data Augmentation: A Review of Methods, Models, Evaluation Metrics and Future Research Directions

Image data augmentation constitutes a critical methodology in modern computer vision tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; thereby, improving the performance and robustness of…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Panagiotis Alimisis , Ioannis Mademlis , Panagiotis Radoglou-Grammatikis , Panagiotis Sarigiannidis , Georgios Th. Papadopoulos

Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

Salient Concept-Aware Generative Data Augmentation

Recent generative data augmentation methods conditioned on both image and text prompts struggle to balance between fidelity and diversity, as it is challenging to preserve essential image details while aligning with varied text prompts.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-20 Tianchen Zhao , Xuanbai Chen , Zhihua Li , Jun Fang , Dongsheng An , Xiang Xu , Zhuowen Tu , Yifan Xing

Do We Need All the Synthetic Data? Targeted Image Augmentation via Diffusion Models

Synthetically augmenting training datasets with diffusion models has become an effective strategy for improving the generalization of image classifiers. However, existing approaches typically increase dataset size by 10-30x and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Dang Nguyen , Jiping Li , Jinghao Zheng , Baharan Mirzasoleiman

R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation

Collecting and annotating datasets for pixel-level semantic segmentation tasks are highly labor-intensive. Data augmentation provides a viable solution by enhancing model generalization without additional real-world data collection.…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Huy Che , Dinh-Duy Phan , Duc-Khai Lam

Gen-SIS: Generative Self-augmentation Improves Self-supervised Learning

Self-supervised learning (SSL) methods have emerged as strong visual representation learners by training an image encoder to maximize similarity between features of different views of the same image. To perform this view-invariance task,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Varun Belagali , Srikar Yellapragada , Alexandros Graikos , Saarthak Kapse , Zilinghan Li , Tarak Nath Nandi , Ravi K Madduri , Prateek Prasanna , Joel Saltz , Dimitris Samaras

Diffusion Models Beat GANs on Image Synthesis

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For…

Machine Learning · Computer Science 2021-06-02 Prafulla Dhariwal , Alex Nichol