English
Related papers

Related papers: Semantic-Guided Generative Image Augmentation Meth…

200 papers

Data augmentation is crucial for pixel-wise annotation tasks like semantic segmentation, where labeling requires significant effort and intensive labor. Traditional methods, involving simple transformations such as rotations and flips,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Quang-Huy Che , Duc-Tri Le , Bich-Nga Pham , Duc-Khai Lam , Vinh-Tiep Nguyen

Data augmentation for domain-specific image classification tasks often struggles to simultaneously address diversity, faithfulness, and label clarity of generated data, leading to suboptimal performance in downstream tasks. While existing…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Yixuan Dong , Fang-Yi Su , Jung-Hsien Chiang

In Fine-Grained Visual Classification (FGVC), distinguishing highly similar subcategories remains a formidable challenge, often necessitating datasets with extensive variability. The acquisition and annotation of such FGVC datasets are…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Qiyu Liao , Xin Yuan , Min Xu , Dadong Wang

The advent of accessible Generative AI tools enables anyone to create and spread synthetic images on social media, often with the intention to mislead, thus posing a significant threat to online information integrity. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Efthymia Amarantidou , Christos Koutlis , Symeon Papadopoulos , Panagiotis C. Petrantonakis

Data augmentation is widely used to enhance generalization in visual classification tasks. However, traditional methods struggle when source and target domains differ, as in domain adaptation, due to their inability to address domain gaps.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Khawar Islam , Muhammad Zaigham Zaheer , Arif Mahmood , Karthik Nandakumar , Naveed Akhtar

Augmentation by generative modelling yields a promising alternative to the accumulation of surgical data, where ethical, organisational and regulatory aspects must be considered. Yet, the joint synthesis of (image, mask) pairs for…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Yannik Frisch , Christina Bornberg , Moritz Fuchs , Anirban Mukhopadhyay

Text-to-image (T2I) generative models have recently emerged as a powerful tool, enabling the creation of photo-realistic images and giving rise to a multitude of applications. However, the effective integration of T2I models into…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Zhicai Wang , Longhui Wei , Tan Wang , Heyu Chen , Yanbin Hao , Xiang Wang , Xiangnan He , Qi Tian

Recent studies emphasize the crucial role of data augmentation in enhancing the performance of object detection models. However,existing methodologies often struggle to effectively harmonize dataset diversity with semantic coordination.To…

Computer Vision and Pattern Recognition · Computer Science 2024-08-07 Sen Nie , Zhuo Wang , Xinxin Wang , Kun He

Weakly-supervised semantic segmentation (WSSS), which aims to train segmentation models solely using image-level labels, has achieved significant attention. Existing methods primarily focus on generating high-quality pseudo labels using…

Computer Vision and Pattern Recognition · Computer Science 2024-11-27 Wangyu Wu , Tianhong Dai , Xiaowei Huang , Fei Ma , Jimin Xiao

Data augmentation is crucial in training deep models, preventing them from overfitting to limited data. Recent advances in generative AI, e.g., diffusion models, have enabled more sophisticated augmentation techniques that produce data…

Computer Vision and Pattern Recognition · Computer Science 2024-11-08 Soroush Abbasi Koohpayegani , Anuj Singh , K L Navaneet , Hamed Pirsiavash , Hadi Jamali-Rad

Latent diffusion models (LDMs) dominate high-quality image generation, yet integrating representation learning with generative modeling remains a challenge. We introduce a novel generative image modeling framework that seamlessly bridges…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Theodoros Kouzelis , Efstathios Karypidis , Ioannis Kakogeorgiou , Spyros Gidaris , Nikos Komodakis

Image synthesis approaches, e.g., generative adversarial networks, have been popular as a form of data augmentation in medical image analysis tasks. It is primarily beneficial to overcome the shortage of publicly accessible data and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-05 Shiyi Du , Xiaosong Wang , Yongyi Lu , Yuyin Zhou , Shaoting Zhang , Alan Yuille , Kang Li , Zongwei Zhou

Advancements in text-to-image generative AI with large multimodal models are spreading into the field of image compression, creating high-quality representation of images at extremely low bit rates. This work introduces novel components to…

Image and Video Processing · Electrical Eng. & Systems 2025-06-02 Cheng-Lin Wu , Hyomin Choi , Ivan V. Bajić

Image data augmentation constitutes a critical methodology in modern computer vision tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; thereby, improving the performance and robustness of…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Panagiotis Alimisis , Ioannis Mademlis , Panagiotis Radoglou-Grammatikis , Panagiotis Sarigiannidis , Georgios Th. Papadopoulos

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

Recent generative data augmentation methods conditioned on both image and text prompts struggle to balance between fidelity and diversity, as it is challenging to preserve essential image details while aligning with varied text prompts.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-20 Tianchen Zhao , Xuanbai Chen , Zhihua Li , Jun Fang , Dongsheng An , Xiang Xu , Zhuowen Tu , Yifan Xing

Synthetically augmenting training datasets with diffusion models has become an effective strategy for improving the generalization of image classifiers. However, existing approaches typically increase dataset size by 10-30x and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Dang Nguyen , Jiping Li , Jinghao Zheng , Baharan Mirzasoleiman

Collecting and annotating datasets for pixel-level semantic segmentation tasks are highly labor-intensive. Data augmentation provides a viable solution by enhancing model generalization without additional real-world data collection.…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Huy Che , Dinh-Duy Phan , Duc-Khai Lam

Self-supervised learning (SSL) methods have emerged as strong visual representation learners by training an image encoder to maximize similarity between features of different views of the same image. To perform this view-invariance task,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Varun Belagali , Srikar Yellapragada , Alexandros Graikos , Saarthak Kapse , Zilinghan Li , Tarak Nath Nandi , Ravi K Madduri , Prateek Prasanna , Joel Saltz , Dimitris Samaras

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For…

Machine Learning · Computer Science 2021-06-02 Prafulla Dhariwal , Alex Nichol
‹ Prev 1 2 3 10 Next ›