Related papers: DiffuGen: Adaptable Approach for Generating Labele…

Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation

Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Quang Nguyen , Truong Vu , Anh Tran , Khoi Nguyen

DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models

Recently, a number of image-mixing-based augmentation techniques have been introduced to improve the generalization of deep neural networks. In these techniques, two or more randomly selected natural images are mixed together to generate an…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Khawar Islam , Muhammad Zaigham Zaheer , Arif Mahmood , Karthik Nandakumar

Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners

Diffusion models, such as Stable Diffusion, have shown incredible performance on text-to-image generation. Since text-to-image generation often requires models to generate visual concepts with fine-grained details and attributes specified…

Computer Vision and Pattern Recognition · Computer Science 2024-04-26 Xuehai He , Weixi Feng , Tsu-Jui Fu , Varun Jampani , Arjun Akula , Pradyumna Narayana , Sugato Basu , William Yang Wang , Xin Eric Wang

DreamDA: Generative Data Augmentation with Diffusion Models

The acquisition of large-scale, high-quality data is a resource-intensive and time-consuming endeavor. Compared to conventional Data Augmentation (DA) techniques (e.g. cropping and rotation), exploiting prevailing diffusion models for data…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yunxiang Fu , Chaoqi Chen , Yu Qiao , Yizhou Yu

Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

Neural networks struggle with image classification when biases are learned and misleads correlations, affecting their generalization and performance. Previous methods require attribute labels (e.g. background, color) or utilizes Generative…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Donggeun Ko , Dongjun Lee , Namjun Park , Wonkyeong Shim , Jaekwang Kim

ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models

Modern diffusion-based image generative models have made significant progress and become promising to enrich training data for the object detection task. However, the generation quality and the controllability for complex scenes containing…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Jingyuan Zhu , Shiyu Li , Yuxuan Liu , Ping Huang , Jiulong Shan , Huimin Ma , Jian Yuan

SkinDualGen: Prompt-Driven Diffusion for Simultaneous Image-Mask Generation in Skin Lesions

Medical image analysis plays a pivotal role in the early diagnosis of diseases such as skin lesions. However, the scarcity of data and the class imbalance significantly hinder the performance of deep learning models. We propose a novel…

Image and Video Processing · Electrical Eng. & Systems 2025-07-29 Zhaobin Xu

Nested Diffusion Processes for Anytime Image Generation

Diffusion models are the current state-of-the-art in image generation, synthesizing high-quality images by breaking down the generation process into many fine-grained denoising steps. Despite their good performance, diffusion models are…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Noam Elata , Bahjat Kawar , Tomer Michaeli , Michael Elad

Advancing Image Classification with Discrete Diffusion Classification Modeling

Image classification is a well-studied task in computer vision, and yet it remains challenging under high-uncertainty conditions, such as when input images are corrupted or training data are limited. Conventional classification approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Omer Belhasin , Shelly Golan , Ran El-Yaniv , Michael Elad

DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

Accurate and controllable image editing is a challenging task that has attracted significant attention recently. Notably, DragGAN is an interactive point-based image editing framework that achieves impressive editing results with…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Yujun Shi , Chuhui Xue , Jun Hao Liew , Jiachun Pan , Hanshu Yan , Wenqing Zhang , Vincent Y. F. Tan , Song Bai

Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition

The task of steel surface defect recognition is an industrial problem with great industry values. The data insufficiency is the major challenge in training a robust defect recognition network. Existing methods have investigated to enlarge…

Computer Vision and Pattern Recognition · Computer Science 2024-05-06 Yichun Tai , Kun Yang , Tao Peng , Zhenzhen Huang , Zhijiang Zhang

Training on Thin Air: Improve Image Classification with Generated Data

Acquiring high-quality data for training discriminative models is a crucial yet challenging aspect of building effective predictive systems. In this paper, we present Diffusion Inversion, a simple yet effective method that leverages the…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Yongchao Zhou , Hshmat Sahak , Jimmy Ba

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

Recent advances in text-to-image generation with diffusion models present transformative capabilities in image quality. However, user controllability of the generated image, and fast adaptation to new tasks still remains an open challenge,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-17 Omer Bar-Tal , Lior Yariv , Yaron Lipman , Tali Dekel

Diffusion-Based Data Augmentation for Medical Image Segmentation

Medical image segmentation models struggle with rare abnormalities due to scarce annotated pathological data. We propose DiffAug a novel framework that combines textguided diffusion-based generation with automatic segmentation validation to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Maham Nazir , Muhammad Aqeel , Francesco Setti

Label-Consistent Dataset Distillation with Detector-Guided Refinement

Dataset distillation (DD) aims to generate a compact yet informative dataset that achieves performance comparable to the original dataset, thereby reducing demands on storage and computational resources. Although diffusion models have made…

Computer Vision and Pattern Recognition · Computer Science 2026-02-19 Yawen Zou , Guang Li , Zi Wang , Chunzhi Gu , Chao Zhang

Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map

Synthetic dataset generation in Computer Vision, particularly for industrial applications, is still underexplored. Industrial defect segmentation, for instance, requires highly accurate labels, yet acquiring such data is costly and…

Computer Vision and Pattern Recognition · Computer Science 2026-01-28 Emanuele Caruso , Alessandro Simoni , Francesco Pelosin

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Diffusion models have demonstrated remarkable success in various image generation tasks, but their performance is often limited by the uniform processing of inputs across varying conditions and noise levels. To address this limitation, we…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Minglei Shi , Ziyang Yuan , Haotian Yang , Xintao Wang , Mingwu Zheng , Xin Tao , Wenliang Zhao , Wenzhao Zheng , Jie Zhou , Jiwen Lu , Pengfei Wan , Di Zhang , Kun Gai

B\'ezier Meets Diffusion: Robust Generation Across Domains for Medical Image Segmentation

Training robust learning algorithms across different medical imaging modalities is challenging due to the large domain gap. Unsupervised domain adaptation (UDA) mitigates this problem by using annotated images from the source domain and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Chen Li , Meilong Xu , Xiaoling Hu , Weimin Lyu , Chao Chen

Image Neural Field Diffusion Models

Diffusion models have shown an impressive ability to model complex data distributions, with several key advantages over GANs, such as stable training, better coverage of the training distribution's modes, and the ability to solve inverse…

Computer Vision and Pattern Recognition · Computer Science 2024-06-12 Yinbo Chen , Oliver Wang , Richard Zhang , Eli Shechtman , Xiaolong Wang , Michael Gharbi

Discriminative Class Tokens for Text-to-Image Diffusion Models

Recent advances in text-to-image diffusion models have enabled the generation of diverse and high-quality images. While impressive, the images often fall short of depicting subtle details and are susceptible to errors due to ambiguity in…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Idan Schwartz , Vésteinn Snæbjarnarson , Hila Chefer , Ryan Cotterell , Serge Belongie , Lior Wolf , Sagie Benaim