Related papers: DiffDoctor: Diagnosing Image Diffusion Models Befo…

DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing

Large-scale Text-to-Image (T2I) diffusion models have revolutionized image generation over the last few years. Although owning diverse and high-quality generation capabilities, translating these abilities to fine-grained image editing…

Computer Vision and Pattern Recognition · Computer Science 2024-02-06 Chong Mou , Xintao Wang , Jiechong Song , Ying Shan , Jian Zhang

DiffusionQC: Artifact Detection in Histopathology via Diffusion Model

Digital pathology plays a vital role across modern medicine, offering critical insights for disease diagnosis, prognosis, and treatment. However, histopathology images often contain artifacts introduced during slide preparation and…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Zhenzhen Wang , Zhongliang Zhou , Zhuoyu Wen , Jeong Hwan Kook , John B Wojcik , John Kang

TextDiffuser: Diffusion Models as Text Painters

Diffusion models have gained increasing attention for their impressive generation abilities but currently struggle with rendering accurate and coherent text. To address this issue, we introduce TextDiffuser, focusing on generating images…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Jingye Chen , Yupan Huang , Tengchao Lv , Lei Cui , Qifeng Chen , Furu Wei

DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models

Recent generative models show impressive results in photo-realistic image generation. However, artifacts often inevitably appear in the generated results, leading to downgraded user experience and reduced performance in downstream tasks.…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Yueqin Yin , Lianghua Huang , Yu Liu , Kaiqi Huang

Back-in-Time Diffusion: Unsupervised Detection of Medical Deepfakes

Recent progress in generative models has made it easier for a wide audience to edit and create image content, raising concerns about the proliferation of deepfakes, especially in healthcare. Despite the availability of numerous techniques…

Image and Video Processing · Electrical Eng. & Systems 2024-10-22 Fred Grabovski , Lior Yasur , Guy Amit , Yisroel Mirsky

Accelerating Diffusion Decoders via Multi-Scale Sampling and One-Step Distillation

Image tokenization plays a central role in modern generative modeling by mapping visual inputs into compact representations that serve as an intermediate signal between pixels and generative models. Diffusion-based decoders have recently…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Chuhan Wang , Hao Chen

Assessing the use of Diffusion models for motion artifact correction in brain MRI

Magnetic Resonance Imaging generally requires long exposure times, while being sensitive to patient motion, resulting in artifacts in the acquired images, which may hinder their diagnostic relevance. Despite research efforts to decrease the…

Image and Video Processing · Electrical Eng. & Systems 2025-02-04 Paolo Angella , Vito Paolo Pastore , Matteo Santacesaria

Diffusion-Based Data Augmentation for Medical Image Segmentation

Medical image segmentation models struggle with rare abnormalities due to scarce annotated pathological data. We propose DiffAug a novel framework that combines textguided diffusion-based generation with automatic segmentation validation to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Maham Nazir , Muhammad Aqeel , Francesco Setti

Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection

In this study, we show that diffusion models can be used in industrial scenarios to improve the data augmentation procedure in the context of surface defect detection. In general, defect detection classifiers are trained on ground-truth…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Luigi Capogrosso , Federico Girella , Francesco Taioli , Michele Dalla Chiara , Muhammad Aqeel , Franco Fummi , Francesco Setti , Marco Cristani

See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

Despite recent advances in diffusion models, AI generated images still often contain visual artifacts that compromise realism. Although more thorough pre-training and bigger models might reduce artifacts, there is no assurance that they can…

Computer Vision and Pattern Recognition · Computer Science 2026-03-27 Jaehyun Park , Minyoung Ahn , Minkyu Kim , Jonghyun Lee , Jae-Gil Lee , Dongmin Park

Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration

Image restoration is a classic low-level problem aimed at recovering high-quality images from low-quality images with various degradations such as blur, noise, rain, haze, etc. However, due to the inherent complexity and non-uniqueness of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Yuhong Zhang , Hengsheng Zhang , Xinning Chai , Zhengxue Cheng , Rong Xie , Li Song , Wenjun Zhang

Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts

Visual artifacts remain a persistent challenge in diffusion models, even with training on massive datasets. Current solutions primarily rely on supervised detectors, yet lack understanding of why these artifacts occur in the first place. In…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Yu Cao , Zengqun Zhao , Ioannis Patras , Shaogang Gong

Characterizing Photorealism and Artifacts in Diffusion Model-Generated Images

Diffusion model-generated images can appear indistinguishable from authentic photographs, but these images often contain artifacts and implausibilities that reveal their AI-generated provenance. Given the challenge to public trust in media…

Human-Computer Interaction · Computer Science 2025-02-18 Negar Kamali , Karyn Nakamura , Aakriti Kumar , Angelos Chatzimparmpas , Jessica Hullman , Matthew Groh

DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models

Recent data-driven image colorization methods have enabled automatic or reference-based colorization, while still suffering from unsatisfactory and inaccurate object-level color control. To address these issues, we propose a new method…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Jianxin Lin , Peng Xiao , Yijun Wang , Rongju Zhang , Xiangxiang Zeng

Towards Multi-domain Face Landmark Detection with Synthetic Data from Diffusion model

Recently, deep learning-based facial landmark detection for in-the-wild faces has achieved significant improvement. However, there are still challenges in face landmark detection in other domains (e.g. cartoon, caricature, etc). This is due…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Yuanming Li , Gwantae Kim , Jeong-gi Kwak , Bon-hwa Ku , Hanseok Ko

DiffusionFF: A Diffusion-based Framework for Joint Face Forgery Detection and Fine-Grained Artifact Localization

The rapid evolution of deepfake technologies demands robust and reliable face forgery detection algorithms. While determining whether an image has been manipulated remains essential, the ability to precisely localize forgery clues is also…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Siran Peng , Haoyuan Zhang , Li Gao , Tianshuo Zhang , Xiangyu Zhu , Bao Li , Weisong Zhao , Zhen Lei

A Difference-in-Difference Approach to Detecting AI-Generated Images

Diffusion models are able to produce AI-generated images that are almost indistinguishable from real ones. This raises concerns about their potential misuse and poses substantial challenges for detecting them. Many existing detectors rely…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Xinyi Qi , Kai Ye , Chengchun Shi , Ying Yang , Hongyi Zhou , Jin Zhu

Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models

Deep generative models have emerged as promising tools for detecting arbitrary anomalies in data, dispensing with the necessity for manual labelling. Recently, autoregressive transformers have achieved state-of-the-art performance for…

Computer Vision and Pattern Recognition · Computer Science 2022-06-08 Walter H. L. Pinaya , Mark S. Graham , Robert Gray , Pedro F Da Costa , Petru-Daniel Tudosiu , Paul Wright , Yee H. Mah , Andrew D. MacKinnon , James T. Teo , Rolf Jager , David Werring , Geraint Rees , Parashkev Nachev , Sebastien Ourselin , M. Jorge Cardoso

Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation

Recent years have seen significant progress in human image generation, particularly with the advancements in diffusion models. However, existing diffusion methods encounter challenges when producing consistent hand anatomy and the generated…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Anton Pelykh , Ozge Mercanoglu Sincan , Richard Bowden

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Recent work showed that large diffusion models can be reused as highly precise monocular depth estimators by casting depth estimation as an image-conditional image generation task. While the proposed model achieved state-of-the-art results,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Gonzalo Martin Garcia , Karim Knaebel , Christian Schmidt , Daan de Geus , Alexander Hermans , Bastian Leibe