Related papers: SDiFL: Stable Diffusion-Driven Framework for Image…

DLSF: Dual-Layer Synergistic Fusion for High-Fidelity Image Syn-thesis

With the rapid advancement of diffusion-based generative models, Stable Diffusion (SD) has emerged as a state-of-the-art framework for high-fidelity im-age synthesis. However, existing SD models suffer from suboptimal feature aggregation,…

Graphics · Computer Science 2025-07-21 Zhen-Qi Chen , Yuan-Fu Yang

DiffusionFF: A Diffusion-based Framework for Joint Face Forgery Detection and Fine-Grained Artifact Localization

The rapid evolution of deepfake technologies demands robust and reliable face forgery detection algorithms. While determining whether an image has been manipulated remains essential, the ability to precisely localize forgery clues is also…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Siran Peng , Haoyuan Zhang , Li Gao , Tianshuo Zhang , Xiangyu Zhu , Bao Li , Weisong Zhao , Zhen Lei

StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model

The rapid progress in generative models has given rise to the critical task of AI-Generated Content Stealth (AIGC-S), which aims to create AI-generated images that can evade both forensic detectors and human inspection. This task is crucial…

Computer Vision and Pattern Recognition · Computer Science 2024-08-13 Ziyin Zhou , Ke Sun , Zhongxi Chen , Huafeng Kuang , Xiaoshuai Sun , Rongrong Ji

Diffusion-Guided Semantic Consistency for Multimodal Heterogeneity

Federated learning (FL) is severely challenged by non-independent and identically distributed (non-IID) client data, a problem that degrades global model performance, especially in multimodal perception settings. Conventional methods often…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Jing Liu , Zhengliang Guo , Yan Wang , Xiaoguang Zhu , Yao Du , Zehua Wang , Victor C. M. Leung

CBDiff:Conditional Bernoulli Diffusion Models for Image Forgery Localization

Image Forgery Localization (IFL) is a crucial task in image forensics, aimed at accurately identifying manipulated or tampered regions within an image at the pixel level. Existing methods typically generate a single deterministic…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Zhou Lei , Pan Gang , Wang Jiahao , Sun Di

FakeInversion: Learning to Detect Images from Unseen Text-to-Image Models by Inverting Stable Diffusion

Due to the high potential for abuse of GenAI systems, the task of detecting synthetic images has recently become of great interest to the research community. Unfortunately, existing image-space detectors quickly become obsolete as new…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 George Cazenavette , Avneesh Sud , Thomas Leung , Ben Usman

ID$^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition

Synthetic face recognition (SFR) aims to generate synthetic face datasets that mimic the distribution of real face data, which allows for training face recognition models in a privacy-preserving manner. Despite the remarkable potential of…

Computer Vision and Pattern Recognition · Computer Science 2024-10-25 Shen Li , Jianqing Xu , Jiaying Wu , Miao Xiong , Ailin Deng , Jiazhen Ji , Yuge Huang , Wenjie Feng , Shouhong Ding , Bryan Hooi

DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion

The rapid progress of Deepfake technology has made face swapping highly realistic, raising concerns about the malicious use of fabricated facial content. Existing methods often struggle to generalize to unseen domains due to the diverse…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Ke Sun , Shen Chen , Taiping Yao , Hong Liu , Xiaoshuai Sun , Shouhong Ding , Rongrong Ji

Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression

Recent advances in Artificial Intelligence Generated Content (AIGC) have garnered significant interest, accompanied by an increasing need to transmit and compress the vast number of AI-generated images (AIGIs). However, there is a…

Image and Video Processing · Electrical Eng. & Systems 2024-12-18 Ruijie Chen , Qi Mao , Zhengxue Cheng

Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition

The task of steel surface defect recognition is an industrial problem with great industry values. The data insufficiency is the major challenge in training a robust defect recognition network. Existing methods have investigated to enlarge…

Computer Vision and Pattern Recognition · Computer Science 2024-05-06 Yichun Tai , Kun Yang , Tao Peng , Zhenzhen Huang , Zhijiang Zhang

SD-Acc: Accelerating Stable Diffusion through Phase-aware Sampling and Hardware Co-Optimizations

The emergence of diffusion models has significantly advanced generative AI, improving the quality, realism, and creativity of image and video generation. Among them, Stable Diffusion (StableDiff) stands out as a key model for text-to-image…

Hardware Architecture · Computer Science 2025-07-03 Zhican Wang , Guanghui He , Hongxiang Fan

Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion

Text-to-image generation via Stable Diffusion models (SDM) have demonstrated remarkable capabilities. However, their computational intensity, particularly in the iterative denoising process, hinders real-time deployment in latency-sensitive…

Computer Vision and Pattern Recognition · Computer Science 2025-05-08 Shuaiting Li , Juncan Deng , Zeyu Wang , Kedong Xu , Rongtao Deng , Hong Gu , Haibin Shen , Kejie Huang

Selective Domain-Invariant Feature for Generalizable Deepfake Detection

With diverse presentation forgery methods emerging continually, detecting the authenticity of images has drawn growing attention. Although existing methods have achieved impressive accuracy in training dataset detection, they still perform…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yingxin Lai , Guoqing Yang Yifan He , Zhiming Luo , Shaozi Li

MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize

While diffusion-based generative models have made significant strides in visual content creation, conventional approaches face computational challenges, especially for high-resolution images, as they denoise the entire image from noisy…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Haohang Xu , Longyu Chen , Yichen Zhang , Shuangrui Ding , Zhipeng Zhang

Consolidating Diffusion-Generated Video Detection with Unified Multimodal Forgery Learning

The proliferation of videos generated by diffusion models has raised increasing concerns about information security, highlighting the urgent need for reliable detection of synthetic media. Existing methods primarily focus on image-level…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Xiaohong Liu , Xiufeng Song , Huayu Zheng , Lei Bai , Xiaoming Liu , Guangtao Zhai

Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation

Conditional diffusion models have demonstrated impressive performance in image manipulation tasks. The general pipeline involves adding noise to the image and then denoising it. However, this method faces a trade-off problem: adding too…

Computer Vision and Pattern Recognition · Computer Science 2023-07-18 Luozhou Wang , Shuai Yang , Shu Liu , Ying-cong Chen

Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process

Diffusion models have demonstrated their effectiveness across various generative tasks. However, when applied to medical image segmentation, these models encounter several challenges, including significant resource and time requirements.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Tianyu Lin , Zhiguang Chen , Zhonghao Yan , Weijiang Yu , Fudan Zheng

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone: The increase of model parameters is mainly due to more attention…

Computer Vision and Pattern Recognition · Computer Science 2023-07-06 Dustin Podell , Zion English , Kyle Lacey , Andreas Blattmann , Tim Dockhorn , Jonas Müller , Joe Penna , Robin Rombach

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

In the field of class incremental learning (CIL), generative replay has become increasingly prominent as a method to mitigate the catastrophic forgetting, alongside the continuous improvements in generative models. However, its application…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Junsu Kim , Hoseong Cho , Jihyeon Kim , Yihalem Yimolal Tiruneh , Seungryul Baek

Image Forgery Localization via Guided Noise and Multi-Scale Feature Aggregation

Image Forgery Localization (IFL) technology aims to detect and locate the forged areas in an image, which is very important in the field of digital forensics. However, existing IFL methods suffer from feature degradation during training…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Yakun Niu , Pei Chen , Lei Zhang , Lei Tan , Yingjian Chen