Related papers: CleanDIFT: Diffusion Features without Noise

Diffusion Features to Bridge Domain Gap for Semantic Segmentation

Pre-trained diffusion models have demonstrated remarkable proficiency in synthesizing images across a wide range of scenarios with customizable prompts, indicating their effective capacity to capture universal features. Motivated by this,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Yuxiang Ji , Boyong He , Chenyuan Qu , Zhuoyue Tan , Chuan Qin , Liaoni Wu

A Noise is Worth Diffusion Guidance

Diffusion models excel in generating high-quality images. However, current diffusion models struggle to produce reliable images without guidance methods, such as classifier-free guidance (CFG). Are guidance methods truly necessary?…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Donghoon Ahn , Jiwon Kang , Sanghyun Lee , Jaewon Min , Minjae Kim , Wooseok Jang , Hyoungwon Cho , Sayak Paul , SeonHwa Kim , Eunju Cha , Kyong Hwan Jin , Seungryong Kim

DiffLoss: unleashing diffusion model as constraint for training image restoration network

Image restoration aims to enhance low quality images, producing high quality images that exhibit natural visual characteristics and fine semantic attributes. Recently, the diffusion model has emerged as a powerful technique for image…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Jiangtong Tan , Feng Zhao

Dynamic Addition of Noise in a Diffusion Model for Anomaly Detection

Diffusion models have found valuable applications in anomaly detection by capturing the nominal data distribution and identifying anomalies via reconstruction. Despite their merits, they struggle to localize anomalies of varying scales,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-25 Justin Tebbe , Jawad Tayyub

Diffusion Representations for Fine-Grained Image Classification: A Marine Plankton Case Study

Diffusion models have emerged as state-of-the-art generative methods for image synthesis, yet their potential as general-purpose feature encoders remains underexplored. Trained for denoising and generation without labels, they can be…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 A. Nieto Juscafresa , Á. Mazcuñán Herreros , J. Sullivan

TransFusion: Transcribing Speech with Multinomial Diffusion

Diffusion models have shown exceptional scaling properties in the image synthesis domain, and initial attempts have shown similar benefits for applying diffusion to unconditional text synthesis. Denoising diffusion models attempt to…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-17 Matthew Baas , Kevin Eloff , Herman Kamper

Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off

While foundation models demonstrate impressive performance across various tasks, they remain vulnerable to adversarial inputs. Current research explores various approaches to enhance model robustness, with Diffusion Denoised Smoothing…

Machine Learning · Computer Science 2025-05-22 Yury Belousov , Brian Pulfer , Vitaliy Kinakh , Slava Voloshynovskiy

Bring the Power of Diffusion Model to Defect Detection

Due to the high complexity and technical requirements of industrial production processes, surface defects will inevitably appear, which seriously affects the quality of products. Although existing lightweight detection networks are highly…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Xuyi Yu

Cloud Diffusion Part 1: Theory and Motivation

Diffusion models for image generation function by progressively adding noise to an image set and training a model to separate out the signal from the noise. The noise profile used by these models is white noise -- that is, noise based on…

Computer Vision and Pattern Recognition · Computer Science 2025-07-09 Andrew Randono

ExposureDiffusion: Learning to Expose for Low-light Image Enhancement

Previous raw image-based low-light image enhancement methods predominantly relied on feed-forward neural networks to learn deterministic mappings from low-light to normally-exposed images. However, they failed to capture critical…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Yufei Wang , Yi Yu , Wenhan Yang , Lanqing Guo , Lap-Pui Chau , Alex C. Kot , Bihan Wen

Diffusion Model-Based Image Editing: A Survey

Denoising diffusion models have emerged as a powerful tool for various image generation and editing tasks, facilitating the synthesis of visual content in an unconditional or input-conditional manner. The core idea behind them is learning…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Yi Huang , Jiancheng Huang , Yifan Liu , Mingfu Yan , Jiaxi Lv , Jianzhuang Liu , Wei Xiong , He Zhang , Liangliang Cao , Shifeng Chen

Investigating the Design Space of Diffusion Models for Speech Enhancement

Diffusion models are a new class of generative models that have shown outstanding performance in image generation literature. As a consequence, studies have attempted to apply diffusion models to other tasks, such as speech enhancement. A…

Audio and Speech Processing · Electrical Eng. & Systems 2024-10-10 Philippe Gonzalez , Zheng-Hua Tan , Jan Østergaard , Jesper Jensen , Tommy Sonne Alstrøm , Tobias May

Latent Feature-Guided Diffusion Models for Shadow Removal

Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images. In this paper, we propose the use of diffusion models as they offer a promising approach to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Kangfu Mei , Luis Figueroa , Zhe Lin , Zhihong Ding , Scott Cohen , Vishal M. Patel

Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features

We present Diff3F as a simple, robust, and class-agnostic feature descriptor that can be computed for untextured input shapes (meshes or point clouds). Our method distills diffusion features from image foundational models onto input shapes.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Niladri Shekhar Dutt , Sanjeev Muralikrishnan , Niloy J. Mitra

Cold Diffusion for Speech Enhancement

Diffusion models have recently shown promising results for difficult enhancement tasks such as the conditional and unconditional restoration of natural images and audio signals. In this work, we explore the possibility of leveraging a…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-24 Hao Yen , François G. Germain , Gordon Wichern , Jonathan Le Roux

Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence

Diffusion models have been shown to be capable of generating high-quality images, suggesting that they could contain meaningful internal representations. Unfortunately, the feature maps that encode a diffusion model's internal information…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Grace Luo , Lisa Dunlap , Dong Huk Park , Aleksander Holynski , Trevor Darrell

Diffusion Posterior Proximal Sampling for Image Restoration

Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements…

Image and Video Processing · Electrical Eng. & Systems 2024-08-07 Hongjie Wu , Linchao He , Mingqin Zhang , Dongdong Chen , Kunming Luo , Mengting Luo , Ji-Zhe Zhou , Hu Chen , Jiancheng Lv

DNF-Intrinsic: Deterministic Noise-Free Diffusion for Indoor Inverse Rendering

Recent methods have shown that pre-trained diffusion models can be fine-tuned to enable generative inverse rendering by learning image-conditioned noise-to-intrinsic mapping. Despite their remarkable progress, they struggle to robustly…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Rongjia Zheng , Qing Zhang , Chengjiang Long , Wei-Shi Zheng

FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process

The emergence of text-to-image generation models has led to the recognition that image enhancement, performed as post-processing, would significantly improve the visual quality of the generated images. Exploring diffusion models to enhance…

Computer Vision and Pattern Recognition · Computer Science 2024-09-12 Yang Luo , Yiheng Zhang , Zhaofan Qiu , Ting Yao , Zhineng Chen , Yu-Gang Jiang , Tao Mei

Spectral Progressive Diffusion for Efficient Image and Video Generation

Diffusion models have been shown to implicitly generate visual content autoregressively in the frequency domain, where low-frequency components are generated earlier in the denoising process while high-frequency details emerge only in later…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Howard Xiao , Brian Chao , Lior Yariv , Gordon Wetzstein