Related papers: Interpretable Diffusion via Information Decomposit…

DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

Text-to-image diffusion models have made significant progress in generating naturalistic images from textual inputs, and demonstrate the capacity to learn and represent complex visual-semantic relationships. While these diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-11-18 Rushikesh Zawar , Shaurya Dewan , Prakanshul Saxena , Yingshan Chang , Andrew Luo , Yonatan Bisk

Diffusion Model-Based Image Editing: A Survey

Denoising diffusion models have emerged as a powerful tool for various image generation and editing tasks, facilitating the synthesis of visual content in an unconditional or input-conditional manner. The core idea behind them is learning…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Yi Huang , Jiancheng Huang , Yifan Liu , Mingfu Yan , Jiaxi Lv , Jianzhuang Liu , Wei Xiong , He Zhang , Liangliang Cao , Shifeng Chen

Information-Theoretic Diffusion

Denoising diffusion models have spurred significant gains in density modeling and image generation, precipitating an industrial revolution in text-guided AI art generation. We introduce a new mathematical foundation for diffusion models…

Machine Learning · Computer Science 2023-02-09 Xianghao Kong , Rob Brekelmans , Greg Ver Steeg

Compositional Image Decomposition with Diffusion Models

Given an image of a natural scene, we are able to quickly decompose it into a set of components such as objects, lighting, shadows, and foreground. We can then envision a scene where we combine certain components with those from other…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Jocelin Su , Nan Liu , Yanbo Wang , Joshua B. Tenenbaum , Yilun Du

Explaining generative diffusion models via visual analysis for interpretable decision-making process

Diffusion models have demonstrated remarkable performance in generation tasks. Nevertheless, explaining the diffusion process remains challenging due to it being a sequence of denoising noisy images that are difficult for experts to…

Computer Vision and Pattern Recognition · Computer Science 2024-02-19 Ji-Hoon Park , Yeong-Joon Ju , Seong-Whan Lee

Image Embedding for Denoising Generative Models

Denoising Diffusion models are gaining increasing popularity in the field of generative modeling for several reasons, including the simple and stable training, the excellent generative quality, and the solid probabilistic foundation. In…

Computer Vision and Pattern Recognition · Computer Science 2023-01-19 Andrea Asperti , Davide Evangelista , Samuele Marro , Fabio Merizzi

Decomposing stimulus-specific sensory neural information via diffusion models

To understand sensory coding, we must ask not only how much information neurons encode, but also what that information is about. This requires decomposing mutual information into contributions from individual stimuli and stimulus features:…

Neurons and Cognition · Quantitative Biology 2025-10-23 Steeve Laquitaine , Simone Azeglio , Carlo Paris , Ulisse Ferrari , Matthew Chalk

Conditional Image Synthesis with Diffusion Models: A Survey

Conditional image synthesis based on user-specified requirements is a key component in creating complex visual content. In recent years, diffusion-based generative modeling has become a highly effective way for conditional image synthesis,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Zheyuan Zhan , Defang Chen , Jian-Ping Mei , Zhenghe Zhao , Jiawei Chen , Chun Chen , Siwei Lyu , Can Wang

MMG: Mutual Information Estimation via the MMSE Gap in Diffusion

Mutual information (MI) is one of the most general ways to measure relationships between random variables, but estimating this quantity for complex systems is challenging. Denoising diffusion models have recently set a new bar for density…

Machine Learning · Computer Science 2025-11-20 Longxuan Yu , Xing Shi , Xianghao Kong , Tong Jia , Greg Ver Steeg

InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models

While diffusion models excel at generating high-quality samples, their latent variables typically lack semantic meaning and are not suitable for representation learning. Here, we propose InfoDiffusion, an algorithm that augments diffusion…

Machine Learning · Computer Science 2023-06-16 Yingheng Wang , Yair Schiff , Aaron Gokaslan , Weishen Pan , Fei Wang , Christopher De Sa , Volodymyr Kuleshov

Denoising Functional Maps: Diffusion Models for Shape Correspondence

Estimating correspondences between pairs of deformable shapes remains a challenging problem. Despite substantial progress, existing methods lack broad generalization capabilities and require category-specific training data. To address these…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Aleksei Zhuravlev , Zorah Lähner , Vladislav Golyanik

Degradation-Robust Fusion: An Efficient Degradation-Aware Diffusion Framework for Multimodal Image Fusion in Arbitrary Degradation Scenarios

Complex degradations like noise, blur, and low resolution are typical challenges in real world image fusion tasks, limiting the performance and practicality of existing methods. End to end neural network based approaches are generally…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yu Shi , Yu Liu , Zhong-Cheng Wu , Juan Cheng , Huafeng Li , Xun Chen

Learning to Compose Visual Relations

The visual world around us can be described as a structured set of objects and their associated relations. An image of a room may be conjured given only the description of the underlying objects and their associated relations. While there…

Computer Vision and Pattern Recognition · Computer Science 2021-11-18 Nan Liu , Shuang Li , Yilun Du , Joshua B. Tenenbaum , Antonio Torralba

TransFusion: Transcribing Speech with Multinomial Diffusion

Diffusion models have shown exceptional scaling properties in the image synthesis domain, and initial attempts have shown similar benefits for applying diffusion to unconditional text synthesis. Denoising diffusion models attempt to…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-17 Matthew Baas , Kevin Eloff , Herman Kamper

Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression

Diffusion models generate high-quality images through progressive denoising but are computationally intensive due to large model sizes and repeated sampling. Knowledge distillation, which transfers knowledge from a complex teacher to a…

Machine Learning · Computer Science 2025-04-04 Dohyun Kim , Sehwan Park , Geonhee Han , Seung Wook Kim , Paul Hongsuck Seo

Diffusion Models in Vision: A Survey

Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Florinel-Alin Croitoru , Vlad Hondru , Radu Tudor Ionescu , Mubarak Shah

The Hidden Language of Diffusion Models

Text-to-image diffusion models have demonstrated an unparalleled ability to generate high-quality, diverse images from a textual prompt. However, the internal representations learned by these models remain an enigma. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2023-10-06 Hila Chefer , Oran Lang , Mor Geva , Volodymyr Polosukhin , Assaf Shocher , Michal Irani , Inbar Mosseri , Lior Wolf

Learning to Represent Image and Text with Denotation Graph

Learning to fuse vision and language information and representing them is an important research problem with many applications. Recent progresses have leveraged the ideas of pre-training (from language modeling) and attention layers in…

Computer Vision and Pattern Recognition · Computer Science 2020-10-08 Bowen Zhang , Hexiang Hu , Vihan Jain , Eugene Ie , Fei Sha

Real-World Denoising via Diffusion Model

Real-world image denoising is an extremely important image processing problem, which aims to recover clean images from noisy images captured in natural environments. In recent years, diffusion models have achieved very promising results in…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Cheng Yang , Lijing Liang , Zhixun Su

Interpretable Diffusion Models with B-cos Networks

Text-to-image diffusion models generate images by iteratively denoising random noise, conditioned on a prompt. While these models have enabled impressive progress in image generation, they often fail to accurately reflect all semantic…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Nicola Bernold , Moritz Vandenhirtz , Alice Bizeul , Julia E. Vogt