Related papers: Diffusion-based Visual Anagram as Multi-task Learn…

Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models

We address the problem of synthesizing multi-view optical illusions: images that change appearance upon a transformation, such as a flip or rotation. We propose a simple, zero-shot method for obtaining these illusions from off-the-shelf…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Daniel Geng , Inbum Park , Andrew Owens

A Variational Perspective on Solving Inverse Problems with Diffusion Models

Diffusion models have emerged as a key pillar of foundation models in visual domains. One of their critical applications is to universally solve different downstream inverse tasks via a single diffusion prior without re-training for each…

Machine Learning · Computer Science 2023-10-03 Morteza Mardani , Jiaming Song , Jan Kautz , Arash Vahdat

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

Novel-view synthesis through diffusion models has demonstrated remarkable potential for generating diverse and high-quality images. Yet, the independent process of image generation in these prevailing methods leads to challenges in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Xianghui Yang , Yan Zuo , Sameera Ramasinghe , Loris Bazzani , Gil Avraham , Anton van den Hengel

Diffusion Models in Vision: A Survey

Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Florinel-Alin Croitoru , Vlad Hondru , Radu Tudor Ionescu , Mubarak Shah

Explaining generative diffusion models via visual analysis for interpretable decision-making process

Diffusion models have demonstrated remarkable performance in generation tasks. Nevertheless, explaining the diffusion process remains challenging due to it being a sequence of denoising noisy images that are difficult for experts to…

Computer Vision and Pattern Recognition · Computer Science 2024-02-19 Ji-Hoon Park , Yeong-Joon Ju , Seong-Whan Lee

The Art of Deception: Color Visual Illusions and Diffusion Models

Visual illusions in humans arise when interpreting out-of-distribution stimuli: if the observer is adapted to certain statistics, perception of outliers deviates from reality. Recent studies have shown that artificial neural networks (ANNs)…

Computer Vision and Pattern Recognition · Computer Science 2024-12-16 Alex Gomez-Villa , Kai Wang , Alejandro C. Parraga , Bartlomiej Twardowski , Jesus Malo , Javier Vazquez-Corral , Joost van de Weijer

VMDiff: Visual Mixing Diffusion for Limitless Cross-Object Synthesis

Creating novel images by fusing visual cues from multiple sources is a fundamental yet underexplored problem in image-to-image generation, with broad applications in artistic creation, virtual reality and visual media. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Zeren Xiong , Yue Yu , Zedong Zhang , Shuo Chen , Jian Yang , Jun Li

Diffusion Model for Generative Image Denoising

In supervised learning for image denoising, usually the paired clean images and noisy images are collected or synthesised to train a denoising model. L2 norm loss or other distance functions are used as the objective function for training.…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Yutong Xie , Minne Yuan , Bin Dong , Quanzheng Li

Visual Bridge: Universal Visual Perception Representations Generating

Recent advances in diffusion models have achieved remarkable success in isolated computer vision tasks such as text-to-image generation, depth estimation, and optical flow. However, these models are often restricted by a…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Yilin Gao , Shuguang Dou , Junzhou Li , Zhiheng Yu , Yin Li , Dongsheng Jiang , Shugong Xu

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

With the success of image generation, generative diffusion models are increasingly adopted for discriminative tasks, as pixel generation provides a unified perception interface. However, directly repurposing the generative denoising process…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Ziqi Pang , Xin Xu , Yu-Xiong Wang

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation

A diffusion probabilistic model (DPM), which constructs a forward diffusion process by gradually adding noise to data points and learns the reverse denoising process to generate new samples, has been shown to handle complex data…

Computer Vision and Pattern Recognition · Computer Science 2023-10-16 Zhengxiong Luo , Dayou Chen , Yingya Zhang , Yan Huang , Liang Wang , Yujun Shen , Deli Zhao , Jingren Zhou , Tieniu Tan

Image Inversion: A Survey from GANs to Diffusion and Beyond

Image inversion is a fundamental task in generative models, aiming to map images back to their latent representations to enable downstream applications such as editing, restoration, and style transfer. This paper provides a comprehensive…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Yinan Chen , Jiangning Zhang , Yali Bi , Xiaobin Hu , Teng Hu , Zhucun Xue , Ran Yi , Yong Liu , Ying Tai

Do text-free diffusion models learn discriminative visual representations?

While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which addresses both families of tasks simultaneously. We…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Soumik Mukhopadhyay , Matthew Gwilliam , Yosuke Yamaguchi , Vatsal Agarwal , Namitha Padmanabhan , Archana Swaminathan , Tianyi Zhou , Jun Ohya , Abhinav Shrivastava

Degradation-Robust Fusion: An Efficient Degradation-Aware Diffusion Framework for Multimodal Image Fusion in Arbitrary Degradation Scenarios

Complex degradations like noise, blur, and low resolution are typical challenges in real world image fusion tasks, limiting the performance and practicality of existing methods. End to end neural network based approaches are generally…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yu Shi , Yu Liu , Zhong-Cheng Wu , Juan Cheng , Huafeng Li , Xun Chen

Diffusion idea exploration for art generation

Cross-Modal learning tasks have picked up pace in recent times. With plethora of applications in diverse areas, generation of novel content using multiple modalities of data has remained a challenging problem. To address the same, various…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Nikhil Verma

Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision

Denoising diffusion models are a powerful type of generative models used to capture complex distributions of real-world signals. However, their applicability is limited to scenarios where training samples are readily available, which is not…

Computer Vision and Pattern Recognition · Computer Science 2023-11-20 Ayush Tewari , Tianwei Yin , George Cazenavette , Semon Rezchikov , Joshua B. Tenenbaum , Frédo Durand , William T. Freeman , Vincent Sitzmann

OmniPrism: Learning Disentangled Visual Concept for Image Generation

Creative visual concept generation often draws inspiration from specific concepts in a reference image to produce relevant outcomes. However, existing methods are typically constrained to single-aspect concept generation or are easily…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yangyang Li , Daqing Liu , Wu Liu , Allen He , Xinchen Liu , Yongdong Zhang , Guoqing Jin

Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later

How do diffusion generative models convert pure noise into meaningful images? In a variety of pretrained diffusion models (including conditional latent space models like Stable Diffusion), we observe that the reverse diffusion process that…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Binxu Wang , John J. Vastola

Modelos Generativos basados en Mecanismos de Difusi\'on

Diffusion-based generative models are a design framework that allows generating new images from processes analogous to those found in non-equilibrium thermodynamics. These models model the reversal of a physical diffusion process in which…

Artificial Intelligence · Computer Science 2023-02-21 Jordi de la Torre

Denoising Diffusion Bridge Models

Diffusion models are powerful generative models that map noise to data using stochastic processes. However, for many applications such as image editing, the model input comes from a distribution that is not random noise. As such, diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Linqi Zhou , Aaron Lou , Samar Khanna , Stefano Ermon