Related papers: Visual Diffusion Models are Geometric Solvers

Image Neural Field Diffusion Models

Diffusion models have shown an impressive ability to model complex data distributions, with several key advantages over GANs, such as stable training, better coverage of the training distribution's modes, and the ability to solve inverse…

Computer Vision and Pattern Recognition · Computer Science · 2024-06-12 · Yinbo Chen, Oliver Wang, Richard Zhang, Eli Shechtman, Xiaolong Wang, Michael Gharbi

Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models

Diffusion models have emerged as the new state-of-the-art generative model with high quality samples, with intriguing properties such as mode coverage and high flexibility. They have also been shown to be effective inverse problem solvers,…

Computer Vision and Pattern Recognition · Computer Science · 2025-10-06 · Hyungjin Chung, Dohoon Ryu, Michael T. McCann, Marc L. Klasky, Jong Chul Ye

Computing Diffusion Geometry

Calculus and geometry are ubiquitous in the theoretical modelling of scientific phenomena, but have historically been very challenging to apply directly to real data as statistics. Diffusion geometry is a new theory that reformulates…

Differential Geometry · Mathematics · 2026-02-09 · Iolo Jones, David Lanners

Toward a Diffusion-Based Generalist for Dense Vision Tasks

Building generalized models that can solve many computer vision tasks simultaneously is an intriguing direction. Recent works have shown that the image itself can be used as a natural interface for general-purpose visual perception and demonstrated…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-02 · Yue Fan, Yongqin Xian, Xiaohua Zhai, Alexander Kolesnikov, Muhammad Ferjad Naeem, Bernt Schiele, Federico Tombari

Diffusion Models in Vision: A Survey

Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward…

Computer Vision and Pattern Recognition · Computer Science · 2025-01-17 · Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah

G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving

Recent literature has effectively leveraged diffusion models trained on continuous variables as priors for solving inverse problems. Notably, discrete diffusion models with discrete latent codes have shown strong performance, particularly…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-22 · Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Bac Nguyen, Stefano Ermon, Yuki Mitsufuji

Diffusion Posterior Sampling for General Noisy Inverse Problems

Diffusion models have been recently studied as powerful generative inverse problem solvers, owing to their high quality reconstructions and the ease of combining existing iterative solvers. However, most works focus on solving simple linear…

Machine Learning · Statistics · 2025-10-06 · Hyungjin Chung, Jeongsol Kim, Michael T. McCann, Marc L. Klasky, Jong Chul Ye

Diffusion Models in 3D Vision: A Survey

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-02 · Zhen Wang, Dongyuan Li, Yaozu Wu, Tianyu He, Jiang Bian, Renhe Jiang

Diffusion Models Beat GANs on Image Classification

While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which uses a single pre-training stage to address both…

Computer Vision and Pattern Recognition · Computer Science · 2023-07-18 · Soumik Mukhopadhyay, Matthew Gwilliam, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Srinidhi Hegde, Tianyi Zhou, Abhinav Shrivastava

Geometry Distributions

Neural representations of 3D data have been widely adopted across various applications, particularly in recent work leveraging coordinate-based networks to model scalar or vector fields. However, these approaches face inherent challenges,…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-24 · Biao Zhang, Jing Ren, Peter Wonka

Visual Bridge: Universal Visual Perception Representations Generating

Recent advances in diffusion models have achieved remarkable success in isolated computer vision tasks such as text-to-image generation, depth estimation, and optical flow. However, these models are often restricted by a…

Computer Vision and Pattern Recognition · Computer Science · 2025-11-12 · Yilin Gao, Shuguang Dou, Junzhou Li, Zhiheng Yu, Yin Li, Dongsheng Jiang, Shugong Xu

Epipolar Geometry Improves Video Generation Models

Video generation models have progressed tremendously through large latent diffusion transformers trained with rectified flow techniques. Yet these models still struggle with geometric inconsistencies, unstable motion, and visual artifacts…

Computer Vision and Pattern Recognition · Computer Science · 2025-10-27 · Orest Kupyn, Fabian Manhardt, Federico Tombari, Christian Rupprecht

Diffusion Models in Low-Level Vision: A Survey

Deep generative models have garnered significant attention in low-level vision tasks due to their generative capabilities. Among them, diffusion model-based solutions, characterized by a forward diffusion process and a reverse denoising…

Computer Vision and Pattern Recognition · Computer Science · 2025-02-26 · Chunming He, Yuqi Shen, Chengyu Fang, Fengyang Xiao, Longxiang Tang, Yulun Zhang, Wangmeng Zuo, Zhenhua Guo, Xiu Li

DiffSG: A Generative Solver for Network Optimization with Diffusion Model

Generative diffusion models, famous for their performance in image generation, are popular in various cross-domain applications. However, their use in the communication community has been mostly limited to auxiliary tasks like data modeling…

Networking and Internet Architecture · Computer Science · 2025-03-11 · Ruihuai Liang, Bo Yang, Zhiwen Yu, Bin Guo, Xuelin Cao, Mérouane Debbah, H. Vincent Poor, Chau Yuen

DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models

Understanding and modeling lighting effects are fundamental tasks in computer vision and graphics. Classic physically-based rendering (PBR) accurately simulates the light transport, but relies on precise scene representations--explicit 3D…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-25 · Ruofan Liang, Zan Gojcic, Huan Ling, Jacob Munkberg, Jon Hasselgren, Zhi-Hao Lin, Jun Gao, Alexander Keller, Nandita Vijaykumar, Sanja Fidler, Zian Wang

Pyramidal Denoising Diffusion Probabilistic Models

Recently, diffusion models have demonstrated impressive image generation performance and have been extensively studied in various computer vision tasks. Unfortunately, training and evaluating diffusion models consume a lot of time and…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-03 · Dohoon Ryu, Jong Chul Ye

From Image to Video: An Empirical Study of Diffusion Representations

Diffusion models have revolutionized generative modeling, enabling unprecedented realism in image and video synthesis. This success has sparked interest in leveraging their representations for visual understanding tasks. While recent works…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-21 · Pedro Vélez, Luisa F. Polanía, Yi Yang, Chuhan Zhang, Rishabh Kabra, Anurag Arnab, Mehdi S. M. Sajjadi

Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models

Using image models naively for solving inverse video problems often suffers from flickering, texture-sticking, and temporal inconsistency in generated videos. To tackle these problems, in this paper, we view frames as continuous functions…

Computer Vision and Pattern Recognition · Computer Science · 2024-10-23 · Giannis Daras, Weili Nie, Karsten Kreis, Alex Dimakis, Morteza Mardani, Nikola Borislavov Kovachki, Arash Vahdat

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models

Text-guided diffusion models have shown superior performance in image/video generation and editing. However, few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this…

Computer Vision and Pattern Recognition · Computer Science · 2023-10-13 · Gang Li, Heliang Zheng, Chaoyue Wang, Chang Li, Changwen Zheng, Dacheng Tao

Efficient Diffusion Models for Vision: A Survey

Diffusion Models (DMs) have demonstrated state-of-the-art performance in content generation without requiring adversarial training. These models are trained using a two-step process. First, a forward (diffusion) process gradually adds…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-13 · Anwaar Ulhaq, Naveed Akhtar