Related papers: StableNormal: Reducing Diffusion Variance for Stab…

TransNormal: Dense Visual Semantics for Diffusion-based Transparent Object Normal Estimation

Monocular normal estimation for transparent objects is critical for laboratory automation, yet it remains challenging due to complex light refraction and reflection. These optical properties often lead to catastrophic failures in…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Mingwei Li , Hehe Fan , Yi Yang

StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation

Recovering material information from images has been extensively studied in computer graphics and vision. Recent works in material estimation leverage diffusion model showing promising results. However, these diffusion-based methods adopt a…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Xiuchao Wu , Pengfei Zhu , Jiangjing Lyu , Xinguo Liu , Jie Guo , Yanwen Guo , Weiwei Xu , Chengfei Lyu

Deblurring via Stochastic Refinement

Image deblurring is an ill-posed problem with multiple plausible solutions for a given input image. However, most existing methods produce a deterministic estimate of the clean image and are trained to minimize pixel-level distortion. These…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 Jay Whang , Mauricio Delbracio , Hossein Talebi , Chitwan Saharia , Alexandros G. Dimakis , Peyman Milanfar

Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation

Monocular depth estimation is a crucial task in computer vision. While existing methods have shown impressive results under standard conditions, they often face challenges in reliably performing in scenarios such as low-light or rainy…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Yifan Mao , Jian Liu , Xianming Liu

StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation

We present StableMotion, a novel framework leverages knowledge (geometry and content priors) from pretrained large-scale image diffusion models to perform motion estimation, solving single-image-based image rectification tasks such as…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Ziyi Wang , Haipeng Li , Lin Sui , Tianhao Zhou , Hai Jiang , Lang Nie , Shuaicheng Liu

Stabilizing Diffusion Posterior Sampling by Noise--Frequency Continuation

Diffusion posterior sampling solves inverse problems by combining a pretrained diffusion prior with measurement-consistency guidance, but it often fails to recover fine details because measurement terms are applied in a manner that is…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Feng Tian , Yixuan Li , Weili Zeng , Weitian Zhang , Yichao Yan , Xiaokang Yang

DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency

Diffusion models have established new state of the art in a multitude of computer vision tasks, including image restoration. Diffusion-based inverse problem solvers generate reconstructions of exceptional visual quality from heavily…

Image and Video Processing · Electrical Eng. & Systems 2024-08-21 Zalan Fabian , Berk Tinaz , Mahdi Soltanolkotabi

StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning

We introduce StableMaterials, a novel approach for generating photorealistic physical-based rendering (PBR) materials that integrate semi-supervised learning with Latent Diffusion Models (LDMs). Our method employs adversarial training to…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Giuseppe Vecchio

Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection

Recent advances in diffusion models have spurred research into their application for Reconstruction-based unsupervised anomaly detection. However, these methods may struggle with maintaining structural integrity and recovering the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-27 Farzad Beizaee , Gregory A. Lodygensky , Christian Desrosiers , Jose Dolz

Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction

In a great number of tasks in science and engineering, the goal is to infer an unknown image from a small number of measurements collected from a known forward model describing certain sensing or imaging modality. Due to resource…

Image and Video Processing · Electrical Eng. & Systems 2024-06-13 Xingyu Xu , Yuejie Chi

PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage

This work addresses the task of zero-shot monocular depth estimation. A recent advance in this field has been the idea of utilising Text-to-Image foundation models, such as Stable Diffusion. Foundation models provide a rich and generic…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Denis Zavadski , Damjan Kalšan , Carsten Rother

Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion

3D data simulation aims to bridge the gap between simulated and real-captured 3D data, which is a fundamental problem for real-world 3D visual tasks. Most 3D data simulation methods inject predefined physical priors but struggle to capture…

Computer Vision and Pattern Recognition · Computer Science 2025-08-01 Mutian Xu , Chongjie Ye , Haolin Liu , Yushuang Wu , Jiahao Chang , Xiaoguang Han

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models

Recently, diffusion models have made remarkable progress in text-to-image (T2I) generation, synthesizing images with high fidelity and diverse contents. Despite this advancement, latent space smoothness within diffusion models remains…

Computer Vision and Pattern Recognition · Computer Science 2023-12-08 Jiayi Guo , Xingqian Xu , Yifan Pu , Zanlin Ni , Chaofei Wang , Manushree Vasu , Shiji Song , Gao Huang , Humphrey Shi

Stable Backward Diffusion Models that Minimise Convex Energies

The inverse problem of backward diffusion is known to be ill-posed and highly unstable. Backward diffusion processes appear naturally in image enhancement and deblurring applications. It is therefore greatly desirable to establish a…

Numerical Analysis · Mathematics 2020-06-18 Leif Bergerhoff , Marcelo Cárdenas , Joachim Weickert , Martin Welk

Diffusion Posterior Proximal Sampling for Image Restoration

Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements…

Image and Video Processing · Electrical Eng. & Systems 2024-08-07 Hongjie Wu , Linchao He , Mingqin Zhang , Dongdong Chen , Kunming Luo , Mengting Luo , Ji-Zhe Zhou , Hu Chen , Jiancheng Lv

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Recent work showed that large diffusion models can be reused as highly precise monocular depth estimators by casting depth estimation as an image-conditional image generation task. While the proposed model achieved state-of-the-art results,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Gonzalo Martin Garcia , Karim Knaebel , Christian Schmidt , Daan de Geus , Alexander Hermans , Bastian Leibe

Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners

Diffusion models, such as Stable Diffusion, have shown incredible performance on text-to-image generation. Since text-to-image generation often requires models to generate visual concepts with fine-grained details and attributes specified…

Computer Vision and Pattern Recognition · Computer Science 2024-04-26 Xuehai He , Weixi Feng , Tsu-Jui Fu , Varun Jampani , Arjun Akula , Pradyumna Narayana , Sugato Basu , William Yang Wang , Xin Eric Wang

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Diffusion models have recently emerged as powerful generative priors for solving inverse problems. However, training diffusion models in the pixel space are both data-intensive and computationally demanding, which restricts their…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Bowen Song , Soo Min Kwon , Zecheng Zhang , Xinyu Hu , Qing Qu , Liyue Shen

Preserving Image Properties Through Initializations in Diffusion Models

Retail photography imposes specific requirements on images. For instance, images may need uniform background colors, consistent model poses, centered products, and consistent lighting. Minor deviations from these standards impact a site's…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Jeffrey Zhang , Shao-Yu Chang , Kedan Li , David Forsyth

Common Diffusion Noise Schedules and Sample Steps are Flawed

We discover that common diffusion noise schedules do not enforce the last timestep to have zero signal-to-noise ratio (SNR), and some implementations of diffusion samplers do not start from the last timestep. Such designs are flawed and do…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Shanchuan Lin , Bingchen Liu , Jiashi Li , Xiao Yang