Related papers: StableNormal: Reducing Diffusion Variance for Stab…

200 papers

Monocular normal estimation for transparent objects is critical for laboratory automation, yet it remains challenging due to complex light refraction and reflection. These optical properties often lead to catastrophic failures in…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Mingwei Li, Hehe Fan, Yi Yang

Recovering material information from images has been extensively studied in computer graphics and vision. Recent works in material estimation leverage diffusion models, showing promising results. However, these diffusion-based methods adopt a…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Xiuchao Wu, Pengfei Zhu, Jiangjing Lyu, Xinguo Liu, Jie Guo, Yanwen Guo, Weiwei Xu, Chengfei Lyu

Image deblurring is an ill-posed problem with multiple plausible solutions for a given input image. However, most existing methods produce a deterministic estimate of the clean image and are trained to minimize pixel-level distortion. These…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 Jay Whang, Mauricio Delbracio, Hossein Talebi, Chitwan Saharia, Alexandros G. Dimakis, Peyman Milanfar

Monocular depth estimation is a crucial task in computer vision. While existing methods have shown impressive results under standard conditions, they often face challenges in reliably performing in scenarios such as low-light or rainy…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Yifan Mao, Jian Liu, Xianming Liu

We present StableMotion, a novel framework that leverages knowledge (geometry and content priors) from pretrained large-scale image diffusion models to perform motion estimation, solving single-image-based image rectification tasks such as…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Ziyi Wang, Haipeng Li, Lin Sui, Tianhao Zhou, Hai Jiang, Lang Nie, Shuaicheng Liu

Diffusion posterior sampling solves inverse problems by combining a pretrained diffusion prior with measurement-consistency guidance, but it often fails to recover fine details because measurement terms are applied in a manner that is…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Feng Tian, Yixuan Li, Weili Zeng, Weitian Zhang, Yichao Yan, Xiaokang Yang

Diffusion models have established new state of the art in a multitude of computer vision tasks, including image restoration. Diffusion-based inverse problem solvers generate reconstructions of exceptional visual quality from heavily…

Image and Video Processing · Electrical Eng. & Systems 2024-08-21 Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi

We introduce StableMaterials, a novel approach for generating photorealistic physically based rendering (PBR) materials that integrates semi-supervised learning with Latent Diffusion Models (LDMs). Our method employs adversarial training to…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Giuseppe Vecchio

Recent advances in diffusion models have spurred research into their application for reconstruction-based unsupervised anomaly detection. However, these methods may struggle with maintaining structural integrity and recovering the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-27 Farzad Beizaee, Gregory A. Lodygensky, Christian Desrosiers, Jose Dolz

In a great number of tasks in science and engineering, the goal is to infer an unknown image from a small number of measurements collected from a known forward model describing certain sensing or imaging modality. Due to resource…

Image and Video Processing · Electrical Eng. & Systems 2024-06-13 Xingyu Xu, Yuejie Chi

This work addresses the task of zero-shot monocular depth estimation. A recent advance in this field has been the idea of utilising Text-to-Image foundation models, such as Stable Diffusion. Foundation models provide a rich and generic…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Denis Zavadski, Damjan Kalšan, Carsten Rother

3D data simulation aims to bridge the gap between simulated and real-captured 3D data, which is a fundamental problem for real-world 3D visual tasks. Most 3D data simulation methods inject predefined physical priors but struggle to capture…

Computer Vision and Pattern Recognition · Computer Science 2025-08-01 Mutian Xu, Chongjie Ye, Haolin Liu, Yushuang Wu, Jiahao Chang, Xiaoguang Han

Recently, diffusion models have made remarkable progress in text-to-image (T2I) generation, synthesizing images with high fidelity and diverse contents. Despite this advancement, latent space smoothness within diffusion models remains…

Computer Vision and Pattern Recognition · Computer Science 2023-12-08 Jiayi Guo, Xingqian Xu, Yifan Pu, Zanlin Ni, Chaofei Wang, Manushree Vasu, Shiji Song, Gao Huang, Humphrey Shi

The inverse problem of backward diffusion is known to be ill-posed and highly unstable. Backward diffusion processes appear naturally in image enhancement and deblurring applications. It is therefore greatly desirable to establish a…

Numerical Analysis · Mathematics 2020-06-18 Leif Bergerhoff, Marcelo Cárdenas, Joachim Weickert, Martin Welk

Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements…

Image and Video Processing · Electrical Eng. & Systems 2024-08-07 Hongjie Wu, Linchao He, Mingqin Zhang, Dongdong Chen, Kunming Luo, Mengting Luo, Ji-Zhe Zhou, Hu Chen, Jiancheng Lv

Recent work showed that large diffusion models can be reused as highly precise monocular depth estimators by casting depth estimation as an image-conditional image generation task. While the proposed model achieved state-of-the-art results,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Gonzalo Martin Garcia, Karim Knaebel, Christian Schmidt, Daan de Geus, Alexander Hermans, Bastian Leibe

Diffusion models, such as Stable Diffusion, have shown incredible performance on text-to-image generation. Since text-to-image generation often requires models to generate visual concepts with fine-grained details and attributes specified…

Computer Vision and Pattern Recognition · Computer Science 2024-04-26 Xuehai He, Weixi Feng, Tsu-Jui Fu, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang

Diffusion models have recently emerged as powerful generative priors for solving inverse problems. However, training diffusion models in the pixel space is both data-intensive and computationally demanding, which restricts their…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Bowen Song, Soo Min Kwon, Zecheng Zhang, Xinyu Hu, Qing Qu, Liyue Shen

Retail photography imposes specific requirements on images. For instance, images may need uniform background colors, consistent model poses, centered products, and consistent lighting. Minor deviations from these standards impact a site's…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Jeffrey Zhang, Shao-Yu Chang, Kedan Li, David Forsyth

We discover that common diffusion noise schedules do not enforce the last timestep to have zero signal-to-noise ratio (SNR), and some implementations of diffusion samplers do not start from the last timestep. Such designs are flawed and do…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Shanchuan Lin, Bingchen Liu, Jiashi Li, Xiao Yang