Related papers: Repurposing Diffusion-Based Image Generators for M…

MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth Estimation

Recovering metric depth from a single image remains a fundamental challenge in computer vision, requiring both scene understanding and accurate scaling. While deep learning has advanced monocular depth estimation, current models often…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Ansh Shah, K Madhava Krishna

Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion

Depth completion upgrades sparse depth measurements into dense depth maps guided by a conventional image. Existing methods for this highly ill-posed task operate in tightly constrained settings and tend to struggle when applied to images…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Massimiliano Viola, Kevin Qu, Nando Metzger, Bingxin Ke, Alexander Becker, Konrad Schindler, Anton Obukhov

PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage

This work addresses the task of zero-shot monocular depth estimation. A recent advance in this field has been the idea of utilising Text-to-Image foundation models, such as Stable Diffusion. Foundation models provide a rich and generic…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Denis Zavadski, Damjan Kalšan, Carsten Rother

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

Monocular depth estimation is a challenging task that predicts the pixel-wise depth from a single 2D image. Current methods typically model this problem as a regression or classification task. We propose DiffusionDepth, a new approach that…

Computer Vision and Pattern Recognition · Computer Science 2023-08-30 Yiqun Duan, Xianda Guo, Zheng Zhu

Monocular Depth Estimation using Diffusion Models

We formulate monocular depth estimation using denoising diffusion models, inspired by their recent successes in high fidelity image generation. To that end, we introduce innovations to address problems arising due to noisy, incomplete depth…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Saurabh Saxena, Abhishek Kar, Mohammad Norouzi, David J. Fleet

Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues

Recent monocular metric depth estimation (MMDE) methods have made notable progress towards zero-shot generalization. However, they still exhibit a significant performance drop on out-of-distribution datasets. We address this limitation by…

Computer Vision and Pattern Recognition · Computer Science 2025-05-26 Chinmay Talegaonkar, Nikhil Gandudi Suresh, Zachary Novack, Yash Belhe, Priyanka Nagasamudra, Nicholas Antipa

Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion

Unsupervised monocular depth estimation has received widespread attention because of its capability to train without ground truth. In real-world scenarios, the images may be blurry or noisy due to the influence of weather conditions and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Runze Liu, Dongchen Zhu, Guanghui Zhang, Yue Xu, Wenjun Shi, Xiaolin Zhang, Lei Wang, Jiamao Li

Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis

The success of deep learning in computer vision over the past decade has hinged on large labeled datasets and strong pretrained models. In data-scarce settings, the quality of these pretrained models becomes crucial for effective transfer…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Bingxin Ke, Kevin Qu, Tianfu Wang, Nando Metzger, Shengyu Huang, Bo Li, Anton Obukhov, Konrad Schindler

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

In this paper, we present DM-Calib, a diffusion-based approach for estimating pinhole camera intrinsic parameters from a single input image. Monocular camera calibration is essential for many 3D vision tasks. However, most existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Junyuan Deng, Wei Yin, Xiaoyang Guo, Qian Zhang, Xiaotao Hu, Weiqiang Ren, Xiao-Xiao Long, Ping Tan

Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding

Monocular depth estimation involves predicting depth from a single RGB image and plays a crucial role in applications such as autonomous driving, robotic navigation, 3D reconstruction, etc. Recent advancements in learning-based methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-02-05 Jingming Xia, Guanqun Cao, Guang Ma, Yiben Luo, Qinzhao Li, John Oyekan

Monocular Depth Estimation From the Perspective of Feature Restoration: A Diffusion Enhanced Depth Restoration Approach

Monocular Depth Estimation (MDE) is a fundamental computer vision task with important applications in 3D vision. The current mainstream MDE methods employ an encoder-decoder architecture with multi-level/scale feature processing. However,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Huibin Bai, Shuai Li, Hanxiao Zhai, Yanbo Gao, Chong Lv, Yibo Wang, Haipeng Ping, Wei Hua, Xingyu Gao

MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model

Over the past few years, self-supervised monocular depth estimation that does not depend on ground-truth during the training phase has received widespread attention. Most efforts focus on designing different types of network architectures…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Shuwei Shao, Zhongcai Pei, Weihai Chen, Dingchi Sun, Peter C. Y. Chen, Zhengguo Li

Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions

We present a novel approach designed to address the complexities posed by challenging, out-of-distribution data in the single-image depth estimation task. Starting with images that facilitate depth prediction due to the absence of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi

Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Recent work showed that large diffusion models can be reused as highly precise monocular depth estimators by casting depth estimation as an image-conditional image generation task. While the proposed model achieved state-of-the-art results,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Gonzalo Martin Garcia, Karim Knaebel, Christian Schmidt, Daan de Geus, Alexander Hermans, Bastian Leibe

SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation

We propose SharpDepth, a novel approach to monocular metric depth estimation that combines the metric accuracy of discriminative depth estimation methods (e.g., Metric3D, UniDepth) with the fine-grained boundary sharpness typically achieved…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Duc-Hai Pham, Tung Do, Phong Nguyen, Binh-Son Hua, Khoi Nguyen, Rang Nguyen

Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering

Accurate depth estimation is at the core of many applications in computer graphics, vision, and robotics. Current state-of-the-art monocular depth estimators, trained on extensive datasets, generalize well but lack 3D consistency needed for…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Laura Fink, Linus Franke, Bernhard Egger, Joachim Keinert, Marc Stamminger

Monocular Depth Estimation: A Survey

Monocular depth estimation is often described as an ill-posed and inherently ambiguous problem. Estimating depth from 2D images is a crucial step in scene reconstruction, 3D object recognition, segmentation, and detection. The problem can be…

Computer Vision and Pattern Recognition · Computer Science 2019-01-29 Amlaan Bhoi

Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation

Monocular depth estimation is a crucial task in computer vision. While existing methods have shown impressive results under standard conditions, they often face challenges in reliably performing in scenarios such as low-light or rainy…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Yifan Mao, Jian Liu, Xianming Liu

Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation

Monocular depth estimation, enabled by self-supervised learning, is a key technique for 3D perception in computer vision. However, it faces significant challenges in real-world scenarios, which encompass adverse weather variations, motion…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Runze Chen, Haiyong Luo, Fang Zhao, Jingze Yu, Yupeng Jia, Juan Wang, Xuepeng Ma

DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation

Monocular camera calibration is a key precondition for numerous 3D vision applications. Despite considerable advancements, existing methods often hinge on specific assumptions and struggle to generalize across varied real-world scenarios,…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Xiankang He, Guangkai Xu, Bo Zhang, Hao Chen, Ying Cui, Dongyan Guo