Related papers: Monocular Depth Estimation using Diffusion Models

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

Monocular depth estimation is a challenging task that predicts the pixel-wise depth from a single 2D image. Current methods typically model this problem as a regression or classification task. We propose DiffusionDepth, a new approach that…

Computer Vision and Pattern Recognition · Computer Science 2023-08-30 Yiqun Duan , Xianda Guo , Zheng Zhu

MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model

Over the past few years, self-supervised monocular depth estimation that does not depend on ground-truth during the training phase has received widespread attention. Most efforts focus on designing different types of network architectures…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Shuwei Shao , Zhongcai Pei , Weihai Chen , Dingchi Sun , Peter C. Y. Chen , Zhengguo Li

Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion

Unsupervised monocular depth estimation has received widespread attention because of its capability to train without ground truth. In real-world scenarios, the images may be blurry or noisy due to the influence of weather conditions and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Runze Liu , Dongchen Zhu , Guanghui Zhang , Yue Xu , Wenjun Shi , Xiaolin Zhang , Lei Wang , Jiamao Li

The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation

Denoising diffusion probabilistic models have transformed image generation with their impressive fidelity and diversity. We show that they also excel in estimating optical flow and monocular depth, surprisingly, without task-specific…

Computer Vision and Pattern Recognition · Computer Science 2023-12-07 Saurabh Saxena , Charles Herrmann , Junhwa Hur , Abhishek Kar , Mohammad Norouzi , Deqing Sun , David J. Fleet

Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions

We present a novel approach designed to address the complexities posed by challenging, out-of-distribution data in the single-image depth estimation task. Starting with images that facilitate depth prediction due to the absence of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Fabio Tosi , Pierluigi Zama Ramirez , Matteo Poggi

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth from a single image is geometrically ill-posed and requires scene understanding, so it is not surprising that the rise of deep learning has led to a…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Bingxin Ke , Anton Obukhov , Shengyu Huang , Nando Metzger , Rodrigo Caye Daudt , Konrad Schindler

Error Diagnosis of Deep Monocular Depth Estimation Models

Estimating depth from a monocular image is an ill-posed problem: when the camera projects a 3D scene onto a 2D plane, depth information is inherently and permanently lost. Nevertheless, recent work has shown impressive results in estimating…

Computer Vision and Pattern Recognition · Computer Science 2021-12-13 Jagpreet Chawla , Nikhil Thakurdesai , Anuj Godase , Md Reza , David Crandall , Soon-Heung Jung

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

While methods for monocular depth estimation have made significant strides on standard benchmarks, zero-shot metric depth estimation remains unsolved. Challenges include the joint modeling of indoor and outdoor scenes, which often exhibit…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Saurabh Saxena , Junhwa Hur , Charles Herrmann , Deqing Sun , David J. Fleet

Leveraging Stable Diffusion for Monocular Depth Estimation via Image Semantic Encoding

Monocular depth estimation involves predicting depth from a single RGB image and plays a crucial role in applications such as autonomous driving, robotic navigation, 3D reconstruction, etc. Recent advancements in learning-based methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-02-05 Jingming Xia , Guanqun Cao , Guang Ma , Yiben Luo , Qinzhao Li , John Oyekan

PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage

This work addresses the task of zero-shot monocular depth estimation. A recent advance in this field has been the idea of utilising Text-to-Image foundation models, such as Stable Diffusion. Foundation models provide a rich and generic…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Denis Zavadski , Damjan Kalšan , Carsten Rother

Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

The success of monocular depth estimation relies on large and diverse training sets. Due to the challenges associated with acquiring dense ground-truth depth across different environments at scale, a number of datasets with distinct…

Computer Vision and Pattern Recognition · Computer Science 2020-08-26 René Ranftl , Katrin Lasinger , David Hafner , Konrad Schindler , Vladlen Koltun

Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation

Monocular depth estimation, enabled by self-supervised learning, is a key technique for 3D perception in computer vision. However, it faces significant challenges in real-world scenarios, which encompass adverse weather variations, motion…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Runze Chen , Haiyong Luo , Fang Zhao , Jingze Yu , Yupeng Jia , Juan Wang , Xuepeng Ma

Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation

Monocular depth estimation is a crucial task in computer vision. While existing methods have shown impressive results under standard conditions, they often face challenges in reliably performing in scenarios such as low-light or rainy…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Yifan Mao , Jian Liu , Xianming Liu

Monocular Depth Estimation with Hierarchical Fusion of Dilated CNNs and Soft-Weighted-Sum Inference

Monocular depth estimation is a challenging task in complex compositions depicting multiple objects of diverse scales. Albeit the recent great progress thanks to the deep convolutional neural networks (CNNs), the state-of-the-art monocular…

Computer Vision and Pattern Recognition · Computer Science 2017-08-09 Bo Li , Yuchao Dai , Mingyi He

DepthMaster: Taming Diffusion Models for Monocular Depth Estimation

Monocular depth estimation within the diffusion-denoising paradigm demonstrates impressive generalization ability but suffers from low inference speed. Recent methods adopt a single-step deterministic paradigm to improve inference…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Ziyang Song , Zerong Wang , Bo Li , Hao Zhang , Ruijie Zhu , Li Liu , Peng-Tao Jiang , Tianzhu Zhang

Deep Optics for Monocular Depth Estimation and 3D Object Detection

Depth estimation and 3D object detection are critical for scene understanding but remain challenging to perform with a single image due to the loss of 3D information during image capture. Recent models using deep neural networks have…

Computer Vision and Pattern Recognition · Computer Science 2019-04-19 Julie Chang , Gordon Wetzstein

EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation

Due to their text-to-image synthesis feature, diffusion models have recently seen a rise in visual perception tasks, such as depth estimation. The lack of good-quality datasets makes the extraction of a fine-grain semantic context…

Computer Vision and Pattern Recognition · Computer Science 2024-10-04 Nischal Khanal , Shivanand Venkanna Sheshappanavar

Monocular Depth Estimation From the Perspective of Feature Restoration: A Diffusion Enhanced Depth Restoration Approach

Monocular Depth Estimation (MDE) is a fundamental computer vision task with important applications in 3D vision. The current mainstream MDE methods employ an encoder-decoder architecture with multi-level/scale feature processing. However,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Huibin Bai , Shuai Li , Hanxiao Zhai , Yanbo Gao , Chong Lv , Yibo Wang , Haipeng Ping , Wei Hua , Xingyu Gao

NDDepth: Normal-Distance Assisted Monocular Depth Estimation

Monocular depth estimation has drawn widespread attention from the vision community due to its broad applications. In this paper, we propose a novel physics (geometry)-driven deep learning framework for monocular depth estimation by…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Shuwei Shao , Zhongcai Pei , Weihai Chen , Xingming Wu , Zhengguo Li

Monocular Depth Prediction through Continuous 3D Loss

This paper reports a new continuous 3D loss function for learning depth from monocular images. The dense depth prediction from a monocular image is supervised using sparse LIDAR points, which enables us to leverage available open source…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Minghan Zhu , Maani Ghaffari , Yuanxin Zhong , Pingping Lu , Zhong Cao , Ryan M. Eustice , Huei Peng