Related papers: SharpDepth: Sharpening Metric Depth Predictions Us…

UniDepth: Universal Monocular Metric Depth Estimation

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Luigi Piccinelli , Yung-Hsu Yang , Christos Sakaridis , Mattia Segu , Siyuan Li , Luc Van Gool , Fisher Yu

DepthFM: Fast Monocular Depth Estimation with Flow Matching

Current discriminative depth estimation methods often produce blurry artifacts, while generative approaches suffer from slow sampling due to curvatures in the noise-to-depth transport. Our method addresses these challenges by framing depth…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Ming Gui , Johannes Schusterbauer , Ulrich Prestel , Pingchuan Ma , Dmytro Kotovenko , Olga Grebenkova , Stefan Andreas Baumann , Vincent Tao Hu , Björn Ommer

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth from a single image is geometrically ill-posed and requires scene understanding, so it is not surprising that the rise of deep learning has led to a…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Bingxin Ke , Anton Obukhov , Shengyu Huang , Nando Metzger , Rodrigo Caye Daudt , Konrad Schindler

Towards Sharper Object Boundaries in Self-Supervised Depth Estimation

Accurate monocular depth estimation is crucial for 3D scene understanding, but existing methods often blur depth at object boundaries, introducing spurious intermediate 3D points. While achieving sharp edges usually requires very…

Computer Vision and Pattern Recognition · Computer Science 2025-11-19 Aurélien Cecille , Stefan Duffner , Franck Davoine , Rémi Agier , Thibault Neveu

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

We present a foundation model for zero-shot metric monocular depth estimation. Our model, Depth Pro, synthesizes high-resolution depth maps with unparalleled sharpness and high-frequency details. The predictions are metric, with absolute…

Computer Vision and Pattern Recognition · Computer Science 2025-04-22 Aleksei Bochkovskii , Amaël Delaunoy , Hugo Germain , Marcel Santos , Yichao Zhou , Stephan R. Richter , Vladlen Koltun

MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model

Over the past few years, self-supervised monocular depth estimation that does not depend on ground-truth during the training phase has received widespread attention. Most efforts focus on designing different types of network architectures…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Shuwei Shao , Zhongcai Pei , Weihai Chen , Dingchi Sun , Peter C. Y. Chen , Zhengguo Li

Boosting Monocular Metric Depth Estimation via Bokeh Rendering

Bokeh rendering and depth estimation share a fundamental optical connection, yet existing methods fail to fully exploit this reciprocity. Conventional bokeh pipelines rely heavily on noisy depth maps that inevitably introduce visual…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Hangwei Zhang , Armando Fortes , Tianyi Wei , Xingang Pan

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

Monocular depth estimation is a challenging task that predicts the pixel-wise depth from a single 2D image. Current methods typically model this problem as a regression or classification task. We propose DiffusionDepth, a new approach that…

Computer Vision and Pattern Recognition · Computer Science 2023-08-30 Yiqun Duan , Xianda Guo , Zheng Zhu

PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage

This work addresses the task of zero-shot monocular depth estimation. A recent advance in this field has been the idea of utilising Text-to-Image foundation models, such as Stable Diffusion. Foundation models provide a rich and generic…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Denis Zavadski , Damjan Kalšan , Carsten Rother

Unsupervised Monocular Depth Estimation Based on Hierarchical Feature-Guided Diffusion

Unsupervised monocular depth estimation has received widespread attention because of its capability to train without ground truth. In real-world scenarios, the images may be blurry or noisy due to the influence of weather conditions and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Runze Liu , Dongchen Zhu , Guanghui Zhang , Yue Xu , Wenjun Shi , Xiaolin Zhang , Lei Wang , Jiamao Li

UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to…

Computer Vision and Pattern Recognition · Computer Science 2025-12-19 Luigi Piccinelli , Christos Sakaridis , Yung-Hsu Yang , Mattia Segu , Siyuan Li , Wim Abbeloos , Luc Van Gool

Monocular Depth Estimation using Diffusion Models

We formulate monocular depth estimation using denoising diffusion models, inspired by their recent successes in high fidelity image generation. To that end, we introduce innovations to address problems arising due to noisy, incomplete depth…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Saurabh Saxena , Abhishek Kar , Mohammad Norouzi , David J. Fleet

ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation

Estimating depth from a single image is a challenging visual task. Compared to relative depth estimation, metric depth estimation attracts more attention due to its practical physical significance and critical applications in real-life…

Computer Vision and Pattern Recognition · Computer Science 2026-02-25 Ruijie Zhu , Chuxin Wang , Ziyang Song , Li Liu , Tianzhu Zhang , Yongdong Zhang

EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model

Monocular depth estimation (MDE) plays a pivotal role in various computer vision applications, such as robotics, augmented reality, and autonomous driving. Despite recent advancements, existing methods often fail to meet key requirements…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Andrii Litvynchuk , Ivan Livinsky , Anand Ravi , Nima Kalantari , Andrii Tsarov

Structure-Centric Robust Monocular Depth Estimation via Knowledge Distillation

Monocular depth estimation, enabled by self-supervised learning, is a key technique for 3D perception in computer vision. However, it faces significant challenges in real-world scenarios, which encompass adverse weather variations, motion…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Runze Chen , Haiyong Luo , Fang Zhao , Jingze Yu , Yupeng Jia , Juan Wang , Xuepeng Ma

Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues

Recent monocular metric depth estimation (MMDE) methods have made notable progress towards zero-shot generalization. However, they still exhibit a significant performance drop on out-of-distribution datasets. We address this limitation by…

Computer Vision and Pattern Recognition · Computer Science 2025-05-26 Chinmay Talegaonkar , Nikhil Gandudi Suresh , Zachary Novack , Yash Belhe , Priyanka Nagasamudra , Nicholas Antipa

Distilling Monocular Foundation Model for Fine-grained Depth Completion

Depth completion involves predicting dense depth maps from sparse LiDAR inputs. However, sparse depth annotations from sensors limit the availability of dense supervision, which is necessary for learning detailed geometric features. In this…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Yingping Liang , Yutao Hu , Wenqi Shao , Ying Fu

SharpNet: Fast and Accurate Recovery of Occluding Contours in Monocular Depth Estimation

We introduce SharpNet, a method that predicts an accurate depth map for an input color image, with a particular attention to the reconstruction of occluding contours: Occluding contours are an important cue for object recognition, and for…

Computer Vision and Pattern Recognition · Computer Science 2019-11-13 Michaël Ramamonjisoa , Vincent Lepetit

Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion

Depth completion upgrades sparse depth measurements into dense depth maps guided by a conventional image. Existing methods for this highly ill-posed task operate in tightly constrained settings and tend to struggle when applied to images…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Massimiliano Viola , Kevin Qu , Nando Metzger , Bingxin Ke , Alexander Becker , Konrad Schindler , Anton Obukhov

HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors

We propose HYBRIDDEPTH, a robust depth estimation pipeline that addresses key challenges in depth estimation,including scale ambiguity, hardware heterogeneity, and generalizability. HYBRIDDEPTH leverages focal stack, data conveniently…

Computer Vision and Pattern Recognition · Computer Science 2024-12-30 Ashkan Ganj , Hang Su , Tian Guo