Related papers: MEDeA: Multi-view Efficient Depth Adjustment

PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation

Self-supervised monocular depth estimation is of significant importance, with applications spanning autonomous driving and robotics. However, the reliance on self-supervision introduces a strong static-scene assumption, thereby posing…

Computer Vision and Pattern Recognition · Computer Science 2024-01-18 Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu, Fang-Lue Zhang, Song-Hai Zhang

Refinement of Monocular Depth Maps via Multi-View Differentiable Rendering

Accurate depth estimation is at the core of many applications in computer graphics, vision, and robotics. Current state-of-the-art monocular depth estimators, trained on extensive datasets, generalize well but lack 3D consistency needed for…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Laura Fink, Linus Franke, Bernhard Egger, Joachim Keinert, Marc Stamminger

Survey on Monocular Metric Depth Estimation

Monocular Depth Estimation (MDE) enables spatial understanding, 3D reconstruction, and autonomous navigation, yet deep learning approaches often predict only relative depth without a consistent metric scale. This limitation reduces…

Computer Vision and Pattern Recognition · Computer Science 2025-08-27 Jiuling Zhang

MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes

Monocular metric depth estimation (MMDE) is a crucial task to solve for indoor scene reconstruction on edge devices. Despite this importance, existing models are sensitive to factors such as boundary frequency of objects in the scene and…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Sanghyun Byun, Jacob Song, Woo Seong Chung

Inter-View Depth Consistency Testing in Depth Difference Subspace

Multiview depth imagery will play a critical role in free-viewpoint television. This technology requires high quality virtual view synthesis to enable viewers to move freely in a dynamic real world scene. Depth imagery at different…

Computer Vision and Pattern Recognition · Computer Science 2023-01-30 Pravin Kumar Rana, Markus Flierl

SDGE: Stereo Guided Depth Estimation for 360° Camera Sets

Depth estimation is a critical technology in autonomous driving, and multi-camera systems are often used to achieve a 360° perception. These 360° camera sets often have limited or low-quality overlap regions, making multi-view…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Jialei Xu, Wei Yin, Dong Gong, Junjun Jiang, Xianming Liu

EGA-Depth: Efficient Guided Attention for Self-Supervised Multi-Camera Depth Estimation

The ubiquitous multi-camera setup on modern autonomous vehicles provides an opportunity to construct surround-view depth. Existing methods, however, either perform independent monocular depth estimations on each camera or rely on…

Computer Vision and Pattern Recognition · Computer Science 2023-04-10 Yunxiao Shi, Hong Cai, Amin Ansari, Fatih Porikli

EfficientDepth: A Fast and Detail-Preserving Monocular Depth Estimation Model

Monocular depth estimation (MDE) plays a pivotal role in various computer vision applications, such as robotics, augmented reality, and autonomous driving. Despite recent advancements, existing methods often fail to meet key requirements…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Andrii Litvynchuk, Ivan Livinsky, Anand Ravi, Nima Kalantari, Andrii Tsarov

Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation

Depth map estimation from images is an important task in robotic systems. Existing methods can be categorized into two groups including multi-view stereo and monocular depth estimation. The former requires cameras to have large overlapping…

Computer Vision and Pattern Recognition · Computer Science 2022-10-06 Jialei Xu, Xianming Liu, Yuanchao Bai, Junjun Jiang, Kaixuan Wang, Xiaozhi Chen, Xiangyang Ji

The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth

Self-supervised monocular depth estimation networks are trained to predict scene depth using nearby frames as a supervision signal during training. However, for many applications, sequence information in the form of video frames is also…

Computer Vision and Pattern Recognition · Computer Science 2021-07-15 Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel Brostow, Michael Firman

UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to…

Computer Vision and Pattern Recognition · Computer Science 2025-12-19 Luigi Piccinelli, Christos Sakaridis, Yung-Hsu Yang, Mattia Segu, Siyuan Li, Wim Abbeloos, Luc Van Gool

Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks

We present a novel method for multi-view depth estimation from a single video, which is a critical task in various applications, such as perception, reconstruction and robot navigation. Although previous learning-based methods have…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Xiaoxiao Long, Lingjie Liu, Wei Li, Christian Theobalt, Wenping Wang

PTC-Depth: Pose-Refined Monocular Depth Estimation with Temporal Consistency

Monocular depth estimation (MDE) has been widely adopted in the perception systems of autonomous vehicles and mobile robots. However, existing approaches often struggle to maintain temporal consistency in depth estimation across consecutive…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Leezy Han, Seunggyu Kim, Dongseok Shim, Hyeonbeom Lee

Normal Assisted Stereo Depth Estimation

Accurate stereo depth estimation plays a critical role in various 3D tasks in both indoor and outdoor environments. Recently, learning-based multi-view stereo methods have demonstrated competitive performance with a limited number of views.…

Computer Vision and Pattern Recognition · Computer Science 2020-06-02 Uday Kusupati, Shuo Cheng, Rui Chen, Hao Su

Multi-Frame Self-Supervised Depth Estimation with Multi-Scale Feature Fusion in Dynamic Scenes

Multi-frame methods improve monocular depth estimation over single-frame approaches by aggregating spatial-temporal information via feature matching. However, the spatial-temporal feature leads to accuracy degradation in dynamic scenes. To…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Jiquan Zhong, Xiaolin Huang, Xiao Yu

UniDepth: Universal Monocular Metric Depth Estimation

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis, Mattia Segu, Siyuan Li, Luc Van Gool, Fisher Yu

ScaleDepth: Decomposing Metric Depth Estimation into Scale Prediction and Relative Depth Estimation

Estimating depth from a single image is a challenging visual task. Compared to relative depth estimation, metric depth estimation attracts more attention due to its practical physical significance and critical applications in real-life…

Computer Vision and Pattern Recognition · Computer Science 2026-02-25 Ruijie Zhu, Chuxin Wang, Ziyang Song, Li Liu, Tianzhu Zhang, Yongdong Zhang

Edge-aware Consistent Stereo Video Depth Estimation

Video depth estimation is crucial in various applications, such as scene reconstruction and augmented reality. In contrast to the naive method of estimating depths from images, a more sophisticated approach uses temporal information,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-05 Elena Kosheleva, Sunil Jaiswal, Faranak Shamsafar, Noshaba Cheema, Klaus Illgner-Fehns, Philipp Slusallek

RePoseD: Efficient Relative Pose Estimation With Known Depth Information

Recent advances in monocular depth estimation methods (MDE) and their improved accuracy open new possibilities for their applications. In this paper, we investigate how monocular depth estimates can be used for relative pose estimation. In…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Yaqing Ding, Viktor Kocur, Václav Vávra, Zuzana Berger Haladová, Jian Yang, Torsten Sattler, Zuzana Kukelova

WEDepth: Efficient Adaptation of World Knowledge for Monocular Depth Estimation

Monocular depth estimation (MDE) is widely applicable but remains highly challenging due to the inherently ill-posed nature of reconstructing 3D scenes from single 2D images. Modern Vision Foundation Models (VFMs), pre-trained on…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Gongshu Wang, Zhirui Wang, Kan Yang