English
Related papers

Related papers: Repurposing Diffusion-Based Image Generators for M…

200 papers

Recovering metric depth from a single image remains a fundamental challenge in computer vision, requiring both scene understanding and accurate scaling. While deep learning has advanced monocular depth estimation, current models often…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Ansh Shah , K Madhava Krishna

Depth completion upgrades sparse depth measurements into dense depth maps guided by a conventional image. Existing methods for this highly ill-posed task operate in tightly constrained settings and tend to struggle when applied to images…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Massimiliano Viola , Kevin Qu , Nando Metzger , Bingxin Ke , Alexander Becker , Konrad Schindler , Anton Obukhov

This work addresses the task of zero-shot monocular depth estimation. A recent advance in this field has been the idea of utilising Text-to-Image foundation models, such as Stable Diffusion. Foundation models provide a rich and generic…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Denis Zavadski , Damjan Kalšan , Carsten Rother

Monocular depth estimation is a challenging task that predicts the pixel-wise depth from a single 2D image. Current methods typically model this problem as a regression or classification task. We propose DiffusionDepth, a new approach that…

Computer Vision and Pattern Recognition · Computer Science 2023-08-30 Yiqun Duan , Xianda Guo , Zheng Zhu

We formulate monocular depth estimation using denoising diffusion models, inspired by their recent successes in high fidelity image generation. To that end, we introduce innovations to address problems arising due to noisy, incomplete depth…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Saurabh Saxena , Abhishek Kar , Mohammad Norouzi , David J. Fleet

Recent monocular metric depth estimation (MMDE) methods have made notable progress towards zero-shot generalization. However, they still exhibit a significant performance drop on out-of-distribution datasets. We address this limitation by…

Computer Vision and Pattern Recognition · Computer Science 2025-05-26 Chinmay Talegaonkar , Nikhil Gandudi Suresh , Zachary Novack , Yash Belhe , Priyanka Nagasamudra , Nicholas Antipa

Unsupervised monocular depth estimation has received widespread attention because of its capability to train without ground truth. In real-world scenarios, the images may be blurry or noisy due to the influence of weather conditions and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Runze Liu , Dongchen Zhu , Guanghui Zhang , Yue Xu , Wenjun Shi , Xiaolin Zhang , Lei Wang , Jiamao Li

The success of deep learning in computer vision over the past decade has hinged on large labeled datasets and strong pretrained models. In data-scarce settings, the quality of these pretrained models becomes crucial for effective transfer…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Bingxin Ke , Kevin Qu , Tianfu Wang , Nando Metzger , Shengyu Huang , Bo Li , Anton Obukhov , Konrad Schindler

In this paper, we present DM-Calib, a diffusion-based approach for estimating pinhole camera intrinsic parameters from a single input image. Monocular camera calibration is essential for many 3D vision tasks. However, most existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Junyuan Deng , Wei Yin , Xiaoyang Guo , Qian Zhang , Xiaotao Hu , Weiqiang Ren , Xiao-Xiao Long , Ping Tan

Monocular depth estimation involves predicting depth from a single RGB image and plays a crucial role in applications such as autonomous driving, robotic navigation, 3D reconstruction, etc. Recent advancements in learning-based methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-02-05 Jingming Xia , Guanqun Cao , Guang Ma , Yiben Luo , Qinzhao Li , John Oyekan

Monocular Depth Estimation (MDE) is a fundamental computer vision task with important applications in 3D vision. The current mainstream MDE methods employ an encoder-decoder architecture with multi-level/scale feature processing. However,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Huibin Bai , Shuai Li , Hanxiao Zhai , Yanbo Gao , Chong Lv , Yibo Wang , Haipeng Ping , Wei Hua , Xingyu Gao

Over the past few years, self-supervised monocular depth estimation that does not depend on ground-truth during the training phase has received widespread attention. Most efforts focus on designing different types of network architectures…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Shuwei Shao , Zhongcai Pei , Weihai Chen , Dingchi Sun , Peter C. Y. Chen , Zhengguo Li

We present a novel approach designed to address the complexities posed by challenging, out-of-distribution data in the single-image depth estimation task. Starting with images that facilitate depth prediction due to the absence of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Fabio Tosi , Pierluigi Zama Ramirez , Matteo Poggi

Recent work showed that large diffusion models can be reused as highly precise monocular depth estimators by casting depth estimation as an image-conditional image generation task. While the proposed model achieved state-of-the-art results,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Gonzalo Martin Garcia , Karim Knaebel , Christian Schmidt , Daan de Geus , Alexander Hermans , Bastian Leibe

We propose SharpDepth, a novel approach to monocular metric depth estimation that combines the metric accuracy of discriminative depth estimation methods (e.g., Metric3D, UniDepth) with the fine-grained boundary sharpness typically achieved…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Duc-Hai Pham , Tung Do , Phong Nguyen , Binh-Son Hua , Khoi Nguyen , Rang Nguyen

Accurate depth estimation is at the core of many applications in computer graphics, vision, and robotics. Current state-of-the-art monocular depth estimators, trained on extensive datasets, generalize well but lack 3D consistency needed for…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Laura Fink , Linus Franke , Bernhard Egger , Joachim Keinert , Marc Stamminger

Monocular depth estimation is often described as an ill-posed and inherently ambiguous problem. Estimating depth from 2D images is a crucial step in scene reconstruction, 3Dobject recognition, segmentation, and detection. The problem can be…

Computer Vision and Pattern Recognition · Computer Science 2019-01-29 Amlaan Bhoi

Monocular depth estimation is a crucial task in computer vision. While existing methods have shown impressive results under standard conditions, they often face challenges in reliably performing in scenarios such as low-light or rainy…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Yifan Mao , Jian Liu , Xianming Liu

Monocular depth estimation, enabled by self-supervised learning, is a key technique for 3D perception in computer vision. However, it faces significant challenges in real-world scenarios, which encompass adverse weather variations, motion…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Runze Chen , Haiyong Luo , Fang Zhao , Jingze Yu , Yupeng Jia , Juan Wang , Xuepeng Ma

Monocular camera calibration is a key precondition for numerous 3D vision applications. Despite considerable advancements, existing methods often hinge on specific assumptions and struggle to generalize across varied real-world scenarios,…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Xiankang He , Guangkai Xu , Bo Zhang , Hao Chen , Ying Cui , Dongyan Guo
‹ Prev 1 2 3 10 Next ›