Related papers: DoubleTake: Geometry Guided Depth Estimation

Peeking Behind Objects: Layered Depth Prediction from a Single Image

While conventional depth estimation can infer the geometry of a scene from a single RGB image, it fails to estimate scene regions that are occluded by foreground objects. This limits the use of depth prediction in augmented and virtual…

Computer Vision and Pattern Recognition · Computer Science 2019-05-09 Helisa Dhamo, Keisuke Tateno, Iro Laina, Nassir Navab, Federico Tombari

A Two-Streamed Network for Estimating Fine-Scaled Depth Maps from Single RGB Images

Estimating depth from a single RGB image is an ill-posed and inherently ambiguous problem. State-of-the-art deep learning methods can now estimate accurate 2D depth maps, but when the maps are projected into 3D, they lack local detail and…

Computer Vision and Pattern Recognition · Computer Science 2017-12-05 Jun Li, Reinhard Klein, Angela Yao

Towards Keypoint Guided Self-Supervised Depth Estimation

This paper proposes to use keypoints as a self-supervision clue for learning depth map estimation from a collection of input images. As ground truth depth from real images is difficult to obtain, there are many unsupervised and…

Computer Vision and Pattern Recognition · Computer Science 2020-11-09 Kristijan Bartol, David Bojanic, Tomislav Petkovic, Tomislav Pribanic, Yago Diez Donoso

Depth from a Single Image by Harmonizing Overcomplete Local Network Predictions

A single color image can contain many cues informative towards different aspects of local geometric structure. We approach the problem of monocular depth estimation by using a neural network to produce a mid-level representation that…

Computer Vision and Pattern Recognition · Computer Science 2016-09-08 Ayan Chakrabarti, Jingyu Shao, Gregory Shakhnarovich

Depth Map Estimation of Dynamic Scenes Using Prior Depth Information

Depth information is useful for many applications. Active depth sensors are appealing because they obtain dense and accurate depth maps. However, due to issues that range from power constraints to multi-sensor interference, these sensors…

Image and Video Processing · Electrical Eng. & Systems 2020-02-04 James Noraky, Vivienne Sze

Interp3R: Continuous-time 3D Geometry Estimation with Frames and Events

In recent years, 3D visual foundation models pioneered by pointmap-based approaches such as DUSt3R have attracted a lot of interest, achieving impressive accuracy and strong generalization across diverse scenes. However, these methods are…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Shuang Guo, Filbert Febryanto, Lei Sun, Guillermo Gallego

Augmenting Depth Estimation with Geospatial Context

Modern cameras are equipped with a wide array of sensors that enable recording the geospatial context of an image. Taking advantage of this, we explore depth estimation under the assumption that the camera is geocalibrated, a problem we…

Computer Vision and Pattern Recognition · Computer Science 2021-09-22 Scott Workman, Hunter Blanton

Monocular Human Digitization via Implicit Re-projection Networks

We present an approach to generating 3D human models from images. The key to our framework is that we predict double-sided orthographic depth maps and color images from a single perspective projected image. Our framework consists of three…

Computer Vision and Pattern Recognition · Computer Science 2022-05-17 Min-Gyu Park, Ju-Mi Kang, Je Woo Kim, Ju Hong Yoon

A Neural Network for Detailed Human Depth Estimation from a Single Image

This paper presents a neural network to estimate a detailed depth map of the foreground human in a single RGB image. The result captures geometry details such as cloth wrinkles, which are important in visualization applications. To achieve…

Computer Vision and Pattern Recognition · Computer Science 2019-12-25 Sicong Tang, Feitong Tan, Kelvin Cheng, Zhaoyang Li, Siyu Zhu, Ping Tan

A Survey on Deep Learning Architectures for Image-based Depth Reconstruction

Estimating depth from RGB images is a long-standing ill-posed problem, which has been explored for decades by the computer vision, graphics, and machine learning communities. In this article, we provide a comprehensive survey of the recent…

Computer Vision and Pattern Recognition · Computer Science 2019-06-17 Hamid Laga

Learning to Estimate 3D Hand Pose from Single RGB Images

Low-cost consumer depth cameras and deep learning have enabled reasonable 3D hand pose estimation from single depth images. In this paper, we present an approach that estimates 3D hand pose from regular RGB images. This task has far more…

Computer Vision and Pattern Recognition · Computer Science 2017-10-17 Christian Zimmermann, Thomas Brox

Predicting Depth Maps from Single RGB Images and Addressing Missing Information in Depth Estimation

Depth imaging is a crucial area in Autonomous Driving Systems (ADS), as it plays a key role in detecting and measuring objects in the vehicle's surroundings. However, a significant challenge in this domain arises from missing information in…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Mohamad Mofeed Chaar, Jamal Raiyn, Galia Weidl

Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB

Despite the recent advances in computer vision research, estimating the 3D human pose from single RGB images remains a challenging task, as multiple 3D poses can correspond to the same 2D projection on the image. In this context, depth data…

Computer Vision and Pattern Recognition · Computer Science 2024-09-18 Alessandro Simoni, Francesco Marchetti, Guido Borghi, Federico Becattini, Davide Davoli, Lorenzo Garattoni, Gianpiero Francesca, Lorenzo Seidenari, Roberto Vezzani

A Survey on Deep Learning Techniques for Stereo-based Depth Estimation

Estimating depth from RGB images is a long-standing ill-posed problem, which has been explored for decades by the computer vision, graphics, and machine learning communities. Among the existing techniques, stereo matching remains one of the…

Computer Vision and Pattern Recognition · Computer Science 2021-01-26 Hamid Laga, Laurent Valentin Jospin, Farid Boussaid, Mohammed Bennamoun

Self-Supervised Depth Estimation in Laparoscopic Image using 3D Geometric Consistency

Depth estimation is a crucial step for image-guided intervention in robotic surgery and laparoscopic imaging systems. Since per-pixel depth ground truth is difficult to acquire for laparoscopic image data, it is rarely possible to apply…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Baoru Huang, Jian-Qing Zheng, Anh Nguyen, Chi Xu, Ioannis Gkouzionis, Kunal Vyas, David Tuch, Stamatia Giannarou, Daniel S. Elson

ENG: End-to-end Neural Geometry for Robust Depth and Pose Estimation using CNNs

Recovering structure and motion parameters given an image pair or a sequence of images is a well-studied problem in computer vision. This is often achieved by employing Structure from Motion (SfM) or Simultaneous Localization and Mapping…

Computer Vision and Pattern Recognition · Computer Science 2018-11-07 Thanuja Dharmasiri, Andrew Spek, Tom Drummond

Panoramic Depth Estimation via Supervised and Unsupervised Learning in Indoor Scenes

Depth estimation, as a necessary clue to convert 2D images into the 3D space, has been applied in many machine vision areas. However, to achieve an entire surrounding 360-degree geometric sensing, traditional stereo matching algorithms for…

Computer Vision and Pattern Recognition · Computer Science 2021-09-22 Keyang Zhou, Kailun Yang, Kaiwei Wang

Deep Depth Completion of a Single RGB-D Image

The goal of our work is to complete the depth channel of an RGB-D image. Commodity-grade depth cameras often fail to sense depth for shiny, bright, transparent, and distant surfaces. To address this problem, we train a deep network that…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Yinda Zhang, Thomas Funkhouser

DeepSight: Bridging Depth Maps and Language with a Depth-Driven Multimodal Model

Multimodal large language models (MLLMs) have achieved impressive performance across various tasks such as image captioning and visual question answering (VQA); however, they often struggle to accurately interpret depth information inherent in…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Hao Yang, Hongbo Zhang, Yanyan Zhao, Bing Qin

Depth-Map Generation using Pixel Matching in Stereoscopic Pair of Images

Modern-day multimedia content generation and dissemination is moving towards the presentation of more and more 'realistic' scenarios. The switch from 2-dimensional (2D) to 3-dimensional (3D) has been a major driving force in that direction.…

Computer Vision and Pattern Recognition · Computer Science 2019-05-16 Asra Aslam, Mohd. Samar Ansari