Related papers: S2R-DepthNet: Learning a Generalizable Depth-speci…

T2Net: Synthetic-to-Realistic Translation for Solving Single-Image Depth Estimation Tasks

Current methods for single-image depth estimation use training datasets with real image-depth pairs or stereo pairs, which are not easy to acquire. We propose a framework, trained on synthetic image-depth pairs and unpaired real images,…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-07 · Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai

Shallow2Deep: Indoor Scene Modeling by Single Image Understanding

Dense indoor scene modeling from 2D images has been bottlenecked due to the absence of depth information and cluttered occlusions. We present an automatic indoor scene modeling approach using deep features from neural networks. Given a…

Computer Vision and Pattern Recognition · Computer Science · 2020-02-25 · Yinyu Nie, Shihui Guo, Jian Chang, Xiaoguang Han, Jiahui Huang, Shi-Min Hu, Jian Jun Zhang

Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations

Unsupervised learning with generative models has the potential of discovering rich representations of 3D scenes. While geometric deep learning has explored 3D-structure-aware representations of scene geometry, these models typically require…

Computer Vision and Pattern Recognition · Computer Science · 2020-01-30 · Vincent Sitzmann, Michael Zollhöfer, Gordon Wetzstein

Unsupervised Learning of 3D Structure from Images

A key goal of computer vision is to recover the underlying 3D structure from 2D observations of the world. In this paper we learn strong deep generative models of 3D structures, and recover these structures from 3D and 2D images via…

Computer Vision and Pattern Recognition · Computer Science · 2018-06-20 · Danilo Jimenez Rezende, S. M. Ali Eslami, Shakir Mohamed, Peter Battaglia, Max Jaderberg, Nicolas Heess

Learning to Reconstruct and Segment 3D Objects

To endow machines with the ability to perceive the real-world in a three dimensional representation as we do as humans is a fundamental and long-standing topic in Artificial Intelligence. Given different types of visual inputs such as…

Computer Vision and Pattern Recognition · Computer Science · 2020-10-20 · Bo Yang

Geometric Perception based Efficient Text Recognition

Every Scene Text Recognition (STR) task consists of text localization & text recognition as the prominent sub-tasks. However, in real-world applications with fixed camera positions such as equipment monitor reading, image-based data entry,…

Computer Vision and Pattern Recognition · Computer Science · 2023-02-09 · P. N. Deelaka, D. R. Jayakodi, D. Y. Silva

MarrNet: 3D Shape Reconstruction via 2.5D Sketches

3D object reconstruction from a single image is a highly under-determined problem, requiring strong prior knowledge of plausible 3D shapes. This introduces challenges for learning-based approaches, as 3D object annotations are scarce in…

Computer Vision and Pattern Recognition · Computer Science · 2017-11-10 · Jiajun Wu, Yifan Wang, Tianfan Xue, Xingyuan Sun, William T Freeman, Joshua B Tenenbaum

Pose from Shape: Deep Pose Estimation for Arbitrary 3D Objects

Most deep pose estimation methods need to be trained for specific object instances or categories. In this work we propose a completely generic deep pose estimation approach, which does not require the network to have been trained on…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-06 · Yang Xiao, Xuchong Qiu, Pierre-Alain Langlois, Mathieu Aubry, Renaud Marlet

Deep Robust Single Image Depth Estimation Neural Network Using Scene Understanding

Single image depth estimation (SIDE) plays a crucial role in 3D computer vision. In this paper, we propose a two-stage robust SIDE framework that can perform blind SIDE for both indoor and outdoor scenes. At the first stage, the scene…

Computer Vision and Pattern Recognition · Computer Science · 2019-06-11 · Haoyu Ren, Mostafa El-khamy, Jungwon Lee

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve

Object recognition has seen significant progress in the image domain, with focus primarily on 2D perception. We propose to leverage existing large-scale datasets of 3D models to understand the underlying 3D structure of objects seen in an…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-28 · Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai

SSR-2D: Semantic 3D Scene Reconstruction from 2D Images

Most deep learning approaches to comprehensive semantic modeling of 3D indoor spaces require costly dense annotations in the 3D domain. In this work, we explore a central 3D scene modeling task, namely, semantic scene reconstruction without…

Computer Vision and Pattern Recognition · Computer Science · 2024-06-06 · Junwen Huang, Alexey Artemov, Yujin Chen, Shuaifeng Zhi, Kai Xu, Matthias Nießner

RenderNet: A deep convolutional network for differentiable rendering from 3D shapes

Traditional computer graphics rendering pipeline is designed for procedurally generating 2D quality images from 3D shapes with high performance. The non-differentiability due to discrete operations such as visibility computation makes it…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-10 · Thu Nguyen-Phuoc, Chuan Li, Stephen Balaban, Yong-Liang Yang

Self-Supervised Learning of Domain Invariant Features for Depth Estimation

We tackle the problem of unsupervised synthetic-to-real domain adaptation for single image depth estimation. An essential building block of single image depth estimation is an encoder-decoder task network that takes RGB images as input and…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-22 · Hiroyasu Akada, Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka

Towards Learning Neural Representations from Shadows

We present a method that learns neural shadow fields which are neural scene representations that are only learnt from the shadows present in the scene. While traditional shape-from-shadow (SfS) algorithms reconstruct geometry from shadows,…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-21 · Kushagra Tiwary, Tzofi Klinghoffer, Ramesh Raskar

3D-to-2D Distillation for Indoor Scene Parsing

Indoor scene semantic parsing from RGB images is very challenging due to occlusions, object distortion, and viewpoint variations. Going beyond prior works that leverage geometry information, typically paired depth maps, we present a new…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-08 · Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu

Self-supervised Depth Estimation Leveraging Global Perception and Geometric Smoothness Using On-board Videos

Self-supervised depth estimation has drawn much attention in recent years as it does not require labeled data but image sequences. Moreover, it can be conveniently used in various applications, such as autonomous driving, robotics,…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-08 · Shaocheng Jia, Xin Pei, Wei Yao, S. C. Wong

Towards General Purpose Geometry-Preserving Single-View Depth Estimation

Single-view depth estimation (SVDE) plays a crucial role in scene understanding for AR applications, 3D modeling, and robotics, providing the geometry of a scene based on a single image. Recent works have shown that a successful solution…

Computer Vision and Pattern Recognition · Computer Science · 2021-02-11 · Mikhail Romanov, Nikolay Patatkin, Anna Vorontsova, Sergey Nikolenko, Anton Konushin, Dmitry Senyushkin

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering

In this study, we address the challenge of 3D scene structure recovery from monocular depth estimation. While traditional depth estimation methods leverage labeled datasets to directly predict absolute depth, recent advancements advocate…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-19 · Chi Zhang, Wei Yin, Gang Yu, Zhibin Wang, Tao Chen, Bin Fu, Joey Tianyi Zhou, Chunhua Shen

Normal-guided Detail-Preserving Neural Implicit Function for High-Fidelity 3D Surface Reconstruction

Neural implicit representations have emerged as a powerful paradigm for 3D reconstruction. However, despite their success, existing methods fail to capture fine geometric details and thin structures, especially in scenarios where only…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-23 · Aarya Patel, Hamid Laga, Ojaswa Sharma

Deep Learned Full-3D Object Completion from Single View

3D geometry is a very informative cue when interacting with and navigating an environment. This writing proposes a new approach to 3D reconstruction and scene understanding, which implicitly learns 3D geometry from depth maps pairing a deep…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-22 · Dario Rethage, Federico Tombari, Felix Achilles, Nassir Navab