Monocular Differentiable Rendering for Self-Supervised 3D Object Detection

Deniz Beker; Hiroharu Kato; Mihai Adrian Morariu; Takahiro Ando; Toru Matsuoka; Wadim Kehl; Adrien Gaidon

Computer Vision and Pattern Recognition 2020-10-01 v1 Machine Learning

Abstract

3D object detection from monocular images is an ill-posed problem due to the projective entanglement of depth and scale. To overcome this ambiguity, we present a novel self-supervised method for textured 3D shape reconstruction and pose estimation of rigid objects with the help of strong shape priors and 2D instance masks. Our method predicts the 3D location and meshes of each object in an image using differentiable rendering and a self-supervised objective derived from a pretrained monocular depth estimation network. We use the KITTI 3D object detection dataset to evaluate the accuracy of the method. Experiments demonstrate that we can effectively use noisy monocular depth and differentiable rendering as an alternative to expensive 3D ground-truth labels or LiDAR information.
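The "projective entanglement of depth and scale" can be illustrated with a minimal sketch, assuming an ideal pinhole camera. The intrinsics (`focal`, `cx`, `cy`) and the box dimensions below are illustrative values chosen for this example, not taken from the paper or from KITTI. The sketch shows that an object twice as large and twice as far away projects to the same pixels, which is why a single image alone cannot fix metric depth.

```python
def project(point, focal=720.0, cx=640.0, cy=360.0):
    """Pinhole projection of a camera-frame 3D point (x, y, z) to pixels.

    The intrinsics are hypothetical defaults used only for illustration.
    """
    x, y, z = point
    return (focal * x / z + cx, focal * y / z + cy)


def scale_about_camera(points, s):
    """Scale every point by s about the camera center.

    This makes the object s times larger and places it s times farther away.
    """
    return [(s * x, s * y, s * z) for (x, y, z) in points]


# Corners of a hypothetical box roughly 10 m in front of the camera.
box = [(x, y, z)
       for x in (-0.9, 0.9)
       for y in (-0.75, 0.75)
       for z in (8.0, 12.0)]

# The same box, twice as large and twice as far away.
far_big_box = scale_about_camera(box, 2.0)

near_pixels = [project(p) for p in box]
far_pixels = [project(p) for p in far_big_box]

# Largest pixel-coordinate difference between the two projections.
max_diff = max(abs(a - b)
               for near, far in zip(near_pixels, far_pixels)
               for a, b in zip(near, far))
print("max pixel difference:", max_diff)
```

Because both boxes produce the same 2D footprint, some additional signal is needed to resolve the ambiguity. In the method summarized above, that signal comes from shape priors and a pretrained monocular depth network.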

Keywords

monocular object detection, depth estimation, 3D reconstruction

Cite

@article{arxiv.2009.14524,
  title  = {Monocular Differentiable Rendering for Self-Supervised 3D Object Detection},
  author = {Deniz Beker and Hiroharu Kato and Mihai Adrian Morariu and Takahiro Ando and Toru Matsuoka and Wadim Kehl and Adrien Gaidon},
  journal= {arXiv preprint arXiv:2009.14524},
  year   = {2020}
}

Comments

20 pages, Supplementary material included, Published in ECCV 2020
