English

Multi-Task Multi-Sensor Fusion for 3D Object Detection

Computer Vision and Pattern Recognition 2020-12-24 v1

Abstract

In this paper we propose to exploit multiple related tasks for accurate multi-sensor 3D object detection. Towards this goal we present an end-to-end learnable architecture that reasons about 2D and 3D object detection as well as ground estimation and depth completion. Our experiments show that all these tasks are complementary and help the network learn better representations by fusing information at various levels. Importantly, our approach leads the KITTI benchmark on 2D, 3D and BEV object detection, while being real time.

Keywords

Cite

@article{arxiv.2012.12397,
  title  = {Multi-Task Multi-Sensor Fusion for 3D Object Detection},
  author = {Ming Liang and Bin Yang and Yun Chen and Rui Hu and Raquel Urtasun},
  journal= {arXiv preprint arXiv:2012.12397},
  year   = {2020}
}

Comments

CVPR 2019

R2 v1 2026-06-23T21:15:02.098Z