Related papers: Shape-Aware Monocular 3D Object Detection

M3DSSD: Monocular 3D Single Stage Object Detector

In this paper, we propose a Monocular 3D Single Stage object Detector (M3DSSD) with feature alignment and asymmetric non-local attention. Current anchor-based monocular 3D object detection methods suffer from feature mismatching. To…

Computer Vision and Pattern Recognition · Computer Science 2021-03-25 Shujie Luo , Hang Dai , Ling Shao , Yong Ding

MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm

Monocular 3D object detection is very challenging in autonomous driving due to the lack of depth information. This paper proposes a one-stage monocular 3D object detection algorithm based on multi-scale depth stratification, which uses the…

Computer Vision and Pattern Recognition · Computer Science 2022-04-29 Zhouzhen Xie , Yuying Song , Jingxuan Wu , Zecheng Li , Chunyi Song , Zhiwei Xu

Open Vocabulary Monocular 3D Object Detection

We propose and study open-vocabulary monocular 3D detection, a novel task that aims to detect objects of any categores in metric 3D space from a single RGB image. Existing 3D object detectors either rely on costly sensors such as LiDAR or…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Jin Yao , Hao Gu , Xuweiyi Chen , Jiayun Wang , Zezhou Cheng

3D Object Aided Self-Supervised Monocular Depth Estimation

Monocular depth estimation has been actively studied in fields such as robot vision, autonomous driving, and 3D scene understanding. Given a sequence of color images, unsupervised learning methods based on the framework of…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Songlin Wei , Guodong Chen , Wenzheng Chi , Zhenhua Wang , Lining Sun

AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

Existing deep learning-based approaches for monocular 3D object detection in autonomous driving often model the object as a rotated 3D cuboid while the object's geometric shape has been ignored. In this work, we propose an approach for…

Computer Vision and Pattern Recognition · Computer Science 2021-08-26 Zongdai Liu , Dingfu Zhou , Feixiang Lu , Jin Fang , Liangjun Zhang

Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction

We present MonoPSR, a monocular 3D object detection method that leverages proposals and shape reconstruction. First, using the fundamental relations of a pinhole camera model, detections from a mature 2D object detector are used to generate…

Computer Vision and Pattern Recognition · Computer Science 2019-04-04 Jason Ku , Alex D. Pon , Steven L. Waslander

Object-Aware Centroid Voting for Monocular 3D Object Detection

Monocular 3D object detection aims to detect objects in a 3D physical world from a single camera. However, recent approaches either rely on expensive LiDAR devices, or resort to dense pixel-wise depth estimation that causes prohibitive…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Wentao Bao , Qi Yu , Yu Kong

Monocular 3D Object Detection using Multi-Stage Approaches with Attention and Slicing aided hyper inference

3D object detection is vital as it would enable us to capture objects' sizes, orientation, and position in the world. As a result, we would be able to use this 3D detection in real-world applications such as Augmented Reality (AR),…

Computer Vision and Pattern Recognition · Computer Science 2022-12-23 Abonia Sojasingarayar , Ashish Patel

Objects are Different: Flexible Monocular 3D Object Detection

The precise localization of 3D objects from a single image without depth information is a highly challenging problem. Most existing methods adopt the same approach for all objects regardless of their diverse distributions, leading to…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Yunpeng Zhang , Jiwen Lu , Jie Zhou

Generalizing Monocular 3D Object Detection

Monocular 3D object detection (Mono3D) is a fundamental computer vision task that estimates an object's class, 3D position, dimensions, and orientation from a single image. Its applications, including autonomous driving, augmented reality,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Abhinav Kumar

MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships

Monocular 3D object detection is an essential component in autonomous driving while challenging to solve, especially for those occluded samples which are only partially visible. Most detectors consider each 3D object as an independent…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Yongjian Chen , Lei Tai , Kai Sun , Mingyang Li

S$^3$-MonoDETR: Supervised Shape&Scale-perceptive Deformable Transformer for Monocular 3D Object Detection

Recently, transformer-based methods have shown exceptional performance in monocular 3D object detection, which can predict 3D attributes from a single 2D image. These methods typically use visual and depth representations to generate query…

Computer Vision and Pattern Recognition · Computer Science 2024-08-22 Xuan He , Jin Yuan , Kailun Yang , Zhenchao Zeng , Zhiyong Li

Probabilistic and Geometric Depth: Detecting Objects in Perspective

3D object detection is an important capability needed in various practical applications such as driver assistance systems. Monocular 3D detection, as a representative general setting among image-based approaches, provides a more economical…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Tai Wang , Xinge Zhu , Jiangmiao Pang , Dahua Lin

FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection

Monocular 3D object detection is an important task for autonomous driving considering its advantage of low cost. It is much more challenging than conventional 2D cases due to its inherent ill-posed property, which is mainly reflected in the…

Computer Vision and Pattern Recognition · Computer Science 2021-09-27 Tai Wang , Xinge Zhu , Jiangmiao Pang , Dahua Lin

Learning Geometry-Guided Depth via Projective Modeling for Monocular 3D Object Detection

As a crucial task of autonomous driving, 3D object detection has made great progress in recent years. However, monocular 3D object detection remains a challenging problem due to the unsatisfactory performance in depth estimation. Most…

Computer Vision and Pattern Recognition · Computer Science 2024-04-25 Yinmin Zhang , Xinzhu Ma , Shuai Yi , Jun Hou , Zhihui Wang , Wanli Ouyang , Dan Xu

Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues

Today's state-of-the-art methods for 3D object detection are based on lidar, stereo, or monocular cameras. Lidar-based methods achieve the best accuracy, but have a large footprint, high cost, and mechanically-limited angular sampling…

Computer Vision and Pattern Recognition · Computer Science 2021-02-09 Frank Julca-Aguilar , Jason Taylor , Mario Bijelic , Fahim Mannan , Ethan Tseng , Felix Heide

Monocular 3D Object Detection and Box Fitting Trained End-to-End Using Intersection-over-Union Loss

Three-dimensional object detection from a single view is a challenging task which, if performed with good accuracy, is an important enabler of low-cost mobile robot perception. Previous approaches to this problem suffer either from an…

Computer Vision and Pattern Recognition · Computer Science 2019-06-21 Eskil Jörgensen , Christopher Zach , Fredrik Kahl

MonoEdge: Monocular 3D Object Detection Using Local Perspectives

We propose a novel approach for monocular 3D object detection by leveraging local perspective effects of each object. While the global perspective effect shown as size and position variations has been exploited for monocular 3D detection…

Computer Vision and Pattern Recognition · Computer Science 2023-01-06 Minghan Zhu , Lingting Ge , Panqu Wang , Huei Peng

Monocular Differentiable Rendering for Self-Supervised 3D Object Detection

3D object detection from monocular images is an ill-posed problem due to the projective entanglement of depth and scale. To overcome this ambiguity, we present a novel self-supervised method for textured 3D shape reconstruction and pose…

Computer Vision and Pattern Recognition · Computer Science 2020-10-01 Deniz Beker , Hiroharu Kato , Mihai Adrian Morariu , Takahiro Ando , Toru Matsuoka , Wadim Kehl , Adrien Gaidon

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

Monocular 3D object detection is valuable for various applications such as robotics and AR/VR. Existing methods are confined to closed-set settings, where the training and testing sets consist of the same scenes and/or object categories.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-08 Yung-Hsu Yang , Luigi Piccinelli , Mattia Segu , Siyuan Li , Rui Huang , Yuqian Fu , Marc Pollefeys , Hermann Blum , Zuria Bauer