Related papers: Task-Aware Monocular Depth Estimation for 3D Objec…

Learning Geometry-Guided Depth via Projective Modeling for Monocular 3D Object Detection

As a crucial task of autonomous driving, 3D object detection has made great progress in recent years. However, monocular 3D object detection remains a challenging problem due to the unsatisfactory performance in depth estimation. Most…

Computer Vision and Pattern Recognition · Computer Science 2024-04-25 Yinmin Zhang , Xinzhu Ma , Shuai Yi , Jun Hou , Zhihui Wang , Wanli Ouyang , Dan Xu

3D Object Aided Self-Supervised Monocular Depth Estimation

Monocular depth estimation has been actively studied in fields such as robot vision, autonomous driving, and 3D scene understanding. Given a sequence of color images, unsupervised learning methods based on the framework of…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Songlin Wei , Guodong Chen , Wenzheng Chi , Zhenhua Wang , Lining Sun

MonoGround: Detecting Monocular 3D Objects from the Ground

Monocular 3D object detection has attracted great attention for its advantages in simplicity and cost. Due to the ill-posed 2D to 3D mapping essence from the monocular imaging process, monocular 3D object detection suffers from inaccurate…

Computer Vision and Pattern Recognition · Computer Science 2022-06-16 Zequn Qin , Xi Li

MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation

Monocular 3D object detection (Mono3D) in mobile settings (e.g., on a vehicle, a drone, or a robot) is an important yet challenging task. Due to the near-far disparity phenomenon of monocular vision and the ever-changing camera pose, it is…

Computer Vision and Pattern Recognition · Computer Science 2023-03-27 Yunsong Zhou , Quan Liu , Hongzi Zhu , Yunzhe Li , Shan Chang , Minyi Guo

MonoCD: Monocular 3D Object Detection with Complementary Depths

Monocular 3D object detection has attracted widespread attention due to its potential to accurately obtain object 3D localization from a single image at a low cost. Depth estimation is an essential but challenging subtask of monocular 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Longfei Yan , Pei Yan , Shengzhou Xiong , Xuanyu Xiang , Yihua Tan

Self-Supervised Monocular Scene Decomposition and Depth Estimation

Self-supervised monocular depth estimation approaches either ignore independently moving objects in the scene or need a separate segmentation step to identify them. We propose MonoDepthSeg to jointly estimate depth and segment moving…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Sadra Safadoust , Fatma Güney

Monocular Depth Estimation: A Survey

Monocular depth estimation is often described as an ill-posed and inherently ambiguous problem. Estimating depth from 2D images is a crucial step in scene reconstruction, 3Dobject recognition, segmentation, and detection. The problem can be…

Computer Vision and Pattern Recognition · Computer Science 2019-01-29 Amlaan Bhoi

Self-supervised 3D Object Detection from Monocular Pseudo-LiDAR

There have been attempts to detect 3D objects by fusion of stereo camera images and LiDAR sensor data or using LiDAR for pre-training and only monocular images for testing, but there have been less attempts to use only monocular image…

Computer Vision and Pattern Recognition · Computer Science 2022-09-21 Curie Kim , Ue-Hwan Kim , Jong-Hwan Kim

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

Although the majority of recent autonomous driving systems concentrate on developing perception methods based on ego-vehicle sensors, there is an overlooked alternative approach that involves leveraging intelligent roadside cameras to help…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Lei Yang , Jiaxin Yu , Xinyu Zhang , Jun Li , Li Wang , Yi Huang , Chuang Zhang , Hong Wang , Yiming Li

Generalizing Monocular 3D Object Detection

Monocular 3D object detection (Mono3D) is a fundamental computer vision task that estimates an object's class, 3D position, dimensions, and orientation from a single image. Its applications, including autonomous driving, augmented reality,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Abhinav Kumar

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

Monocular 3D object detection aims for precise 3D localization and identification of objects from a single-view image. Despite its recent progress, it often struggles while handling pervasive object occlusions that tend to complicate and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Xueying Jiang , Sheng Jin , Xiaoqin Zhang , Ling Shao , Shijian Lu

Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking

Monocular image-based 3D perception has become an active research area in recent years owing to its applications in autonomous driving. Approaches to monocular 3D perception including detection and tracking, however, often yield inferior…

Computer Vision and Pattern Recognition · Computer Science 2022-06-09 Longlong Jing , Ruichi Yu , Henrik Kretzschmar , Kang Li , Charles R. Qi , Hang Zhao , Alper Ayvaci , Xu Chen , Dillon Cower , Yingwei Li , Yurong You , Han Deng , Congcong Li , Dragomir Anguelov

A Survey on Joint Object Detection and Pose Estimation using Monocular Vision

In this survey we present a complete landscape of joint object detection and pose estimation methods that use monocular vision. Descriptions of traditional approaches that involve descriptors or models and various estimation methods have…

Computer Vision and Pattern Recognition · Computer Science 2018-11-27 Aniruddha V Patil , Pankaj Rabha

Ground-aware Monocular 3D Object Detection for Autonomous Driving

Estimating the 3D position and orientation of objects in the environment with a single RGB camera is a critical and challenging task for low-cost urban autonomous driving and mobile robots. Most of the existing algorithms are based on the…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Yuxuan Liu , Yuan Yixuan , Ming Liu

Densely Constrained Depth Estimator for Monocular 3D Object Detection

Estimating accurate 3D locations of objects from monocular images is a challenging problem because of lacking depth. Previous work shows that utilizing the object's keypoint projection constraints to estimate multiple depth candidates…

Computer Vision and Pattern Recognition · Computer Science 2022-09-28 Yingyan Li , Yuntao Chen , Jiawei He , Zhaoxiang Zhang

Objects are Different: Flexible Monocular 3D Object Detection

The precise localization of 3D objects from a single image without depth information is a highly challenging problem. Most existing methods adopt the same approach for all objects regardless of their diverse distributions, leading to…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Yunpeng Zhang , Jiwen Lu , Jie Zhou

Monocular Depth Prediction through Continuous 3D Loss

This paper reports a new continuous 3D loss function for learning depth from monocular images. The dense depth prediction from a monocular image is supervised using sparse LIDAR points, which enables us to leverage available open source…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Minghan Zhu , Maani Ghaffari , Yuanxin Zhong , Pingping Lu , Zhong Cao , Ryan M. Eustice , Huei Peng

Probabilistic and Geometric Depth: Detecting Objects in Perspective

3D object detection is an important capability needed in various practical applications such as driver assistance systems. Monocular 3D detection, as a representative general setting among image-based approaches, provides a more economical…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Tai Wang , Xinge Zhu , Jiangmiao Pang , Dahua Lin

On the Metrics for Evaluating Monocular Depth Estimation

Monocular Depth Estimation (MDE) is performed to produce 3D information that can be used in downstream tasks such as those related to on-board perception for Autonomous Vehicles (AVs) or driver assistance. Therefore, a relevant arising…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Akhil Gurram , Antonio M. Lopez

MonoPGC: Monocular 3D Object Detection with Pixel Geometry Contexts

Monocular 3D object detection reveals an economical but challenging task in autonomous driving. Recently center-based monocular methods have developed rapidly with a great trade-off between speed and accuracy, where they usually depend on…

Computer Vision and Pattern Recognition · Computer Science 2023-02-22 Zizhang Wu , Yuanzhu Gan , Lei Wang , Guilian Chen , Jian Pu