Related papers: Sparse3DTrack: Monocular 3D Object Tracking Using …

PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection

Monocular 3D object detection (M3OD) has long faced challenges due to data scarcity caused by high annotation costs and inherent 2D-to-3D ambiguity. Although various weakly supervised methods and pseudo-labeling methods have been proposed…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Seokyeong Lee , Sithu Aung , Junyong Choi , Seungryong Kim , Ig-Jae Kim , Junghyun Cho

MonoSAOD: Monocular 3D Object Detection with Sparsely Annotated Label

Monocular 3D object detection has achieved impressive performance on densely annotated datasets. However, it struggles when only a fraction of objects are labeled due to the high cost of 3D annotation. This sparsely annotated setting is…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Junyoung Jung , Seokwon Kim , Jung Uk Kim

Time-to-Label: Temporal Consistency for Self-Supervised Monocular 3D Object Detection

Monocular 3D object detection continues to attract attention due to the cost benefits and wider availability of RGB cameras. Despite the recent advances and the ability to acquire data at scale, annotation cost and complexity still limit…

Computer Vision and Pattern Recognition · Computer Science 2022-07-27 Issa Mouawad , Nikolas Brasch , Fabian Manhardt , Federico Tombari , Francesca Odone

Exploiting More Information in Sparse Point Cloud for 3D Single Object Tracking

3D single object tracking is a key task in 3D computer vision. However, the sparsity of point clouds makes it difficult to compute the similarity and locate the object, posing big challenges to the 3D tracker. Previous works tried to solve…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Yubo Cui , Jiayao Shan , Zuoxu Gu , Zhiheng Li , Zheng Fang

Unified Unsupervised and Sparsely-Supervised 3D Object Detection by Semantic Pseudo-Labeling and Prototype Learning

3D object detection is essential for autonomous driving and robotic perception, yet its reliance on large-scale manually annotated data limits scalability and adaptability. To reduce annotation dependency, unsupervised and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Yushen He , Lei Zhao , Weidong Chen

View-to-Label: Multi-View Consistency for Self-Supervised 3D Object Detection

For autonomous vehicles, driving safely is highly dependent on the capability to correctly perceive the environment in 3D space, hence the task of 3D object detection represents a fundamental aspect of perception. While 3D sensors deliver…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Issa Mouawad , Nikolas Brasch , Fabian Manhardt , Federico Tombari , Francesca Odone

Every Dataset Counts: Scaling up Monocular 3D Object Detection with Joint Datasets Training

Monocular 3D object detection plays a crucial role in autonomous driving. However, existing monocular 3D detection algorithms depend on 3D labels derived from LiDAR measurements, which are costly to acquire for new datasets and challenging…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Fulong Ma , Xiaoyang Yan , Guoyang Zhao , Xiaojie Xu , Yuxuan Liu , Jun Ma , Ming Liu

3D Object Aided Self-Supervised Monocular Depth Estimation

Monocular depth estimation has been actively studied in fields such as robot vision, autonomous driving, and 3D scene understanding. Given a sequence of color images, unsupervised learning methods based on the framework of…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Songlin Wei , Guodong Chen , Wenzheng Chi , Zhenhua Wang , Lining Sun

Monocular Differentiable Rendering for Self-Supervised 3D Object Detection

3D object detection from monocular images is an ill-posed problem due to the projective entanglement of depth and scale. To overcome this ambiguity, we present a novel self-supervised method for textured 3D shape reconstruction and pose…

Computer Vision and Pattern Recognition · Computer Science 2020-10-01 Deniz Beker , Hiroharu Kato , Mihai Adrian Morariu , Takahiro Ando , Toru Matsuoka , Wadim Kehl , Adrien Gaidon

Weakly Supervised Monocular 3D Object Detection using Multi-View Projection and Direction Consistency

Monocular 3D object detection has become a mainstream approach in automatic driving for its easy application. A prominent advantage is that it does not need LiDAR point clouds during the inference. However, most current methods still rely…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Runzhou Tao , Wencheng Han , Zhongying Qiu , Cheng-zhong Xu , Jianbing Shen

Collecting Consistently High Quality Object Tracks with Minimal Human Involvement by Using Self-Supervised Learning to Detect Tracker Errors

We propose a hybrid framework for consistently producing high-quality object tracks by combining an automated object tracker with little human input. The key idea is to tailor a module for each dataset to intelligently decide when an object…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Samreen Anjum , Suyog Jain , Danna Gurari

Monocular Quasi-Dense 3D Object Tracking

A reliable and accurate 3D tracking framework is essential for predicting future locations of surrounding objects and planning the observer's actions in numerous applications such as autonomous driving. We propose a framework that can…

Computer Vision and Pattern Recognition · Computer Science 2021-03-15 Hou-Ning Hu , Yung-Hsu Yang , Tobias Fischer , Trevor Darrell , Fisher Yu , Min Sun

MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships

Monocular 3D object detection is an essential component in autonomous driving while challenging to solve, especially for those occluded samples which are only partially visible. Most detectors consider each 3D object as an independent…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Yongjian Chen , Lei Tai , Kai Sun , Mingyang Li

Self-supervised Video Object Segmentation

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a.k.a. dense tracking). We make the following contributions: (i) we propose to improve the existing…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Fangrui Zhu , Li Zhang , Yanwei Fu , Guodong Guo , Weidi Xie

Detecting Invisible People

Monocular object detection and tracking have improved drastically in recent years, but rely on a key assumption: that objects are visible to the camera. Many offline tracking approaches reason about occluded objects post-hoc, by linking…

Computer Vision and Pattern Recognition · Computer Science 2020-12-16 Tarasha Khurana , Achal Dave , Deva Ramanan

ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object Detection

Monocular 3D object detection (M3OD) is a significant yet inherently challenging task in autonomous driving due to absence of explicit depth cues in a single RGB image. In this paper, we strive to boost currently underperforming monocular…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Weijia Zhang , Dongnan Liu , Chao Ma , Weidong Cai

Self-supervised Sparse-to-Dense: Self-supervised Depth Completion from LiDAR and Monocular Camera

Depth completion, the technique of estimating a dense depth image from sparse depth measurements, has a variety of applications in robotics and autonomous driving. However, depth completion faces 3 main challenges: the irregularly spaced…

Computer Vision and Pattern Recognition · Computer Science 2018-07-04 Fangchang Ma , Guilherme Venturelli Cavalheiro , Sertac Karaman

Self-supervised 3D Object Detection from Monocular Pseudo-LiDAR

There have been attempts to detect 3D objects by fusion of stereo camera images and LiDAR sensor data or using LiDAR for pre-training and only monocular images for testing, but there have been less attempts to use only monocular image…

Computer Vision and Pattern Recognition · Computer Science 2022-09-21 Curie Kim , Ue-Hwan Kim , Jong-Hwan Kim

SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth

Exploring robust and efficient association methods has always been an important issue in multiple-object tracking (MOT). Although existing tracking methods have achieved impressive performance, congestion and frequent occlusions still pose…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Zelin Liu , Xinggang Wang , Cheng Wang , Wenyu Liu , Xiang Bai

TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection

Semi-supervised 3D object detection is a common strategy employed to circumvent the challenge of manually labeling large-scale autonomous driving perception datasets. Pseudo-labeling approaches to semi-supervised learning adopt a…

Computer Vision and Pattern Recognition · Computer Science 2024-09-18 Philip Jacobson , Yichen Xie , Mingyu Ding , Chenfeng Xu , Masayoshi Tomizuka , Wei Zhan , Ming C. Wu