Related papers: Anyview: Generalizable Indoor 3D Object Detection …

Towards Generalizable Multi-Camera 3D Object Detection via Perspective Debiasing

Detecting objects in 3D space using multiple cameras, known as Multi-Camera 3D Object Detection (MC3D-Det), has gained prominence with the advent of bird's-eye view (BEV) approaches. However, these methods often struggle when faced with…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Hao Lu , Yunpeng Zhang , Qing Lian , Dalong Du , Yingcong Chen

PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points

Detecting 3D objects from a single RGB image is intrinsically ambiguous, thus requiring appropriate prior knowledge and intermediate representations as constraints to reduce the uncertainties and improve the consistencies between the 2D…

Computer Vision and Pattern Recognition · Computer Science 2019-12-18 Siyuan Huang , Yixin Chen , Tao Yuan , Siyuan Qi , Yixin Zhu , Song-Chun Zhu

3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection

This paper proposes 3DGeoDet, a novel geometry-aware 3D object detection approach that effectively handles single- and multi-view RGB images in indoor and outdoor environments, showcasing its general-purpose applicability. The key challenge…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Yi Zhang , Yi Wang , Yawen Cui , Lap-Pui Chau

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

In this paper, we introduce the task of multi-view RGB-based 3D object detection as an end-to-end optimization problem. To address this problem, we propose ImVoxelNet, a novel fully convolutional method of 3D object detection based on…

Computer Vision and Pattern Recognition · Computer Science 2021-10-18 Danila Rukhovich , Anna Vorontsova , Anton Konushin

From Points to Multi-Object 3D Reconstruction

We propose a method to detect and reconstruct multiple 3D objects from a single RGB image. The key idea is to optimize for detection, alignment and shape jointly over all objects in the RGB image, while focusing on realistic and physically…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Francis Engelmann , Konstantinos Rematas , Bastian Leibe , Vittorio Ferrari

Accurate Monocular Object Detection via Color-Embedded 3D Reconstruction for Autonomous Driving

In this paper, we propose a monocular 3D object detection framework in the domain of autonomous driving. Unlike previous image-based methods which focus on RGB feature extracted from 2D images, our method solves this problem in the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Xinzhu Ma , Zhihui Wang , Haojie Li , Pengbo Zhang , Xin Fan , Wanli Ouyang

Monocular 3D Object Detection via Geometric Reasoning on Keypoints

Monocular 3D object detection is well-known to be a challenging vision task due to the loss of depth information; attempts to recover depth using separate image-only approaches lead to unstable and noisy depth estimates, harming 3D…

Computer Vision and Pattern Recognition · Computer Science 2019-05-15 Ivan Barabanau , Alexey Artemov , Evgeny Burnaev , Vyacheslav Murashkin

UniGeo: A Unified 3D Indoor Object Detection Framework Integrating Geometry-Aware Learning and Dynamic Channel Gating

The growing adoption of robotics and augmented reality in real-world applications has driven considerable research interest in 3D object detection based on point clouds. While previous methods address unified training across multiple…

Computer Vision and Pattern Recognition · Computer Science 2026-02-02 Xing Yi , Jinyang Huang , Feng-Qi Cui , Anyang Tong , Ruimin Wang , Liu Liu , Dan Guo

3D Object Recognition By Corresponding and Quantizing Neural 3D Scene Representations

We propose a system that learns to detect objects and infer their 3D poses in RGB-D images. Many existing systems can identify objects and infer 3D poses, but they heavily rely on human labels and 3D annotations. The challenge here is to…

Computer Vision and Pattern Recognition · Computer Science 2020-11-02 Mihir Prabhudesai , Shamit Lal , Hsiao-Yu Fish Tung , Adam W. Harley , Shubhankar Potdar , Katerina Fragkiadaki

General Geometry-aware Weakly Supervised 3D Object Detection

3D object detection is an indispensable component for scene understanding. However, the annotation of large-scale 3D datasets requires significant human effort. To tackle this problem, many methods adopt weakly supervised 3D object…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Guowen Zhang , Junsong Fan , Liyi Chen , Zhaoxiang Zhang , Zhen Lei , Lei Zhang

VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation

Input aggregation is a simple technique used by state-of-the-art LiDAR 3D object detectors to improve detection. However, increasing aggregation is known to have diminishing returns and even performance degradation, due to objects…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Chengjie Huang , Vahdat Abdelzad , Sean Sedwards , Krzysztof Czarnecki

View N-gram Network for 3D Object Retrieval

How to aggregate multi-view representations of a 3D object into an informative and discriminative one remains a key challenge for multi-view 3D object retrieval. Existing methods either use view-wise pooling strategies which neglect the…

Computer Vision and Pattern Recognition · Computer Science 2019-08-16 Xinwei He , Tengteng Huang , Song Bai , Xiang Bai

MonoGRNet: A General Framework for Monocular 3D Object Detection

Detecting and localizing objects in the real 3D space, which plays a crucial role in scene understanding, is particularly challenging given only a monocular image due to the geometric information loss during imagery projection. We propose…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Zengyi Qin , Jinglu Wang , Yan Lu

Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction

This work presents SGCDet, a novel multi-view indoor 3D object detection framework based on adaptive 3D volume construction. Unlike previous approaches that restrict the receptive field of voxels to fixed locations on images, we introduce a…

Computer Vision and Pattern Recognition · Computer Science 2025-07-25 Runmin Zhang , Zhu Yu , Si-Yuan Cao , Lingyu Zhu , Guangyi Zhang , Xiaokai Bai , Hui-Liang Shen

Complete 3D Scene Parsing from an RGBD Image

One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors. In this paper, we aim to interpret indoor scenes from one RGBD image. Our representation encodes the layout of orthogonal walls and…

Computer Vision and Pattern Recognition · Computer Science 2018-11-15 Chuhang Zou , Ruiqi Guo , Zhizhong Li , Derek Hoiem

UniDet3D: Multi-dataset Indoor 3D Object Detection

Growing customer demand for smart solutions in robotics and augmented reality has attracted considerable attention to 3D object detection from point clouds. Yet, existing indoor datasets taken individually are too small and insufficiently…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Maksim Kolodiazhnyi , Anna Vorontsova , Matvey Skripkin , Danila Rukhovich , Anton Konushin

Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for Autonomous Driving

Recent years have witnessed huge successes in 3D object detection to recognize common objects for autonomous driving (e.g., vehicles and pedestrians). However, most methods rely heavily on a large amount of well-labeled training data. This…

Computer Vision and Pattern Recognition · Computer Science 2023-02-09 Jiawei Liu , Xingping Dong , Sanyuan Zhao , Jianbing Shen

MonoGlass3D: Monocular 3D Glass Detection with Plane Regression and Adaptive Feature Fusion

Detecting and localizing glass in 3D environments poses significant challenges for visual perception systems, as the optical properties of glass often hinder conventional sensors from accurately distinguishing glass surfaces. The lack of…

Robotics · Computer Science 2025-09-09 Kai Zhang , Guoyang Zhao , Jianxing Shi , Bonan Liu , Weiqing Qi , Jun Ma

AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection

Existing deep learning-based approaches for monocular 3D object detection in autonomous driving often model the object as a rotated 3D cuboid while the object's geometric shape has been ignored. In this work, we propose an approach for…

Computer Vision and Pattern Recognition · Computer Science 2021-08-26 Zongdai Liu , Dingfu Zhou , Feixiang Lu , Jin Fang , Liangjun Zhang

SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection

Current 3D object detection methods for indoor scenes mainly follow the voting-and-grouping strategy to generate proposals. However, most methods utilize instance-agnostic groupings, such as ball query, leading to inconsistent semantic…

Computer Vision and Pattern Recognition · Computer Science 2023-12-22 Yun Zhu , Le Hui , Yaqi Shen , Jin Xie