English
Related papers

Related papers: Multimodal Object Query Initialization for 3D Obje…

200 papers

3D object detection with surrounding cameras has been a promising direction for autonomous driving. In this paper, we present SimMOD, a Simple baseline for Multi-camera Object Detection, to solve the problem. To incorporate multi-view…

Computer Vision and Pattern Recognition · Computer Science 2022-08-23 Yunpeng Zhang , Wenzhao Zheng , Zheng Zhu , Guan Huang , Jie Zhou , Jiwen Lu

Recent advancements in 3D object detection have benefited from multi-modal information from the multi-view cameras and LiDAR sensors. However, the inherent disparities between the modalities pose substantial challenges. We observe that…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Juhan Cha , Minseok Joo , Jihwan Park , Sanghyeok Lee , Injae Kim , Hyunwoo J. Kim

3D object detection from multi-view images in traffic scenarios has garnered significant attention in recent years. Many existing approaches rely on object queries that are generated from 3D reference points to localize objects. However, a…

Computer Vision and Pattern Recognition · Computer Science 2025-10-28 Ziyu Wang , Wenhao Li , Ji Wu

3D object detection is fundamental for safe and robust intelligent transportation systems. Current multi-modal 3D object detectors often rely on complex architectures and training strategies to achieve higher detection accuracy. However,…

Computer Vision and Pattern Recognition · Computer Science 2025-12-24 Xiangxuan Ren , Zhongdao Wang , Pin Tang , Guoqing Wang , Jilai Zheng , Chao Ma

3D object detection is a significant task for autonomous driving. Recently with the progress of vision transformers, the 2D object detection problem is being treated with the set-to-set loss. Inspired by these approaches on 2D object…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Gopi Krishna Erabati , Helder Araujo

Multi-label image classification is about predicting a set of class labels that can be considered as orderless sequential data. Transformers process the sequential data as a whole, therefore they are inherently good at set prediction. The…

Computer Vision and Pattern Recognition · Computer Science 2022-05-17 Vacit Oguz Yazici , Joost van de Weijer , Longlong Yu

Despite radar's popularity in the automotive industry, for fusion-based 3D object detection, most existing works focus on LiDAR and camera fusion. In this paper, we propose TransCAR, a Transformer-based Camera-And-Radar fusion solution for…

Computer Vision and Pattern Recognition · Computer Science 2023-05-02 Su Pang , Daniel Morris , Hayder Radha

LiDAR and camera are two important sensors for 3D object detection in autonomous driving. Despite the increasing popularity of sensor fusion in this field, the robustness against inferior image conditions, e.g., bad illumination and sensor…

Computer Vision and Pattern Recognition · Computer Science 2022-03-23 Xuyang Bai , Zeyu Hu , Xinge Zhu , Qingqiu Huang , Yilun Chen , Hongbo Fu , Chiew-Lan Tai

Query-based transformer has shown great potential in constructing long-range attention in many image-domain tasks, but has rarely been considered in LiDAR-based 3D object detection due to the overwhelming size of the point cloud data. In…

Computer Vision and Pattern Recognition · Computer Science 2022-09-14 Zixiang Zhou , Xiangchen Zhao , Yu Wang , Panqu Wang , Hassan Foroosh

Multi-camera 3D object detection aims to detect and localize objects in 3D space using multiple cameras, which has attracted more attention due to its cost-effectiveness trade-off. However, these methods often struggle with the lack of…

Computer Vision and Pattern Recognition · Computer Science 2025-01-14 Kun Guo , Qiang Ling

We introduce a framework for multi-camera 3D object detection. In contrast to existing works, which estimate 3D bounding boxes directly from monocular images or use depth prediction networks to generate input for 3D object detection from 2D…

Computer Vision and Pattern Recognition · Computer Science 2021-10-14 Yue Wang , Vitor Guizilini , Tianyuan Zhang , Yilun Wang , Hang Zhao , Justin Solomon

Detecting dynamic objects and predicting static road information such as drivable areas and ground heights are crucial for safe autonomous driving. Previous works studied each perception task separately, and lacked a collective quantitative…

Computer Vision and Pattern Recognition · Computer Science 2021-03-09 Di Feng , Yiyang Zhou , Chenfeng Xu , Masayoshi Tomizuka , Wei Zhan

Recent query-based 3D object detection methods using camera and LiDAR inputs have shown strong performance, but existing query initialization strategies,such as random sampling or BEV heatmap-based sampling, often result in inefficient…

Computer Vision and Pattern Recognition · Computer Science 2026-02-10 Janghyun Baek , Mincheol Chang , Seokha Moon , Seung Joon Lee , Jinkyu Kim

Three-dimensional Object Detection from multi-view cameras and LiDAR is a crucial component for autonomous driving and smart transportation. However, in the process of basic feature extraction, perspective transformation, and feature…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Zhongyu Xia , Hansong Yang , Yongtao Wang

The estimation of uncertainty in robotic vision, such as 3D object detection, is an essential component in developing safe autonomous systems aware of their own performance. However, the deployment of current uncertainty estimation methods…

Computer Vision and Pattern Recognition · Computer Science 2022-07-27 Matthew Pitropov , Chengjie Huang , Vahdat Abdelzad , Krzysztof Czarnecki , Steven Waslander

3D object detection is crucial for autonomous driving, leveraging both LiDAR point clouds for precise depth information and camera images for rich semantic information. Therefore, the multi-modal methods that combine both modalities offer…

Computer Vision and Pattern Recognition · Computer Science 2025-04-07 Kaidong Li , Tianxiao Zhang , Kuan-Chuan Peng , Guanghui Wang

Incremental object detection (IOD) aims to sequentially learn new classes, while maintaining the capability to locate and identify old ones. As the training data arrives with annotations only with new classes, IOD suffers from catastrophic…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Jichuan Zhang , Wei Li , Shuang Cheng , Ya-Li Li , Shengjin Wang

Service mobile robots are often required to avoid dynamic objects while performing their tasks, but they usually have only limited computational resources. To further advance the practical application of service robots in complex dynamic…

Robotics · Computer Science 2026-02-25 Yushen He , Lei Zhao , Tianchen Deng , Zipeng Fang , Weidong Chen

For 3D object detection, both camera and lidar have been demonstrated to be useful sensory devices for providing complementary information about the same scenery with data representations in different modalities, e.g., 2D RGB image vs 3D…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Xinhao Xiang , Jiawei Zhang

This work aims to address the challenges in autonomous driving by focusing on the 3D perception of the environment using roadside LiDARs. We design a 3D object detection model that can detect traffic participants in roadside LiDARs in…

Computer Vision and Pattern Recognition · Computer Science 2022-07-13 Walter Zimmer , Jialong Wu , Xingcheng Zhou , Alois C. Knoll
‹ Prev 1 2 3 10 Next ›