Related papers: Virtual Sparse Convolution for Multimodal 3D Objec…

200 papers

Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive safely. Given the limited hardware resources, existing 3D perception models are not able to recognize small instances (e.g., pedestrians, cyclists)…

Computer Vision and Pattern Recognition · Computer Science · 2020-08-14 · Haotian Tang, Zhijian Liu, Shengyu Zhao, Yujun Lin, Ji Lin, Hanrui Wang, Song Han

3D object detectors usually rely on hand-crafted proxies, e.g., anchors or centers, and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be densified and processed by dense prediction heads, which inevitably…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-21 · Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, Jiaya Jia

LiDAR-based 3D object detectors often struggle to detect far-field objects due to the sparsity of point clouds at long ranges, which limits the availability of reliable geometric cues. To address this, prior approaches augment LiDAR data…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-09 · Veerain Sood, Bnalin, Gaurav Pandey

Camera-based 3D object detection in BEV (Bird's Eye View) space has drawn great attention over the past few years. Dense detectors typically follow a two-stage pipeline by first constructing a dense BEV feature and then performing object…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-06 · Haisong Liu, Yao Teng, Tao Lu, Haiguang Wang, Limin Wang

It has been well recognized that fusing the complementary information from depth-aware LiDAR point clouds and semantic-rich stereo images would benefit 3D object detection. Nevertheless, it is not trivial to explore the inherently unnatural…

Computer Vision and Pattern Recognition · Computer Science · 2021-12-02 · Hanqi Zhu, Jiajun Deng, Yu Zhang, Jianmin Ji, Qiuyu Mao, Houqiang Li, Yanyong Zhang

Detection and tracking of moving objects is an essential component in environmental perception for autonomous driving. In the flourishing field of multi-view 3D camera-based detectors, different transformer-based pipelines are designed to…

Computer Vision and Pattern Recognition · Computer Science · 2023-07-04 · Yining Shi, Jingyan Shen, Yifan Sun, Yunlong Wang, Jiaxin Li, Shiqi Sun, Kun Jiang, Diange Yang

3D scenes are dominated by a large number of background points, which are redundant for the detection task that mainly needs to focus on foreground objects. In this paper, we analyze major components of existing sparse 3D CNNs and find that…

Computer Vision and Pattern Recognition · Computer Science · 2022-09-29 · Jianhui Liu, Yukang Chen, Xiaoqing Ye, Zhuotao Tian, Xiao Tan, Xiaojuan Qi

An autonomous car must recognize the driving environment quickly for safe driving. As the Light Detection and Ranging (LiDAR) sensor is widely used in autonomous cars, fast semantic segmentation of the LiDAR point cloud, which is the…

Computer Vision and Pattern Recognition · Computer Science · 2022-02-22 · Jaehyun Park, Chansoo Kim, Kichun Jo

We present a new convolutional neural network, called Multi Voxel-Point Neurons Convolution (MVPConv), for fast and accurate 3D deep learning. The previous works adopt either individual point-based features or local-neighboring voxel-based…

Computer Vision and Pattern Recognition · Computer Science · 2021-05-03 · Wei Zhou, Xin Cao, Xiaodan Zhang, Xingxing Hao, Dekui Wang, Ying He

LiDAR-based fully sparse architecture has garnered increasing attention. FSDv1 stands out as a representative work, achieving impressive efficacy and efficiency, albeit with intricate structures and handcrafted designs. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-08 · Lue Fan, Feng Wang, Naiyan Wang, Zhaoxiang Zhang

Pillar-based 3D object detection has gained traction in self-driving technology due to its speed and accuracy facilitated by the artificial densification of pillars for GPU-friendly processing. However, dense pillar processing fundamentally…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-27 · Seongmin Park, Minjae Lee, Junwon Choi, Jungwook Choi

Multi-view 3D object detection is a fundamental task in autonomous driving perception, where achieving a balance between detection accuracy and computational efficiency remains crucial. Sparse query-based 3D detectors efficiently aggregate…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-22 · Di Wu, Feng Yang, Wenhui Zhao, Jinwen Yu, Pan Liao, Benlian Xu, Dingwen Zhang

LiDAR has become one of the primary 3D object detection sensors in autonomous driving. However, LiDAR's diverging point pattern with increasing distance results in a non-uniform sampled point cloud ill-suited to discretized volumetric…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-23 · Jordan S. K. Hu, Tianshu Kuai, Steven L. Waslander

The safe operation of automated vehicles depends on their ability to perceive the environment comprehensively. However, occlusion, sensor range, and environmental factors limit their perception capabilities. To overcome these limitations,…

Computer Vision and Pattern Recognition · Computer Science · 2024-08-13 · Sven Teufel, Jörg Gamerdinger, Georg Volk, Oliver Bringmann

Most previous 3D object detection methods that leverage the multi-modality of LiDAR and cameras utilize the Bird's Eye View (BEV) space for intermediate feature representation. However, this space uses a low x, y-resolution and sacrifices…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-12 · Hyeongseok Son, Jia He, Seung-In Park, Ying Min, Yunhao Zhang, ByungIn Yoo

3D object detection is receiving increasing attention from both industry and academia thanks to its wide applications in various fields. In this paper, we propose Point-Voxel Region-based Convolution Neural Networks (PV-RCNNs) for 3D object…

Computer Vision and Pattern Recognition · Computer Science · 2022-11-09 · Shaoshuai Shi, Li Jiang, Jiajun Deng, Zhe Wang, Chaoxu Guo, Jianping Shi, Xiaogang Wang, Hongsheng Li

3D object detection using point cloud (PC) data is essential for perception pipelines of autonomous driving, where efficient encoding is key to meeting stringent resource and latency requirements. PointPillars, a widely adopted bird's-eye…

Hardware Architecture · Computer Science · 2024-01-17 · Minjae Lee, Seongmin Park, Hyungmin Kim, Minyong Yoon, Janghwan Lee, Jun Won Choi, Nam Sung Kim, Mingu Kang, Jungwook Choi

Reconstructing 3D objects from extremely sparse views is a long-standing and challenging problem. While recent techniques employ image diffusion models for generating plausible images at novel viewpoints or for distilling pre-trained…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-21 · Zi-Xin Zou, Weihao Cheng, Yan-Pei Cao, Shi-Sheng Huang, Ying Shan, Song-Hai Zhang

We present Voxel Transformer (VoTr), a novel and effective voxel-based Transformer backbone for 3D object detection from point clouds. Conventional 3D convolutional backbones in voxel-based 3D detectors cannot efficiently capture large…

Computer Vision and Pattern Recognition · Computer Science · 2021-09-14 · Jiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu

In the perception task of autonomous driving, multi-modal methods have become a trend due to the complementary characteristics of LiDAR point clouds and image data. However, the performance of multi-modal methods is usually limited by the…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-19 · Binglu Ren, Jianqin Yin