Related papers: Virtual Sparse Convolution for Multimodal 3D Objec…

Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive safely. Given the limited hardware resources, existing 3D perception models are not able to recognize small instances (e.g., pedestrians, cyclists)…

Computer Vision and Pattern Recognition · Computer Science 2020-08-14 Haotian Tang , Zhijian Liu , Shengyu Zhao , Yujun Lin , Ji Lin , Hanrui Wang , Song Han

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

3D object detectors usually rely on hand-crafted proxies, e.g., anchors or centers, and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be densified and processed by dense prediction heads, which inevitably…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Yukang Chen , Jianhui Liu , Xiangyu Zhang , Xiaojuan Qi , Jiaya Jia

Point Virtual Transformer

LiDAR-based 3D object detectors often struggle to detect far-field objects due to the sparsity of point clouds at long ranges, which limits the availability of reliable geometric cues. To address this, prior approaches augment LiDAR data…

Computer Vision and Pattern Recognition · Computer Science 2026-02-09 Veerain Sood , Bnalin , Gaurav Pandey

SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Camera-based 3D object detection in BEV (Bird's Eye View) space has drawn great attention over the past few years. Dense detectors typically follow a two-stage pipeline by first constructing a dense BEV feature and then performing object…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Haisong Liu , Yao Teng , Tao Lu , Haiguang Wang , Limin Wang

VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and Stereo Data Fusion

It has been well recognized that fusing the complementary information from depth-aware LiDAR point clouds and semantic-rich stereo images would benefit 3D object detection. Nevertheless, it is not trivial to explore the inherently unnatural…

Computer Vision and Pattern Recognition · Computer Science 2021-12-02 Hanqi Zhu , Jiajun Deng , Yu Zhang , Jianmin Ji , Qiuyu Mao , Houqiang Li , Yanyong Zhang

SRCN3D: Sparse R-CNN 3D for Compact Convolutional Multi-View 3D Object Detection and Tracking

Detection and tracking of moving objects is an essential component in environmental perception for autonomous driving. In the flourishing field of multi-view 3D camera-based detectors, different transformer-based pipelines are designed to…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Yining Shi , Jingyan Shen , Yifan Sun , Yunlong Wang , Jiaxin Li , Shiqi Sun , Kun Jiang , Diange Yang

Spatial Pruned Sparse Convolution for Efficient 3D Object Detection

3D scenes are dominated by a large number of background points, which is redundant for the detection task that mainly needs to focus on foreground objects. In this paper, we analyze major components of existing sparse 3D CNNs and find that…

Computer Vision and Pattern Recognition · Computer Science 2022-09-29 Jianhui Liu , Yukang Chen , Xiaoqing Ye , Zhuotao Tian , Xiao Tan , Xiaojuan Qi

PCSCNet: Fast 3D Semantic Segmentation of LiDAR Point Cloud for Autonomous Car using Point Convolution and Sparse Convolution Network

The autonomous car must recognize the driving environment quickly for safe driving. As the Light Detection And Range (LiDAR) sensor is widely used in the autonomous car, fast semantic segmentation of LiDAR point cloud, which is the…

Computer Vision and Pattern Recognition · Computer Science 2022-02-22 Jaehyun Park , Chansoo Kim , Kichun Jo

Multi Voxel-Point Neurons Convolution (MVPConv) for Fast and Accurate 3D Deep Learning

We present a new convolutional neural network, called Multi Voxel-Point Neurons Convolution (MVPConv), for fast and accurate 3D deep learning. The previous works adopt either individual point-based features or local-neighboring voxel-based…

Computer Vision and Pattern Recognition · Computer Science 2021-05-03 Wei Zhou , Xin Cao , Xiaodan Zhang , Xingxing Hao , Dekui Wang , Ying He

FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels

LiDAR-based fully sparse architecture has garnered increasing attention. FSDv1 stands out as a representative work, achieving impressive efficacy and efficiency, albeit with intricate structures and handcrafted designs. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Lue Fan , Feng Wang , Naiyan Wang , Zhaoxiang Zhang

Selectively Dilated Convolution for Accuracy-Preserving Sparse Pillar-based Embedded 3D Object Detection

Pillar-based 3D object detection has gained traction in self-driving technology due to its speed and accuracy facilitated by the artificial densification of pillars for GPU-friendly processing. However, dense pillar processing fundamentally…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Seongmin Park , Minjae Lee , Junwon Choi , Jungwook Choi

StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection

Multi-view 3D object detection is a fundamental task in autonomous driving perception, where achieving a balance between detection accuracy and computational efficiency remains crucial. Sparse query-based 3D detectors efficiently aggregate…

Computer Vision and Pattern Recognition · Computer Science 2025-12-22 Di Wu , Feng Yang , Wenhui Zhao , Jinwen Yu , Pan Liao , Benlian Xu , Dingwen Zhang

Point Density-Aware Voxels for LiDAR 3D Object Detection

LiDAR has become one of the primary 3D object detection sensors in autonomous driving. However, LiDAR's diverging point pattern with increasing distance results in a non-uniform sampled point cloud ill-suited to discretized volumetric…

Computer Vision and Pattern Recognition · Computer Science 2022-03-23 Jordan S. K. Hu , Tianshu Kuai , Steven L. Waslander

MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception

The safe operation of automated vehicles depends on their ability to perceive the environment comprehensively. However, occlusion, sensor range, and environmental factors limit their perception capabilities. To overcome these limitations,…

Computer Vision and Pattern Recognition · Computer Science 2024-08-13 Sven Teufel , Jörg Gamerdinger , Georg Volk , Oliver Bringmann

SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection

Most previous 3D object detection methods that leverage the multi-modality of LiDAR and cameras utilize the Bird's Eye View (BEV) space for intermediate feature representation. However, this space uses a low x, y-resolution and sacrifices…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Hyeongseok Son , Jia He , Seung-In Park , Ying Min , Yunhao Zhang , ByungIn Yoo

PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection

3D object detection is receiving increasing attention from both industry and academia thanks to its wide applications in various fields. In this paper, we propose Point-Voxel Region-based Convolution Neural Networks (PV-RCNNs) for 3D object…

Computer Vision and Pattern Recognition · Computer Science 2022-11-09 Shaoshuai Shi , Li Jiang , Jiajun Deng , Zhe Wang , Chaoxu Guo , Jianping Shi , Xiaogang Wang , Hongsheng Li

SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous Driving

3D object detection using point cloud (PC) data is essential for perception pipelines of autonomous driving, where efficient encoding is key to meeting stringent resource and latency requirements. PointPillars, a widely adopted bird's-eye…

Hardware Architecture · Computer Science 2024-01-17 Minjae Lee , Seongmin Park , Hyungmin Kim , Minyong Yoon , Janghwan Lee , Jun Won Choi , Nam Sung Kim , Mingu Kang , Jungwook Choi

Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views

Reconstructing 3D objects from extremely sparse views is a long-standing and challenging problem. While recent techniques employ image diffusion models for generating plausible images at novel viewpoints or for distilling pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Zi-Xin Zou , Weihao Cheng , Yan-Pei Cao , Shi-Sheng Huang , Ying Shan , Song-Hai Zhang

Voxel Transformer for 3D Object Detection

We present Voxel Transformer (VoTr), a novel and effective voxel-based Transformer backbone for 3D object detection from point clouds. Conventional 3D convolutional backbones in voxel-based 3D detectors cannot efficiently capture large…

Computer Vision and Pattern Recognition · Computer Science 2021-09-14 Jiageng Mao , Yujing Xue , Minzhe Niu , Haoyue Bai , Jiashi Feng , Xiaodan Liang , Hang Xu , Chunjing Xu

SDVRF: Sparse-to-Dense Voxel Region Fusion for Multi-modal 3D Object Detection

In the perception task of autonomous driving, multi-modal methods have become a trend due to the complementary characteristics of LiDAR point clouds and image data. However, the performance of multi-modal methods is usually limited by the…

Computer Vision and Pattern Recognition · Computer Science 2023-09-19 Binglu Ren , Jianqin Yin