Related papers: LinK: Linear Kernel for LiDAR-based 3D Perception

LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels

Autonomous systems need to process large-scale, sparse, and irregular point clouds with limited compute resources. Consequently, it is essential to develop LiDAR perception methods that are both efficient and effective. Although naively…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Tuo Feng , Wenguan Wang , Fan Ma , Yi Yang

LinK3D: Linear Keypoints Representation for 3D LiDAR Point Cloud

Feature extraction and matching are the basic parts of many robotic vision tasks, such as 2D or 3D object detection, recognition, and registration. As is known, 2D feature extraction and matching have already achieved great success.…

Computer Vision and Pattern Recognition · Computer Science 2024-01-11 Yunge Cui , Yinlong Zhang , Jiahua Dong , Haibo Sun , Xieyuanli Chen , Feng Zhu

LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs

Recent advance in 2D CNNs has revealed that large kernels are important. However, when directly applying large convolutional kernels in 3D CNNs, severe difficulties are met, where those successful module designs in 2D become surprisingly…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Yukang Chen , Jianhui Liu , Xiangyu Zhang , Xiaojuan Qi , Jiaya Jia

Spatially-Adaptive Gradient Re-parameterization for 3D Large Kernel Optimization

Large kernel convolutions offer a scalable alternative to vision transformers for high-resolution 3D volumetric analysis, yet naively increasing kernel size often leads to optimization instability. Motivated by the spatial bias inherent in…

Computer Vision and Pattern Recognition · Computer Science 2026-02-02 Ho Hin Lee , Quan Liu , Shunxing Bao , Yuankai Huo , Bennett A. Landman

PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution

Recently, some large kernel convnets strike back with appealing performance and efficiency. However, given the square complexity of convolution, scaling up kernels can bring about an enormous amount of parameters and the proliferated…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Honghao Chen , Xiangxiang Chu , Yongjian Ren , Xin Zhao , Kaiqi Huang

MGTANet: Encoding Sequential LiDAR Points Using Long Short-Term Motion-Guided Temporal Attention for 3D Object Detection

Most scanning LiDAR sensors generate a sequence of point clouds in real-time. While conventional 3D object detectors use a set of unordered LiDAR points acquired over a fixed time interval, recent studies have revealed that substantial…

Computer Vision and Pattern Recognition · Computer Science 2022-12-22 Junho Koh , Junhyung Lee , Youngwoo Lee , Jaekyum Kim , Jun Won Choi

*: Improving the 3D detector by introducing Voxel2Pillar feature encoding and extracting multi-scale features

The multi-line LiDAR is widely used in autonomous vehicles, so point cloud-based 3D detectors are essential for autonomous driving. Extracting rich multi-scale features is crucial for point cloud-based 3D detectors in autonomous driving due…

Computer Vision and Pattern Recognition · Computer Science 2024-11-14 Xusheng Li , Chengliang Wang , Shumao Wang , Zhuo Zeng , Ji Liu

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few large convolutional kernels instead of a stack of small…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Xiaohan Ding , Xiangyu Zhang , Yizhuang Zhou , Jungong Han , Guiguang Ding , Jian Sun

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR-based Perception

State-of-the-art methods for driving-scene LiDAR-based perception (including point cloud semantic segmentation, panoptic segmentation and 3D detection, \etc) often project the point clouds to 2D space and then process them via 2D…

Computer Vision and Pattern Recognition · Computer Science 2021-09-14 Xinge Zhu , Hui Zhou , Tai Wang , Fangzhou Hong , Wei Li , Yuexin Ma , Hongsheng Li , Ruigang Yang , Dahua Lin

A Compact Kernel Approximation for 3D Action Recognition

3D action recognition was shown to benefit from a covariance representation of the input data (joint 3D positions). A kernel machine feed with such feature is an effective paradigm for 3D action recognition, yielding state-of-the-art…

Computer Vision and Pattern Recognition · Computer Science 2017-10-05 Jacopo Cavazza , Pietro Morerio , Vittorio Murino

To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels

3D object detection is vital for many robotics applications. For tasks where a 2D perspective range image exists, we propose to learn a 3D representation directly from this range image view. To this end, we designed a 2D convolutional…

Computer Vision and Pattern Recognition · Computer Science 2021-06-28 Yuning Chai , Pei Sun , Jiquan Ngiam , Weiyue Wang , Benjamin Caine , Vijay Vasudevan , Xiao Zhang , Dragomir Anguelov

VIN: Voxel-based Implicit Network for Joint 3D Object Detection and Segmentation for Lidars

A unified neural network structure is presented for joint 3D object detection and point cloud segmentation in this paper. We leverage rich supervision from both detection and segmentation labels rather than using just one of them. In…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Yuanxin Zhong , Minghan Zhu , Huei Peng

Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN

Visual Attention Networks (VAN) with Large Kernel Attention (LKA) modules have been shown to provide remarkable performance, that surpasses Vision Transformers (ViTs), on a range of vision-based tasks. However, the depth-wise convolutional…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Kin Wai Lau , Lai-Man Po , Yasar Abbas Ur Rehman

Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation

State-of-the-art methods for large-scale driving-scene LiDAR segmentation often project the point clouds to 2D space and then process them via 2D convolution. Although this corporation shows the competitiveness in the point cloud, it…

Computer Vision and Pattern Recognition · Computer Science 2020-11-20 Xinge Zhu , Hui Zhou , Tai Wang , Fangzhou Hong , Yuexin Ma , Wei Li , Hongsheng Li , Dahua Lin

LidarMultiNet: Towards a Unified Multi-Task Network for LiDAR Perception

LiDAR-based 3D object detection, semantic segmentation, and panoptic segmentation are usually implemented in specialized networks with distinctive architectures that are difficult to adapt to each other. This paper presents LidarMultiNet, a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Dongqiangzi Ye , Zixiang Zhou , Weijia Chen , Yufei Xie , Yu Wang , Panqu Wang , Hassan Foroosh

Multi-scale Location-aware Kernel Representation for Object Detection

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of object proposals for final classification and regression. Recent classification methods…

Computer Vision and Pattern Recognition · Computer Science 2018-04-03 Hao Wang , Qilong Wang , Mingqi Gao , Peihua Li , Wangmeng Zuo

Metric and Kernel Learning using a Linear Transformation

Metric and kernel learning are important in several machine learning applications. However, most existing metric learning algorithms are limited to learning metrics over low-dimensional data, while existing kernel learning algorithms are…

Machine Learning · Computer Science 2009-11-02 Prateek Jain , Brian Kulis , Jason V. Davis , Inderjit S. Dhillon

Learning Depth-Guided Convolutions for Monocular 3D Object Detection

3D object detection from a single image without LiDAR is a challenging task due to the lack of accurate depth information. Conventional 2D convolutions are unsuitable for this task because they fail to capture local object and its scale…

Computer Vision and Pattern Recognition · Computer Science 2019-12-16 Mingyu Ding , Yuqi Huo , Hongwei Yi , Zhe Wang , Jianping Shi , Zhiwu Lu , Ping Luo

Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation

Medical image segmentation has seen significant improvements with transformer models, which excel in grasping far-reaching contexts and global contextual information. However, the increasing computational demands of these models,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-04 Reza Azad , Leon Niggemeier , Michael Huttemann , Amirhossein Kazerouni , Ehsan Khodapanah Aghdam , Yury Velichko , Ulas Bagci , Dorit Merhof

Feature Visualization in 3D Convolutional Neural Networks

Understanding the computations of convolutional neural networks requires effective visualization of their kernels. While maximal activation methods have proven successful in highlighting the preferred features of 2D convolutional kernels,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Chunpeng Li , Ya-tang Li