Related papers: Omni-Dimensional Dynamic Convolution

Dynamic Convolution: Attention over Convolution Kernels

Light-weight convolutional neural networks (CNNs) suffer performance degradation as their low computational budgets constrain both the depth (number of convolution layers) and the width (number of channels) of CNNs, resulting in limited…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Yinpeng Chen , Xiyang Dai , Mengchen Liu , Dongdong Chen , Lu Yuan , Zicheng Liu

Cross-Task Benchmarking of CNN Architectures

This project provides a comparative study of dynamic convolutional neural networks (CNNs) for various tasks, including image classification, segmentation, and time series analysis. Based on the ResNet-18 architecture, we compare five…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Kamal Sherawat , Vikrant Bhati

LDConv: Linear deformable convolution for improving convolutional neural networks

Neural networks based on convolutional operations have achieved remarkable results in the field of deep learning, but there are two inherent flaws in standard convolutional operations. On the one hand, the convolution operation is confined…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Xin Zhang , Yingze Song , Tingting Song , Degang Yang , Yichen Ye , Jie Zhou , Liming Zhang

Efficient Dynamic Attention 3D Convolution for Hyperspectral Image Classification

Deep neural networks face several challenges in hyperspectral image classification, including insufficient utilization of joint spatial-spectral information, gradient vanishing with increasing depth, and overfitting. To enhance feature…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Guandong Li , Mengxia Ye

OneDConv: Generalized Convolution For Transform-Invariant Representation

Convolutional Neural Networks (CNNs) have exhibited their great power in a variety of vision tasks. However, the lack of transform-invariant property limits their further applications in complicated real-world scenarios. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Tong Zhang , Haohan Weng , Ke Yi , C. L. Philip Chen

MKConv: Multidimensional Feature Representation for Point Cloud Analysis

Despite the remarkable success of deep learning, an optimal convolution operation on point clouds remains elusive owing to their irregular data structure. Existing methods mainly focus on designing an effective continuous kernel function…

Computer Vision and Pattern Recognition · Computer Science 2023-07-18 Sungmin Woo , Dogyoon Lee , Sangwon Hwang , Woojin Kim , Sangyoun Lee

Adaptive Convolution for CNN-based Speech Enhancement Models

Deep learning-based speech enhancement methods have significantly improved speech quality and intelligibility. Convolutional neural networks (CNNs) have been proven to be essential components of many high-performance models. In this paper,…

Audio and Speech Processing · Electrical Eng. & Systems 2025-11-11 Dahan Wang , Xiaobin Rong , Shiruo Sun , Yuxiang Hu , Changbao Zhu , Jing Lu

Revisiting Dynamic Convolution via Matrix Decomposition

Recent research in dynamic convolution shows substantial performance boost for efficient CNNs, due to the adaptive aggregation of K static convolution kernels. It has two limitations: (a) it increases the number of convolutional weights by…

Computer Vision and Pattern Recognition · Computer Science 2021-03-17 Yunsheng Li , Yinpeng Chen , Xiyang Dai , Mengchen Liu , Dongdong Chen , Ye Yu , Lu Yuan , Zicheng Liu , Mei Chen , Nuno Vasconcelos

Multi-dimensional frequency dynamic convolution with confident mean teacher for sound event detection

Recently, convolutional neural networks (CNNs) have been widely used in sound event detection (SED). However, traditional convolution is deficient in learning time-frequency domain representation of different sound events. To address this…

Audio and Speech Processing · Electrical Eng. & Systems 2023-02-22 Shengchang Xiao , Xueshuai Zhang , Pengyuan Zhang

FMDConv: Fast Multi-Attention Dynamic Convolution via Speed-Accuracy Trade-off

Spatial convolution is fundamental in constructing deep Convolutional Neural Networks (CNNs) for visual recognition. While dynamic convolution enhances model accuracy by adaptively combining static kernels, it incurs significant…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Tianyu Zhang , Fan Wan , Haoran Duan , Kevin W. Tong , Jingjing Deng , Yang Long

Dynamic Region-Aware Convolution

We propose a new convolution called Dynamic Region-Aware Convolution (DRConv), which can automatically assign multiple filters to corresponding spatial regions where features have similar representation. In this way, DRConv outperforms…

Computer Vision and Pattern Recognition · Computer Science 2021-03-16 Jin Chen , Xijun Wang , Zichao Guo , Xiangyu Zhang , Jian Sun

ODE-CNN: Omnidirectional Depth Extension Networks

Omnidirectional 360{\deg} camera proliferates rapidly for autonomous robots since it significantly enhances the perception ability by widening the field of view(FoV). However, corresponding 360{\deg} depth sensors, which are also critical…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Xinjing Cheng , Peng Wang , Yanqi Zhou , Chenye Guan , Ruigang Yang

Adaptive Growth: Real-time CNN Layer Expansion

Deep Neural Networks (DNNs) have shown unparalleled achievements in numerous applications, reflecting their proficiency in managing vast data sets. Yet, their static structure limits their adaptability in ever-changing environments. This…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Yunjie Zhu , Yunhao Chen

DO-Conv: Depthwise Over-parameterized Convolutional Layer

Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs). In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Jinming Cao , Yangyan Li , Mingchao Sun , Ying Chen , Dani Lischinski , Daniel Cohen-Or , Baoquan Chen , Changhe Tu

Convolutional Networks with Oriented 1D Kernels

In computer vision, 2D convolution is arguably the most important operation performed by a ConvNet. Unsurprisingly, it has been the focus of intense software and hardware optimization and enjoys highly efficient implementations. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-09-28 Alexandre Kirchmeyer , Jia Deng

Active Convolution: Learning the Shape of Convolution for Image Classification

In recent years, deep learning has achieved great success in many computer vision applications. Convolutional neural networks (CNNs) have lately emerged as a major approach to image classification. Most research on CNNs thus far has focused…

Computer Vision and Pattern Recognition · Computer Science 2017-03-28 Yunho Jeon , Junmo Kim

PointConv: Deep Convolutional Networks on 3D Point Clouds

Unlike images which are represented in regular dense grids, 3D point clouds are irregular and unordered, hence applying convolution on them can be difficult. In this paper, we extend the dynamic filter to a new convolution operation, named…

Computer Vision and Pattern Recognition · Computer Science 2020-11-11 Wenxuan Wu , Zhongang Qi , Li Fuxin

CondConv: Conditionally Parameterized Convolutions for Efficient Inference

Convolutional layers are one of the basic building blocks of modern deep neural networks. One fundamental assumption is that convolutional kernels should be shared for all examples in a dataset. We propose conditionally parameterized…

Computer Vision and Pattern Recognition · Computer Science 2020-09-07 Brandon Yang , Gabriel Bender , Quoc V. Le , Jiquan Ngiam

KernelWarehouse: Rethinking the Design of Dynamic Convolution

Dynamic convolution learns a linear mixture of n static kernels weighted with their input-dependent attentions, demonstrating superior performance than normal convolution. However, it increases the number of convolutional parameters by n…

Computer Vision and Pattern Recognition · Computer Science 2024-06-13 Chao Li , Anbang Yao

CKConv: Continuous Kernel Convolution For Sequential Data

Conventional neural architectures for sequential data present important limitations. Recurrent networks suffer from exploding and vanishing gradients, small effective memory horizons, and must be trained sequentially. Convolutional networks…

Machine Learning · Computer Science 2022-03-18 David W. Romero , Anna Kuzina , Erik J. Bekkers , Jakub M. Tomczak , Mark Hoogendoorn