Related papers: FlexConv: Continuous Kernel Convolutions with Diff…

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few large convolutional kernels instead of a stack of small…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Xiaohan Ding , Xiangyu Zhang , Yizhuang Zhou , Jungong Han , Guiguang Ding , Jian Sun

CKConv: Continuous Kernel Convolution For Sequential Data

Conventional neural architectures for sequential data present important limitations. Recurrent networks suffer from exploding and vanishing gradients, small effective memory horizons, and must be trained sequentially. Convolutional networks…

Machine Learning · Computer Science 2022-03-18 David W. Romero , Anna Kuzina , Erik J. Bekkers , Jakub M. Tomczak , Mark Hoogendoorn

CondConv: Conditionally Parameterized Convolutions for Efficient Inference

Convolutional layers are one of the basic building blocks of modern deep neural networks. One fundamental assumption is that convolutional kernels should be shared for all examples in a dataset. We propose conditionally parameterized…

Computer Vision and Pattern Recognition · Computer Science 2020-09-07 Brandon Yang , Gabriel Bender , Quoc V. Le , Jiquan Ngiam

Hyper-Convolutions via Implicit Kernels for Medical Imaging

The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares…

Image and Video Processing · Electrical Eng. & Systems 2022-02-08 Tianyu Ma , Alan Q. Wang , Adrian V. Dalca , Mert R. Sabuncu

Fast Training of Convolutional Neural Networks via Kernel Rescaling

Training deep Convolutional Neural Networks (CNN) is a time consuming task that may take weeks to complete. In this article we propose a novel, theoretically founded method for reducing CNN training time without incurring any loss in…

Computer Vision and Pattern Recognition · Computer Science 2016-10-13 Pedro Porto Buarque de Gusmão , Gianluca Francini , Skjalg Lepsøy , Enrico Magli

Hyper-Convolution Networks for Biomedical Image Segmentation

The convolution operation is a central building block of neural network architectures widely used in computer vision. The size of the convolution kernels determines both the expressiveness of convolutional neural networks (CNN), as well as…

Image and Video Processing · Electrical Eng. & Systems 2022-10-10 Tianyu Ma , Adrian V. Dalca , Mert R. Sabuncu

MixConv: Mixed Depthwise Convolutional Kernels

Depthwise convolution is becoming increasingly popular in modern efficient ConvNets, but its kernel size is often overlooked. In this paper, we systematically study the impact of different kernel sizes, and observe that combining the…

Computer Vision and Pattern Recognition · Computer Science 2019-12-03 Mingxing Tan , Quoc V. Le

$ShiftwiseConv:$ Small Convolutional Kernel with Large Kernel Effect

Large kernels make standard convolutional neural networks (CNNs) great again over transformer architectures in various vision tasks. Nonetheless, recent studies meticulously designed around increasing kernel size have shown diminishing…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Dachong Li , Li Li , Zhuangzhuang Chen , Jianqiang Li

LDConv: Linear deformable convolution for improving convolutional neural networks

Neural networks based on convolutional operations have achieved remarkable results in the field of deep learning, but there are two inherent flaws in standard convolutional operations. On the one hand, the convolution operation is confined…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Xin Zhang , Yingze Song , Tingting Song , Degang Yang , Yichen Ye , Jie Zhou , Liming Zhang

HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs

We present a novel deep learning architecture in which the convolution operation leverages heterogeneous kernels. The proposed HetConv (Heterogeneous Kernel-Based Convolution) reduces the computation (FLOPs) and the number of parameters as…

Computer Vision and Pattern Recognition · Computer Science 2019-03-26 Pravendra Singh , Vinay Kumar Verma , Piyush Rai , Vinay P. Namboodiri

Dynamic Convolution: Attention over Convolution Kernels

Light-weight convolutional neural networks (CNNs) suffer performance degradation as their low computational budgets constrain both the depth (number of convolution layers) and the width (number of channels) of CNNs, resulting in limited…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Yinpeng Chen , Xiyang Dai , Mengchen Liu , Dongdong Chen , Lu Yuan , Zicheng Liu

Dilated convolution with learnable spacings

Recent works indicate that convolutional neural networks (CNN) need large receptive fields (RF) to compete with visual transformers and their attention mechanism. In CNNs, RFs can simply be enlarged by increasing the convolution kernel…

Computer Vision and Pattern Recognition · Computer Science 2023-05-12 Ismail Khalfaoui-Hassani , Thomas Pellegrini , Timothée Masquelier

More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity

Transformers have quickly shined in the computer vision world since the emergence of Vision Transformers (ViTs). The dominant role of convolutional neural networks (CNNs) seems to be challenged by increasingly effective transformer-based…

Computer Vision and Pattern Recognition · Computer Science 2023-03-07 Shiwei Liu , Tianlong Chen , Xiaohan Chen , Xuxi Chen , Qiao Xiao , Boqian Wu , Tommi Kärkkäinen , Mykola Pechenizkiy , Decebal Mocanu , Zhangyang Wang

LeanConvNets: Low-cost Yet Effective Convolutional Neural Networks

Convolutional Neural Networks (CNNs) have become indispensable for solving machine learning tasks in speech recognition, computer vision, and other areas that involve high-dimensional data. A CNN filters the input feature using a network…

Machine Learning · Computer Science 2020-02-13 Jonathan Ephrath , Moshe Eliasof , Lars Ruthotto , Eldad Haber , Eran Treister

PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer

Despite their strong modeling capacities, Convolutional Neural Networks (CNNs) are often scale-sensitive. For enhancing the robustness of CNNs to scale variance, multi-scale feature fusion from different layers or filters attracts great…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Duo Li , Anbang Yao , Qifeng Chen

Adaptive Convolution Kernel for Artificial Neural Networks

Many deep neural networks are built by using stacked convolutional layers of fixed and single size (often 3$\times$3) kernels. This paper describes a method for training the size of convolutional kernels to provide varying size kernels in a…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 F. Boray Tek , İlker Çam , Deniz Karlı

Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations

This paper proposes the paradigm of large convolutional kernels in designing modern Convolutional Neural Networks (ConvNets). We establish that employing a few large kernels, instead of stacking multiple smaller ones, can be a superior…

Computer Vision and Pattern Recognition · Computer Science 2024-10-11 Yiyuan Zhang , Xiaohan Ding , Xiangyu Yue

Kernel Modulation: A Parameter-Efficient Method for Training Convolutional Neural Networks

Deep Neural Networks, particularly Convolutional Neural Networks (ConvNets), have achieved incredible success in many vision tasks, but they usually require millions of parameters for good accuracy performance. With increasing applications…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Yuhuang Hu , Shih-Chii Liu

Modelling Long Range Dependencies in $N$D: From Task-Specific to a General Purpose CNN

Performant Convolutional Neural Network (CNN) architectures must be tailored to specific tasks in order to consider the length, resolution, and dimensionality of the input data. In this work, we tackle the need for problem-specific CNN…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 David M. Knigge , David W. Romero , Albert Gu , Efstratios Gavves , Erik J. Bekkers , Jakub M. Tomczak , Mark Hoogendoorn , Jan-Jakob Sonke

Finding One's Bearings in the Hyperparameter Landscape of a Wide-Kernel Convolutional Fault Detector

State-of-the-art algorithms are reported to be almost perfect at distinguishing the vibrations arising from healthy and damaged machine bearings, according to benchmark datasets at least. However, what about their application to new data?…

Machine Learning · Computer Science 2025-05-19 Dan Hudson , Jurgen van den Hoogen , Martin Atzmueller