Related papers: VarGNet: Variable Group Convolutional Neural Netwo…

Patch Is Not All You Need

Vision Transformers have achieved great success in computer visions, delivering exceptional performance across various tasks. However, their inherent reliance on sequential input enforces the manual partitioning of images into patch…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Changzhen Li , Jie Zhang , Yang Wei , Zhilong Ji , Jinfeng Bai , Shiguang Shan

Algorithm and Hardware Co-Design for Efficient Complex-Valued Uncertainty Estimation

Complex-Valued Neural Networks (CVNNs) have significant advantages in handling tasks that involve complex numbers. However, existing CVNNs are unable to quantify predictive uncertainty. We propose, for the first time, dropout-based Bayesian…

Hardware Architecture · Computer Science 2026-04-23 Zehuan Zhang , Mark Chen , He Li , Wayne Luk

Accelerated Rotation-Invariant Convolution for UAV Image Segmentation

Rotation invariance is essential for precise, object-level segmentation in UAV aerial imagery, where targets can have arbitrary orientations and exhibit fine-scale details. Conventional segmentation architectures like U-Net rely on…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Manduhu Manduhu , Alexander Dow , Gerard Dooly , James Riordan

A Quantum Convolutional Neural Network for Image Classification

Artificial neural networks have achieved great success in many fields ranging from image recognition to video understanding. However, its high requirements for computing and memory resources have limited further development on processing…

Quantum Physics · Physics 2021-08-05 Yanxuan Lü , Qing Gao , Jinhu Lü , Maciej Ogorzałek , Jin Zheng

Multi-level Wavelet Convolutional Neural Networks

In computer vision, convolutional networks (CNNs) often adopts pooling to enlarge receptive field which has the advantage of low computational complexity. However, pooling can cause information loss and thus is detrimental to further…

Computer Vision and Pattern Recognition · Computer Science 2019-07-09 Pengju Liu , Hongzhi Zhang , Wei Lian , Wangmeng Zuo

The Geometry of Cortical Computation: Manifold Disentanglement and Predictive Dynamics in VCNet

Despite their success, modern convolutional neural networks (CNNs) exhibit fundamental limitations, including data inefficiency, poor out-of-distribution generalization, and vulnerability to adversarial perturbations. These shortcomings can…

Neural and Evolutionary Computing · Computer Science 2025-11-25 Brennen A. Hill , Zhang Xinyu , Timothy Putra Prasetio

Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks

Very deep convolutional neural networks (CNNs) have been firmly established as the primary methods for many computer vision tasks. However, most state-of-the-art CNNs are large, which results in high inference latency. Recently, depth-wise…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Yihui He , Jianing Qian , Jianren Wang , Cindy X. Le , Congrui Hetang , Qi Lyu , Wenping Wang , Tianwei Yue

Fusing Deep Convolutional Networks for Large Scale Visual Concept Classification

Deep learning architectures are showing great promise in various computer vision domains including image classification, object detection, event detection and action recognition. In this study, we investigate various aspects of…

Computer Vision and Pattern Recognition · Computer Science 2016-08-08 Hilal Ergun , Mustafa Sert

An efficient and flexible inference system for serving heterogeneous ensembles of deep neural networks

Ensembles of Deep Neural Networks (DNNs) have achieved qualitative predictions but they are computing and memory intensive. Therefore, the demand is growing to make them answer a heavy workload of requests with available computational…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-31 Pierrick Pochelu , Serge G. Petiton , Bruno Conche

An Efficient 3D Convolutional Neural Network with Channel-wise, Spatial-grouped, and Temporal Convolutions

There has been huge progress on video action recognition in recent years. However, many works focus on tweaking existing 2D backbones due to the reliance of ImageNet pretraining, which restrains the models from achieving higher efficiency…

Computer Vision and Pattern Recognition · Computer Science 2025-03-05 Zhe Wang , Xulei Yang

Wavelet Channel Attention Module with a Fusion Network for Single Image Deraining

Single image deraining is a crucial problem because rain severely degenerates the visibility of images and affects the performance of computer vision tasks like outdoor surveillance systems and intelligent vehicles. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Hao-Hsiang Yang , Chao-Han Huck Yang , Yu-Chiang Frank Wang

Random 2.5D U-net for Fully 3D Segmentation

Convolutional neural networks are state-of-the-art for various segmentation tasks. While for 2D images these networks are also computationally efficient, 3D convolutions have huge storage requirements and therefore, end-to-end training is…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Christoph Angermann , Markus Haltmeier

Binarized Convolutional Neural Networks for Efficient Inference on GPUs

Convolutional neural networks have recently achieved significant breakthroughs in various image classification tasks. However, they are computationally expensive,which can make their feasible mplementation on embedded and low-power devices…

Machine Learning · Computer Science 2018-08-02 Mir Khan , Heikki Huttunen , Jani Boutellier

Warped Convolutions: Efficient Invariance to Spatial Transformations

Convolutional Neural Networks (CNNs) are extremely efficient, since they exploit the inherent translation-invariance of natural images. However, translation is just one of a myriad of useful spatial transformations. Can the same efficiency…

Computer Vision and Pattern Recognition · Computer Science 2021-12-02 João F. Henriques , Andrea Vedaldi

Accelerating Very Deep Convolutional Networks for Classification and Detection

This paper aims to accelerate the test-time computation of convolutional neural networks (CNNs), especially very deep CNNs that have substantially impacted the computer vision community. Unlike previous methods that are designed for…

Computer Vision and Pattern Recognition · Computer Science 2015-11-19 Xiangyu Zhang , Jianhua Zou , Kaiming He , Jian Sun

Convolutional Neural Network with Convolutional Block Attention Module for Finger Vein Recognition

Convolutional neural networks have become a popular research in the field of finger vein recognition because of their powerful image feature representation. However, most researchers focus on improving the performance of the network by…

Computer Vision and Pattern Recognition · Computer Science 2022-02-15 Zhongxia Zhang , Mingwen Wang

Hyper-Convolutions via Implicit Kernels for Medical Imaging

The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares…

Image and Video Processing · Electrical Eng. & Systems 2022-02-08 Tianyu Ma , Alan Q. Wang , Adrian V. Dalca , Mert R. Sabuncu

Deep Clustered Convolutional Kernels

Deep neural networks have recently achieved state of the art performance thanks to new training algorithms for rapid parameter estimation and new regularization methods to reduce overfitting. However, in practice the network architecture…

Machine Learning · Computer Science 2016-03-04 Minyoung Kim , Luca Rigazio

HeteGCN: Heterogeneous Graph Convolutional Networks for Text Classification

We consider the problem of learning efficient and inductive graph convolutional networks for text classification with a large number of examples and features. Existing state-of-the-art graph embedding based methods such as predictive text…

Computation and Language · Computer Science 2020-09-01 Rahul Ragesh , Sundararajan Sellamanickam , Arun Iyer , Ram Bairi , Vijay Lingam

IC Networks: Remodeling the Basic Unit for Convolutional Neural Networks

Convolutional neural network (CNN) is a class of artificial neural networks widely used in computer vision tasks. Most CNNs achieve excellent performance by stacking certain types of basic units. In addition to increasing the depth and…

Computer Vision and Pattern Recognition · Computer Science 2021-02-09 Junyi An , Fengshan Liu , Jian Zhao , Furao Shen