Related papers: Partial Convolution Meets Visual Attention

200 papers

Designing a module or mechanism that enables a network to maintain low parameters and FLOPs without sacrificing accuracy and throughput remains a challenge. To address this challenge and exploit the redundancy within feature map channels,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-04 Haiduo Huang , Tian Xia , Wenzhe Zhao , Pengju Ren

Vision Transformer (ViT) has prevailed in computer vision tasks due to its strong long-range dependency modelling ability. However, its large model size and weak local feature modeling ability hinder its application in real…

Computer Vision and Pattern Recognition · Computer Science 2025-09-12 Yi Zhang , Lingxiao Wei , Bowei Zhang , Ziwei Liu , Kai Yi , Shu Hu

Transformers have shown great potential in various computer vision tasks. By borrowing design concepts from transformers, many studies revolutionized CNNs and showed remarkable results. This paper falls in this line of studies.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Ruihan Xu , Haokui Zhang , Wenze Hu , Shiliang Zhang , Xiaoyu Wang

We design a new family of hybrid CNN-ViT neural networks, named FasterViT, with a focus on high image throughput for computer vision (CV) applications. FasterViT combines the benefits of fast local representation learning in CNNs and global…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Ali Hatamizadeh , Greg Heinrich , Hongxu Yin , Andrew Tao , Jose M. Alvarez , Jan Kautz , Pavlo Molchanov

Medical image segmentation can provide detailed information for clinical analysis which can be useful for scenarios where the detailed location of a finding is important. Knowing the location of disease can play a vital role in treatment…

Image and Video Processing · Electrical Eng. & Systems 2021-11-23 Abhishek Srivastava , Sukalpa Chanda , Debesh Jha , Michael A. Riegler , Pål Halvorsen , Dag Johansen , Umapada Pal

Recently, vision transformers started to show impressive results which outperform large convolution based models significantly. However, in the area of small models for mobile or resource constrained devices, ConvNet still has its own…

Computer Vision and Pattern Recognition · Computer Science 2022-07-27 Haokui Zhang , Wenze Hu , Xiaoyu Wang

Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation however has a significant weakness in that it only operates on a local neighborhood, thus missing global information.…

Computer Vision and Pattern Recognition · Computer Science 2020-09-11 Irwan Bello , Barret Zoph , Ashish Vaswani , Jonathon Shlens , Quoc V. Le

Transformer models have made tremendous progress in various fields in recent years. In the field of computer vision, vision transformers (ViTs) have also become strong alternatives to convolutional neural networks (ConvNets), yet they have not…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Tao Yang , Haokui Zhang , Wenze Hu , Changwen Chen , Xiaoyu Wang

This work aims at designing a lightweight convolutional neural network for image super resolution (SR). With simplicity in mind, we construct a pretty concise and effective network with a newly proposed pixel attention scheme. Pixel…

Image and Video Processing · Electrical Eng. & Systems 2020-10-05 Hengyuan Zhao , Xiangtao Kong , Jingwen He , Yu Qiao , Chao Dong

Self-attention (SA) has become the cornerstone of modern vision backbones for its powerful expressivity over traditional Convolutions (Conv). However, its quadratic complexity remains a critical bottleneck for practical applications. Given…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Hao Yu , Haoyu Chen , Yan Jiang , Wei Peng , Zhaodong Sun , Samuel Kaski , Guoying Zhao

We present in this paper a new architecture, the Pattern Attention Transformer (PAT), that is composed of the new doughnut kernel. Compared with tokens in the NLP field, Transformer in computer vision has the problem of handling the high…

Computer Vision and Pattern Recognition · Computer Science 2023-09-19 WenYuan Sheng

This paper introduces ConvShareViT, a novel deep learning architecture that adapts Vision Transformers (ViTs) to the 4f free-space optical system. ConvShareViT replaces linear layers in multi-head self-attention (MHSA) and Multilayer…

Computer Vision and Pattern Recognition · Computer Science 2025-04-17 Riad Ibadulla , Thomas M. Chen , Constantino Carlos Reyes-Aldasoro

Light-weight convolutional neural networks (CNNs) are specially designed for applications on mobile devices with faster inference speed. The convolutional operation can only capture local information in a window region, which prevents…

Computer Vision and Pattern Recognition · Computer Science 2022-11-24 Yehui Tang , Kai Han , Jianyuan Guo , Chang Xu , Chao Xu , Yunhe Wang

3D to 2D retinal vessel segmentation is a challenging problem in Optical Coherence Tomography Angiography (OCTA) images. Accurate retinal vessel segmentation is important for the diagnosis and prevention of ophthalmic diseases. However,…

Image and Video Processing · Electrical Eng. & Systems 2021-12-17 Zhuojie Wu , Zijian Wang , Wenxuan Zou , Fan Ji , Hao Dang , Wanting Zhou , Muyi Sun

This paper introduces a novel attention mechanism, called dual attention, which is both efficient and effective. The dual attention mechanism consists of two parallel components: local attention generated by Convolutional Neural Networks…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Zhengkai Jiang , Liang Liu , Jiangning Zhang , Yabiao Wang , Mingang Chen , Chengjie Wang

Humans can effectively find salient regions in complex scenes. Self-attention mechanisms were introduced into Computer Vision (CV) to achieve this. Attention Augmented Convolutional Network (AANet) is a mixture of convolution and…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Runqing Zhang , Tianshu Zhu

Convolutional layers are a major driving force behind the successes of deep learning. Pointwise convolution (PWC) is a 1x1 convolutional filter that is primarily used for parameter reduction. However, the PWC ignores the spatial information…

Computer Vision and Pattern Recognition · Computer Science 2020-02-07 Pratik Mazumder , Pravendra Singh , Vinay Namboodiri

Deep neural networks face several challenges in hyperspectral image classification, including insufficient utilization of joint spatial-spectral information, gradient vanishing with increasing depth, and overfitting. To enhance feature…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Guandong Li , Mengxia Ye

To design fast neural networks, many works have been focusing on reducing the number of floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does not necessarily lead to a similar level of reduction in…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Jierun Chen , Shiu-hong Kao , Hao He , Weipeng Zhuo , Song Wen , Chul-Ho Lee , S. -H. Gary Chan

In NLP, convolutional neural networks (CNNs) have benefited less than recurrent neural networks (RNNs) from attention mechanisms. We hypothesize that this is because the attention in CNNs has been mainly implemented as attentive pooling…

Computation and Language · Computer Science 2018-11-14 Wenpeng Yin , Hinrich Schütze