Related papers: Optimizing Winograd Convolution on ARMv8 processor…

Towards Design Space Exploration and Optimization of Fast Algorithms for Convolutional Neural Networks (CNNs) on FPGAs

Convolutional Neural Networks (CNNs) have gained widespread popularity in the field of computer vision and image processing. Due to huge computational requirements of CNNs, dedicated hardware-based implementations are being explored to…

Signal Processing · Electrical Eng. & Systems 2019-03-06 Afzal Ahmad , Muhammad Adeel Pasha

Efficient Winograd or Cook-Toom Convolution Kernel Implementation on Widely Used Mobile CPUs

The Winograd or Cook-Toom class of algorithms help to reduce the overall compute complexity of many modern deep convolutional neural networks (CNNs). Although there has been a lot of research done on model and algorithmic optimization of…

Machine Learning · Computer Science 2019-03-06 Partha Maji , Andrew Mundy , Ganesh Dasika , Jesse Beu , Matthew Mattina , Robert Mullins

Efficient Winograd Convolution via Integer Arithmetic

Convolution is the core operation for many deep neural networks. The Winograd convolution algorithms have been shown to accelerate the widely-used small convolution sizes. Quantized neural networks can effectively reduce model sizes and…

Neural and Evolutionary Computing · Computer Science 2019-01-09 Lingchuan Meng , John Brothers

WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs

The combination of Winograd's algorithm and systolic array architecture has demonstrated the capability of improving DSP efficiency in accelerating convolutional neural networks (CNNs) on FPGA platforms. However, handling arbitrary…

Hardware Architecture · Computer Science 2021-07-12 Xinheng Liu , Yao Chen , Cong Hao , Ashutosh Dhar , Deming Chen

Accelerating Large Kernel Convolutions with Nested Winograd Transformation.pdf

Recent literature has shown that convolutional neural networks (CNNs) with large kernels outperform vision transformers (ViTs) and CNNs with stacked small kernels in many computer vision tasks, such as object detection and image…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Jingbo Jiang , Xizi Chen , Chi-Ying Tsui

Towards Design Methodology of Efficient Fast Algorithms for Accelerating Generative Adversarial Networks on FPGAs

Generative adversarial networks (GANs) have shown excellent performance in image and speech applications. GANs create impressive data primarily through a new type of operator called deconvolution (DeConv) or transposed convolution (Conv).…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-11-19 Jung-Woo Chang , Saehyun Ahn , Keon-Woo Kang , Suk-Ju Kang

Deep Tensor Convolution on Multicores

Deep convolutional neural networks (ConvNets) of 3-dimensional kernels allow joint modeling of spatiotemporal features. These networks have improved performance of video and volumetric image analysis, but have been limited in size due to…

Computer Vision and Pattern Recognition · Computer Science 2017-06-13 David Budden , Alexander Matveev , Shibani Santurkar , Shraman Ray Chaudhuri , Nir Shavit

Searching for Winograd-aware Quantized Networks

Lightweight architectural designs of Convolutional Neural Networks (CNNs) together with quantization have paved the way for the deployment of demanding computer vision applications on mobile devices. Parallel to this, alternative…

Machine Learning · Computer Science 2020-03-24 Javier Fernandez-Marques , Paul N. Whatmough , Andrew Mundy , Matthew Mattina

Winograd Algorithm for AdderNet

Adder neural network (AdderNet) is a new kind of deep model that replaces the original massive multiplications in convolutions by additions while preserving the high performance. Since the hardware complexity of additions is much lower than…

Machine Learning · Computer Science 2021-05-13 Wenshuo Li , Hanting Chen , Mingqiang Huang , Xinghao Chen , Chunjing Xu , Yunhe Wang

Fast Convolution based on Winograd Minimum Filtering: Introduction and Development

Convolutional Neural Network (CNN) has been widely used in various fields and played an important role. Convolution operators are the fundamental component of convolutional neural networks, and it is also the most time-consuming part of…

Artificial Intelligence · Computer Science 2021-11-02 Gan Tong , Libo Huang

Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs

Deep Convolutional Neural Networks have become a Swiss knife in solving critical artificial intelligence tasks. However, deploying deep CNN models for latency-critical tasks remains to be challenging because of the complex nature of CNNs.…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Chuanhao Zhuge , Xinheng Liu , Xiaofan Zhang , Sudeep Gummadi , Jinjun Xiong , Deming Chen

Winograd Convolution: A Perspective from Fault Tolerance

Winograd convolution is originally proposed to reduce the computing overhead by converting multiplication in neural network (NN) with addition via linear transformation. Other than the computing efficiency, we observe its great potential in…

Machine Learning · Computer Science 2022-02-18 Xinghua Xue , Haitong Huang , Cheng Liu , Ying Wang , Tao Luo , Lei Zhang

A Winograd-based Integrated Photonics Accelerator for Convolutional Neural Networks

Neural Networks (NNs) have become the mainstream technology in the artificial intelligence (AI) renaissance over the past decade. Among different types of neural networks, convolutional neural networks (CNNs) have been widely adopted as…

Emerging Technologies · Computer Science 2019-12-05 Armin Mehrabian , Mario Miscuglio , Yousra Alkabani , Volker J. Sorger , Tarek El-Ghazawi

DWM: A Decomposable Winograd Method for Convolution Acceleration

Winograd's minimal filtering algorithm has been widely used in Convolutional Neural Networks (CNNs) to reduce the number of multiplications for faster processing. However, it is only effective on convolutions with kernel size as 3x3 and…

Machine Learning · Computer Science 2020-02-06 Di Huang , Xishan Zhang , Rui Zhang , Tian Zhi , Deyuan He , Jiaming Guo , Chang Liu , Qi Guo , Zidong Du , Shaoli Liu , Tianshi Chen , Yunji Chen

Efficient Residue Number System Based Winograd Convolution

Prior research has shown that Winograd algorithm can reduce the computational complexity of convolutional neural networks (CNN) with weights and activations represented in floating point. However it is difficult to apply the scheme to the…

Machine Learning · Computer Science 2020-07-27 Zhi-Gang Liu , Matthew Mattina

Im2win: An Efficient Convolution Paradigm on GPU

Convolution is the most time-consuming operation in deep neural network operations, so its performance is critical to the overall performance of the neural network. The commonly used methods for convolution on GPU include the general matrix…

Neural and Evolutionary Computing · Computer Science 2023-06-27 Shuai Lu , Jun Chu , Luanzheng Guo , Xu T. Liu

Accelerating CNN inference on long vector architectures via co-design

CPU-based inference can be an alternative to off-chip accelerators, and vector architectures are a promising option due to their efficiency. However, the large design space of convolutional algorithms and hardware implementations makes it…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-12-23 Sonia Rani Gupta , Nikela Papadopoulou , Miquel Pericas

High performance and energy efficient inference for deep learning on ARM processors

We evolve PyDTNN, a framework for distributed parallel training of Deep Neural Networks (DNNs), into an efficient inference tool for convolutional neural networks. Our optimization process on multicore ARM processors involves several…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-05-20 Adrián Castelló , Sergio Barrachina , Manuel F. Dolz , Enrique S. Quintana-Ortí , Pau San Juan

LANCE: Efficient Low-Precision Quantized Winograd Convolution for Neural Networks Based on Graphics Processing Units

Accelerating deep convolutional neural networks has become an active topic and sparked an interest in academia and industry. In this paper, we propose an efficient low-precision quantized Winograd convolution algorithm, called LANCE, which…

Computer Vision and Pattern Recognition · Computer Science 2020-07-29 Guangli Li , Lei Liu , Xueying Wang , Xiu Ma , Xiaobing Feng

Enabling Sparse Winograd Convolution by Native Pruning

Sparse methods and the use of Winograd convolutions are two orthogonal approaches, each of which significantly accelerates convolution computations in modern CNNs. Sparse Winograd merges these two and thus has the potential to offer a…

Computer Vision and Pattern Recognition · Computer Science 2017-10-17 Sheng Li , Jongsoo Park , Ping Tak Peter Tang