Related papers: DWM: A Decomposable Winograd Method for Convolutio…

Accelerating Large Kernel Convolutions with Nested Winograd Transformation

Recent literature has shown that convolutional neural networks (CNNs) with large kernels outperform vision transformers (ViTs) and CNNs with stacked small kernels in many computer vision tasks, such as object detection and image…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Jingbo Jiang, Xizi Chen, Chi-Ying Tsui

Winograd Convolution for Deep Neural Networks: Efficient Point Selection

Convolutional neural networks (CNNs) have dramatically improved the accuracy of tasks such as object recognition, image segmentation and interactive speech systems. CNNs require large amounts of computing resources because of computationally…

Computer Vision and Pattern Recognition · Computer Science 2022-01-26 Syed Asad Alam, Andrew Anderson, Barbara Barabasz, David Gregg

Efficient Winograd Convolution via Integer Arithmetic

Convolution is the core operation for many deep neural networks. The Winograd convolution algorithms have been shown to accelerate the widely-used small convolution sizes. Quantized neural networks can effectively reduce model sizes and…

Neural and Evolutionary Computing · Computer Science 2019-01-09 Lingchuan Meng, John Brothers

Fast Convolution based on Winograd Minimum Filtering: Introduction and Development

Convolutional Neural Network (CNN) has been widely used in various fields and played an important role. Convolution operators are the fundamental component of convolutional neural networks, and it is also the most time-consuming part of…

Artificial Intelligence · Computer Science 2021-11-02 Gan Tong, Libo Huang

Efficient Sparse-Winograd Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are computationally intensive, which limits their application on mobile devices. Their energy is dominated by the number of multiplies needed to perform the convolutions. Winograd's minimal filtering…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Xingyu Liu, Jeff Pool, Song Han, William J. Dally

Towards Design Space Exploration and Optimization of Fast Algorithms for Convolutional Neural Networks (CNNs) on FPGAs

Convolutional Neural Networks (CNNs) have gained widespread popularity in the field of computer vision and image processing. Due to huge computational requirements of CNNs, dedicated hardware-based implementations are being explored to…

Signal Processing · Electrical Eng. & Systems 2019-03-06 Afzal Ahmad, Muhammad Adeel Pasha

Searching for Winograd-aware Quantized Networks

Lightweight architectural designs of Convolutional Neural Networks (CNNs) together with quantization have paved the way for the deployment of demanding computer vision applications on mobile devices. Parallel to this, alternative…

Machine Learning · Computer Science 2020-03-24 Javier Fernandez-Marques, Paul N. Whatmough, Andrew Mundy, Matthew Mattina

Towards Design Methodology of Efficient Fast Algorithms for Accelerating Generative Adversarial Networks on FPGAs

Generative adversarial networks (GANs) have shown excellent performance in image and speech applications. GANs create impressive data primarily through a new type of operator called deconvolution (DeConv) or transposed convolution (Conv).…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-11-19 Jung-Woo Chang, Saehyun Ahn, Keon-Woo Kang, Suk-Ju Kang

Efficient Residue Number System Based Winograd Convolution

Prior research has shown that the Winograd algorithm can reduce the computational complexity of convolutional neural networks (CNNs) with weights and activations represented in floating point. However, it is difficult to apply the scheme to the…

Machine Learning · Computer Science 2020-07-27 Zhi-Gang Liu, Matthew Mattina

WinoCNN: Kernel Sharing Winograd Systolic Array for Efficient Convolutional Neural Network Acceleration on FPGAs

The combination of Winograd's algorithm and systolic array architecture has demonstrated the capability of improving DSP efficiency in accelerating convolutional neural networks (CNNs) on FPGA platforms. However, handling arbitrary…

Hardware Architecture · Computer Science 2021-07-12 Xinheng Liu, Yao Chen, Cong Hao, Ashutosh Dhar, Deming Chen

Hardware-Efficient Structure of the Accelerating Module for Implementation of Convolutional Neural Network Basic Operation

This paper presents a structural design of the hardware-efficient module for implementation of convolution neural network (CNN) basic operation with reduced implementation complexity. For this purpose we utilize some modification of the…

Signal Processing · Electrical Eng. & Systems 2018-11-09 Aleksandr Cariow, Galina Cariowa

Optimizing Winograd Convolution on ARMv8 processors

As Convolutional Neural Networks (CNNs) gain prominence in deep learning, algorithms like Winograd Convolution have been introduced to enhance computational efficiency. However, existing implementations often face challenges such as high…

Performance · Computer Science 2024-12-30 Haoyuan Gui, Xiaoyu Zhang, Chong Zhang, Zitong Su, Huiyuan Li

Exploring Winograd Convolution for Cost-effective Neural Network Fault Tolerance

Winograd is generally utilized to optimize convolution performance and computational efficiency because of the reduced multiplication operations, but the reliability issues brought by Winograd are usually overlooked. In this work, we…

Machine Learning · Computer Science 2023-08-17 Xinghua Xue, Cheng Liu, Bo Liu, Haitong Huang, Ying Wang, Tao Luo, Lei Zhang, Huawei Li, Xiaowei Li

I/O Lower Bounds for Auto-tuning of Convolutions in CNNs

Convolution is the most time-consuming part in the computation of convolutional neural networks (CNNs), which have achieved great successes in numerous applications. Due to the complex data dependency and the increase in the amount of model…

Machine Learning · Computer Science 2021-01-01 Xiaoyang Zhang, Junmin Xiao, Guangming Tan

Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks

Popular deep neural networks (DNNs) spend the majority of their execution time computing convolutions. The Winograd family of algorithms can greatly reduce the number of arithmetic operations required and is present in many DNN software…

Numerical Analysis · Computer Science 2019-05-03 Barbara Barabasz, Andrew Anderson, Kirk M. Soodhalter, David Gregg

Winograd Convolution: A Perspective from Fault Tolerance

Winograd convolution was originally proposed to reduce the computing overhead by replacing multiplications in neural networks (NNs) with additions via linear transformation. Other than the computing efficiency, we observe its great potential in…

Machine Learning · Computer Science 2022-02-18 Xinghua Xue, Haitong Huang, Cheng Liu, Ying Wang, Tao Luo, Lei Zhang

Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tile

Most of today's computer vision pipelines are built around deep neural networks, where convolution operations require most of the generally high compute effort. The Winograd convolution algorithm computes convolutions with fewer MACs…

Hardware Architecture · Computer Science 2022-09-28 Renzo Andri, Beatrice Bussolino, Antonio Cipolletta, Lukas Cavigelli, Zhe Wang

Spatial-Winograd Pruning Enabling Sparse Winograd Convolution

Deep convolutional neural networks (CNNs) are deployed in various applications but demand immense computational requirements. Pruning techniques and Winograd convolution are two typical methods to reduce the CNN computation. However, they…

Computer Vision and Pattern Recognition · Computer Science 2019-01-09 Jiecao Yu, Jongsoo Park, Maxim Naumov

Winograd Convolution for DNNs: Beyond linear polynomials

Winograd convolution is widely used in deep neural networks (DNNs). Existing work for DNNs considers only the subset Winograd algorithms that are equivalent to Toom-Cook convolution. We investigate a wider range of Winograd algorithms for…

Machine Learning · Computer Science 2019-06-26 Barbara Barabasz, David Gregg

SFC: Achieve Accurate Fast Convolution under Low-precision Arithmetic

Fast convolution algorithms, including Winograd and FFT, can efficiently accelerate convolution operations in deep models. However, these algorithms depend on high-precision arithmetic to maintain inference accuracy, which conflicts with…

Machine Learning · Computer Science 2024-07-04 Liulu He, Yufei Zhao, Rui Gao, Yuan Du, Li Du