English
Related papers

Related papers: Optimizing Winograd Convolution on ARMv8 processor…

200 papers

Convolutional Neural Networks (CNNs) have gained widespread popularity in the field of computer vision and image processing. Due to huge computational requirements of CNNs, dedicated hardware-based implementations are being explored to…

Signal Processing · Electrical Eng. & Systems 2019-03-06 Afzal Ahmad , Muhammad Adeel Pasha

The Winograd or Cook-Toom class of algorithms help to reduce the overall compute complexity of many modern deep convolutional neural networks (CNNs). Although there has been a lot of research done on model and algorithmic optimization of…

Machine Learning · Computer Science 2019-03-06 Partha Maji , Andrew Mundy , Ganesh Dasika , Jesse Beu , Matthew Mattina , Robert Mullins

Convolution is the core operation for many deep neural networks. The Winograd convolution algorithms have been shown to accelerate the widely-used small convolution sizes. Quantized neural networks can effectively reduce model sizes and…

Neural and Evolutionary Computing · Computer Science 2019-01-09 Lingchuan Meng , John Brothers

The combination of Winograd's algorithm and systolic array architecture has demonstrated the capability of improving DSP efficiency in accelerating convolutional neural networks (CNNs) on FPGA platforms. However, handling arbitrary…

Hardware Architecture · Computer Science 2021-07-12 Xinheng Liu , Yao Chen , Cong Hao , Ashutosh Dhar , Deming Chen

Recent literature has shown that convolutional neural networks (CNNs) with large kernels outperform vision transformers (ViTs) and CNNs with stacked small kernels in many computer vision tasks, such as object detection and image…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Jingbo Jiang , Xizi Chen , Chi-Ying Tsui

Generative adversarial networks (GANs) have shown excellent performance in image and speech applications. GANs create impressive data primarily through a new type of operator called deconvolution (DeConv) or transposed convolution (Conv).…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-11-19 Jung-Woo Chang , Saehyun Ahn , Keon-Woo Kang , Suk-Ju Kang

Deep convolutional neural networks (ConvNets) of 3-dimensional kernels allow joint modeling of spatiotemporal features. These networks have improved performance of video and volumetric image analysis, but have been limited in size due to…

Computer Vision and Pattern Recognition · Computer Science 2017-06-13 David Budden , Alexander Matveev , Shibani Santurkar , Shraman Ray Chaudhuri , Nir Shavit

Lightweight architectural designs of Convolutional Neural Networks (CNNs) together with quantization have paved the way for the deployment of demanding computer vision applications on mobile devices. Parallel to this, alternative…

Machine Learning · Computer Science 2020-03-24 Javier Fernandez-Marques , Paul N. Whatmough , Andrew Mundy , Matthew Mattina

Adder neural network (AdderNet) is a new kind of deep model that replaces the original massive multiplications in convolutions by additions while preserving the high performance. Since the hardware complexity of additions is much lower than…

Machine Learning · Computer Science 2021-05-13 Wenshuo Li , Hanting Chen , Mingqiang Huang , Xinghao Chen , Chunjing Xu , Yunhe Wang

Convolutional Neural Network (CNN) has been widely used in various fields and played an important role. Convolution operators are the fundamental component of convolutional neural networks, and it is also the most time-consuming part of…

Artificial Intelligence · Computer Science 2021-11-02 Gan Tong , Libo Huang

Deep Convolutional Neural Networks have become a Swiss knife in solving critical artificial intelligence tasks. However, deploying deep CNN models for latency-critical tasks remains to be challenging because of the complex nature of CNNs.…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Chuanhao Zhuge , Xinheng Liu , Xiaofan Zhang , Sudeep Gummadi , Jinjun Xiong , Deming Chen

Winograd convolution is originally proposed to reduce the computing overhead by converting multiplication in neural network (NN) with addition via linear transformation. Other than the computing efficiency, we observe its great potential in…

Machine Learning · Computer Science 2022-02-18 Xinghua Xue , Haitong Huang , Cheng Liu , Ying Wang , Tao Luo , Lei Zhang

Neural Networks (NNs) have become the mainstream technology in the artificial intelligence (AI) renaissance over the past decade. Among different types of neural networks, convolutional neural networks (CNNs) have been widely adopted as…

Emerging Technologies · Computer Science 2019-12-05 Armin Mehrabian , Mario Miscuglio , Yousra Alkabani , Volker J. Sorger , Tarek El-Ghazawi

Winograd's minimal filtering algorithm has been widely used in Convolutional Neural Networks (CNNs) to reduce the number of multiplications for faster processing. However, it is only effective on convolutions with kernel size as 3x3 and…

Machine Learning · Computer Science 2020-02-06 Di Huang , Xishan Zhang , Rui Zhang , Tian Zhi , Deyuan He , Jiaming Guo , Chang Liu , Qi Guo , Zidong Du , Shaoli Liu , Tianshi Chen , Yunji Chen

Prior research has shown that Winograd algorithm can reduce the computational complexity of convolutional neural networks (CNN) with weights and activations represented in floating point. However it is difficult to apply the scheme to the…

Machine Learning · Computer Science 2020-07-27 Zhi-Gang Liu , Matthew Mattina

Convolution is the most time-consuming operation in deep neural network operations, so its performance is critical to the overall performance of the neural network. The commonly used methods for convolution on GPU include the general matrix…

Neural and Evolutionary Computing · Computer Science 2023-06-27 Shuai Lu , Jun Chu , Luanzheng Guo , Xu T. Liu

CPU-based inference can be an alternative to off-chip accelerators, and vector architectures are a promising option due to their efficiency. However, the large design space of convolutional algorithms and hardware implementations makes it…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-12-23 Sonia Rani Gupta , Nikela Papadopoulou , Miquel Pericas

We evolve PyDTNN, a framework for distributed parallel training of Deep Neural Networks (DNNs), into an efficient inference tool for convolutional neural networks. Our optimization process on multicore ARM processors involves several…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-05-20 Adrián Castelló , Sergio Barrachina , Manuel F. Dolz , Enrique S. Quintana-Ortí , Pau San Juan

Accelerating deep convolutional neural networks has become an active topic and sparked an interest in academia and industry. In this paper, we propose an efficient low-precision quantized Winograd convolution algorithm, called LANCE, which…

Computer Vision and Pattern Recognition · Computer Science 2020-07-29 Guangli Li , Lei Liu , Xueying Wang , Xiu Ma , Xiaobing Feng

Sparse methods and the use of Winograd convolutions are two orthogonal approaches, each of which significantly accelerates convolution computations in modern CNNs. Sparse Winograd merges these two and thus has the potential to offer a…

Computer Vision and Pattern Recognition · Computer Science 2017-10-17 Sheng Li , Jongsoo Park , Ping Tak Peter Tang
‹ Prev 1 2 3 10 Next ›