Related papers: Pruning Filters for Efficient ConvNets

Faster CNNs with Direct Sparse Convolutions and Guided Pruning

Phenomenally successful in practical inference problems, convolutional neural networks (CNN) are widely deployed in mobile devices, data centers, and even supercomputers. The number of parameters needed in CNNs, however, are often large and…

Computer Vision and Pattern Recognition · Computer Science 2017-08-01 Jongsoo Park , Sheng Li , Wei Wen , Ping Tak Peter Tang , Hai Li , Yiran Chen , Pradeep Dubey

Efficient Inference of CNNs via Channel Pruning

The deployment of Convolutional Neural Networks (CNNs) on resource constrained platforms such as mobile devices and embedded systems has been greatly hindered by their high implementation cost, and thus motivated a lot research interest in…

Computer Vision and Pattern Recognition · Computer Science 2019-08-12 Boyu Zhang , Azadeh Davoodi , Yu Hen Hu

Efficient CNNs via Passive Filter Pruning

Convolutional neural networks (CNNs) have shown state-of-the-art performance in various applications. However, CNNs are resource-hungry due to their requirement of high computational complexity and memory storage. Recent efforts toward…

Machine Learning · Computer Science 2025-08-27 Arshdeep Singh , Mark D. Plumbley

Building Efficient ConvNets using Redundant Feature Pruning

This paper presents an efficient technique to prune deep and/or wide convolutional neural network models by eliminating redundant features (or filters). Previous studies have shown that over-sized deep neural network models tend to produce…

Computer Vision and Pattern Recognition · Computer Science 2018-02-22 Babajide O. Ayinde , Jacek M. Zurada

Stability Based Filter Pruning for Accelerating Deep CNNs

Convolutional neural networks (CNN) have achieved impressive performance on the wide variety of tasks (classification, detection, etc.) across multiple domains at the cost of high computational and memory requirements. Thus, leveraging CNNs…

Computer Vision and Pattern Recognition · Computer Science 2018-11-21 Pravendra Singh , Vinay Sameer Raja Kadi , Nikhil Verma , Vinay P. Namboodiri

Compact Deep Convolutional Neural Networks With Coarse Pruning

The learning capability of a neural network improves with increasing depth at higher computational costs. Wider layers with dense kernel connectivity patterns furhter increase this cost and may hinder real-time inference. We propose feature…

Machine Learning · Computer Science 2016-11-01 Sajid Anwar , Wonyong Sung

Convolutional Neural Network Pruning Using Filter Attenuation

Filters are the essential elements in convolutional neural networks (CNNs). Filters are corresponded to the feature maps and form the main part of the computational and memory requirement for the CNN processing. In filter pruning methods, a…

Computer Vision and Pattern Recognition · Computer Science 2020-02-11 Morteza Mousa-Pasandi , Mohsen Hajabdollahi , Nader Karimi , Shadrokh Samavi , Shahram Shirani

A One-step Pruning-recovery Framework for Acceleration of Convolutional Neural Networks

Acceleration of convolutional neural network has received increasing attention during the past several years. Among various acceleration techniques, filter pruning has its inherent merit by effectively reducing the number of convolution…

Computer Vision and Pattern Recognition · Computer Science 2019-06-19 Dong Wang , Lei Zhou , Xiao Bai , Jun Zhou

Improve Convolutional Neural Network Pruning by Maximizing Filter Variety

Neural network pruning is a widely used strategy for reducing model storage and computing requirements. It allows to lower the complexity of the network by introducing sparsity in the weights. Because taking advantage of sparse matrices is…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Nathan Hubens , Matei Mancas , Bernard Gosselin , Marius Preda , Titus Zaharia

Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks

Resource-efficient convolution neural networks enable not only the intelligence on edge devices but also opportunities in system-level optimization such as scheduling. In this work, we aim to improve the performance of resource-constrained…

Computer Vision and Pattern Recognition · Computer Science 2018-10-19 Ting-Wu Chin , Cha Zhang , Diana Marculescu

Prune the Convolutional Neural Networks with Sparse Shrink

Nowadays, it is still difficult to adapt Convolutional Neural Network (CNN) based models for deployment on embedded devices. The heavy computation and large memory footprint of CNN models become the main burden in real application. In this…

Computer Vision and Pattern Recognition · Computer Science 2017-08-09 Xin Li , Changsong Liu

Dependency Aware Filter Pruning

Convolutional neural networks (CNNs) are typically over-parameterized, bringing considerable computational overhead and memory footprint in inference. Pruning a proportion of unimportant filters is an efficient way to mitigate the inference…

Computer Vision and Pattern Recognition · Computer Science 2020-05-07 Kai Zhao , Xin-Yu Zhang , Qi Han , Ming-Ming Cheng

Quantized Guided Pruning for Efficient Hardware Implementations of Convolutional Neural Networks

Convolutional Neural Networks (CNNs) are state-of-the-art in numerous computer vision tasks such as object classification and detection. However, the large amount of parameters they contain leads to a high computational complexity and…

Machine Learning · Computer Science 2019-01-01 Ghouthi Boukli Hacene , Vincent Gripon , Matthieu Arzel , Nicolas Farrugia , Yoshua Bengio

Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers

Model pruning has become a useful technique that improves the computational efficiency of deep learning, making it possible to deploy solutions in resource-limited scenarios. A widely-used practice in relevant work assumes that a…

Machine Learning · Computer Science 2018-02-06 Jianbo Ye , Xin Lu , Zhe Lin , James Z. Wang

Weight Pruning via Adaptive Sparsity Loss

Pruning neural networks has regained interest in recent years as a means to compress state-of-the-art deep neural networks and enable their deployment on resource-constrained devices. In this paper, we propose a robust compressive learning…

Machine Learning · Computer Science 2020-06-05 George Retsinas , Athena Elafrou , Georgios Goumas , Petros Maragos

Online Filter Clustering and Pruning for Efficient Convnets

Pruning filters is an effective method for accelerating deep neural networks (DNNs), but most existing approaches prune filters on a pre-trained network directly which limits in acceleration. Although each filter has its own effect in DNNs,…

Computer Vision and Pattern Recognition · Computer Science 2019-05-29 Zhengguang Zhou , Wengang Zhou , Richang Hong , Houqiang Li

Accelerator-Aware Pruning for Convolutional Neural Networks

Convolutional neural networks have shown tremendous performance capabilities in computer vision tasks, but their excessive amounts of weight storage and arithmetic operations prevent them from being adopted in embedded environments. One of…

Neural and Evolutionary Computing · Computer Science 2020-09-08 Hyeong-Ju Kang

When deep learning models on GPU can be accelerated by taking advantage of unstructured sparsity

This paper is focused on the improvement the efficiency of the sparse convolutional neural networks (CNNs) layers on graphic processing units (GPU). The Nvidia deep neural network (cuDnn) library provides the most effective implementation…

Machine Learning · Computer Science 2022-01-03 Marcin Pietroń , Dominik Żurek

Pruning Convolutional Neural Networks for Resource Efficient Inference

We propose a new formulation for pruning convolutional kernels in neural networks to enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by backpropagation - a computationally efficient procedure that…

Machine Learning · Computer Science 2017-06-12 Pavlo Molchanov , Stephen Tyree , Tero Karras , Timo Aila , Jan Kautz

Comprehensive Online Network Pruning via Learnable Scaling Factors

One of the major challenges in deploying deep neural network architectures is their size which has an adverse effect on their inference time and memory requirements. Deep CNNs can either be pruned width-wise by removing filters based on…

Computer Vision and Pattern Recognition · Computer Science 2020-10-07 Muhammad Umair Haider , Murtaza Taj