Related papers: Stability Based Filter Pruning for Accelerating De…

Play and Prune: Adaptive Filter Pruning for Deep Model Compression

While convolutional neural networks (CNN) have achieved impressive performance on various classification/recognition tasks, they typically consist of a massive number of parameters. This results in significant memory requirement as well as…

Computer Vision and Pattern Recognition · Computer Science 2019-05-14 Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri

Deep Model Compression based on the Training History

Deep Convolutional Neural Networks (DCNNs) have shown promising performances in several visual recognition problems which motivated the researchers to propose popular architectures such as LeNet, AlexNet, VGGNet, ResNet, and many more.…

Computer Vision and Pattern Recognition · Computer Science 2022-05-13 S. H. Shabbeer Basha, Mohammad Farazuddin, Viswanath Pulabaigari, Shiv Ram Dubey, Snehasis Mukherjee

An Entropy-based Pruning Method for CNN Compression

This paper aims to simultaneously accelerate and compress off-the-shelf CNN models via filter pruning strategy. The importance of each filter is evaluated by the proposed entropy-based method first. Then several unimportant filters are…

Computer Vision and Pattern Recognition · Computer Science 2017-06-20 Jian-Hao Luo, Jianxin Wu

Efficient CNNs via Passive Filter Pruning

Convolutional neural networks (CNNs) have shown state-of-the-art performance in various applications. However, CNNs are resource-hungry due to their requirement of high computational complexity and memory storage. Recent efforts toward…

Machine Learning · Computer Science 2025-08-27 Arshdeep Singh, Mark D. Plumbley

RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration

Convolutional Neural Networks (CNNs) have demonstrated exceptional performance in recent years. Compressing these models not only reduces storage requirements, making deployment to edge devices feasible, but also accelerates inference,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Boyao Wang, Volodymyr Kindratenko

Pruning Filters for Efficient ConvNets

The success of CNNs in various applications is accompanied by a significant increase in the computation and parameter storage costs. Recent efforts toward reducing these overheads involve pruning and compressing the weights of various…

Computer Vision and Pattern Recognition · Computer Science 2017-03-13 Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, Hans Peter Graf

Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning

While Convolutional Neural Networks (CNNs) excel at learning complex latent-space representations, their over-parameterization can lead to overfitting and reduced performance, particularly with limited data. This, alongside their high…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Manish Sharma, Jamison Heard, Eli Saber, Panos P. Markopoulos

Complexity-Driven CNN Compression for Resource-constrained Edge AI

Recent advances in Artificial Intelligence (AI) on the Internet of Things (IoT)-enabled network edge have realized edge intelligence in several applications such as smart agriculture, smart hospitals, and smart factories by enabling…

Machine Learning · Computer Science 2024-01-18 Muhammad Zawish, Steven Davy, Lizy Abraham

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

This paper proposed a Soft Filter Pruning (SFP) method to accelerate the inference procedure of deep Convolutional Neural Networks (CNNs). Specifically, the proposed SFP enables the pruned filters to be updated when training the model after…

Computer Vision and Pattern Recognition · Computer Science 2018-08-22 Yang He, Guoliang Kang, Xuanyi Dong, Yanwei Fu, Yi Yang

ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression

We propose an efficient and unified framework, namely ThiNet, to simultaneously accelerate and compress CNN models in both training and inference stages. We focus on the filter level pruning, i.e., the whole filter would be discarded if it…

Computer Vision and Pattern Recognition · Computer Science 2017-07-21 Jian-Hao Luo, Jianxin Wu, Weiyao Lin

2PFPCE: Two-Phase Filter Pruning Based on Conditional Entropy

Deep Convolutional Neural Networks (CNNs) offer remarkable performance in classification and regression for many high-dimensional problems and have been widely utilized in real-world cognitive applications. However, high computational cost…

Computer Vision and Pattern Recognition · Computer Science 2018-09-10 Chuhan Min, Aosen Wang, Yiran Chen, Wenyao Xu, Xin Chen

A novel channel pruning method for deep neural network compression

In recent years, deep neural networks have achieved great success in the field of computer vision. However, it is still a big challenge to deploy these deep models on resource-constrained embedded devices such as mobile robots, smart phones…

Computer Vision and Pattern Recognition · Computer Science 2018-05-30 Yiming Hu, Siyang Sun, Jianquan Li, Xingang Wang, Qingyi Gu

Efficient Inference of CNNs via Channel Pruning

The deployment of Convolutional Neural Networks (CNNs) on resource-constrained platforms such as mobile devices and embedded systems has been greatly hindered by their high implementation cost, which has motivated a lot of research interest in…

Computer Vision and Pattern Recognition · Computer Science 2019-08-12 Boyu Zhang, Azadeh Davoodi, Yu Hen Hu

Compressing CNN models for resource-constrained systems by channel and layer pruning

Convolutional Neural Networks (CNNs) have achieved significant breakthroughs in various fields. However, these advancements have led to a substantial increase in the complexity and size of these networks. This poses a challenge when…

Machine Learning · Computer Science 2025-09-11 Ahmed Sadaqa, Di Liu

Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks

Recently there has been a lot of work on pruning filters from deep convolutional neural networks (CNNs) with the intention of reducing computations. The key idea is to rank the filters based on a certain criterion (say, $l_1$-norm, average…

Computer Vision and Pattern Recognition · Computer Science 2018-02-01 Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran

L2PF -- Learning to Prune Faster

Various applications in the field of autonomous driving are based on convolutional neural networks (CNNs), especially for processing camera data. The optimization of such CNNs is a major challenge in continuous development. Newly learned…

Computer Vision and Pattern Recognition · Computer Science 2021-01-08 Manoj-Rohit Vemparala, Nael Fasfous, Alexander Frickenstein, Mhd Ali Moraly, Aquib Jamal, Lukas Frickenstein, Christian Unger, Naveen-Shankar Nagaraja, Walter Stechele

Learning Efficient Convolutional Networks through Network Slimming

The deployment of deep convolutional neural networks (CNNs) in many real world applications is largely hindered by their high computational cost. In this paper, we propose a novel learning scheme for CNNs to simultaneously 1) reduce the…

Computer Vision and Pattern Recognition · Computer Science 2017-08-23 Zhuang Liu, Jianguo Li, Zhiqiang Shen, Gao Huang, Shoumeng Yan, Changshui Zhang

Deep Model Compression Via Two-Stage Deep Reinforcement Learning

Besides accuracy, the size of convolutional neural network (CNN) models is another important factor considering limited hardware resources in practical applications. For example, employing deep neural networks on mobile systems…

Machine Learning · Computer Science 2021-07-05 Huixin Zhan, Wei-Ming Lin, Yongcan Cao

Pruning Algorithms to Accelerate Convolutional Neural Networks for Edge Applications: A Survey

With the general trend of increasing Convolutional Neural Network (CNN) model sizes, model compression and acceleration techniques have become critical for the deployment of these models on edge devices. In this paper, we provide a…

Machine Learning · Computer Science 2020-05-12 Jiayi Liu, Samarth Tripathi, Unmesh Kurup, Mohak Shah

Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers

Model pruning has become a useful technique that improves the computational efficiency of deep learning, making it possible to deploy solutions in resource-limited scenarios. A widely-used practice in relevant work assumes that a…

Machine Learning · Computer Science 2018-02-06 Jianbo Ye, Xin Lu, Zhe Lin, James Z. Wang