Related papers: Dynamic Structure Pruning for Compressing CNNs

A Closer Look at Structured Pruning for Neural Network Compression

Structured pruning is a popular method for compressing a neural network: given a large trained network, one alternates between removing channel connections and fine-tuning; reducing the overall width of the network. However, the efficacy of…

Machine Learning · Statistics 2019-06-10 Elliot J. Crowley , Jack Turner , Amos Storkey , Michael O'Boyle

Structured Pruning of Neural Networks with Budget-Aware Regularization

Pruning methods have shown to be effective at reducing the size of deep neural networks while keeping accuracy almost intact. Among the most effective methods are those that prune a network while training it with a sparsity prior loss and…

Neural and Evolutionary Computing · Computer Science 2019-12-20 Carl Lemaire , Andrew Achkar , Pierre-Marc Jodoin

Pruning with Compensation: Efficient Channel Pruning for Deep Convolutional Neural Networks

Channel pruning is a promising technique to compress the parameters of deep convolutional neural networks(DCNN) and to speed up the inference. This paper aims to address the long-standing inefficiency of channel pruning. Most channel…

Computer Vision and Pattern Recognition · Computer Science 2021-09-01 Zhouyang Xie , Yan Fu , Shengzhao Tian , Junlin Zhou , Duanbing Chen

Structured Pruning for Deep Convolutional Neural Networks: A survey

The remarkable performance of deep Convolutional neural networks (CNNs) is generally attributed to their deeper and wider architectures, which can come with significant computational costs. Pruning neural networks has thus gained interest…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Yang He , Lingao Xiao

PruneNet: Channel Pruning via Global Importance

Channel pruning is one of the predominant approaches for accelerating deep neural networks. Most existing pruning methods either train from scratch with a sparsity inducing term such as group lasso, or prune redundant channels in a…

Machine Learning · Computer Science 2020-05-25 Ashish Khetan , Zohar Karnin

CUP: Cluster Pruning for Compressing Deep Neural Networks

We propose Cluster Pruning (CUP) for compressing and accelerating deep neural networks. Our approach prunes similar filters by clustering them based on features derived from both the incoming and outgoing weight connections. With CUP, we…

Computer Vision and Pattern Recognition · Computer Science 2019-11-21 Rahul Duggal , Cao Xiao , Richard Vuduc , Jimeng Sun

A Generalization of Continuous Relaxation in Structured Pruning

Deep learning harnesses massive parallel floating-point processing to train and evaluate large neural networks. Trends indicate that deeper and larger neural networks with an increasing number of parameters achieve higher accuracy than…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Brad Larson , Bishal Upadhyaya , Luke McDermott , Siddha Ganju

RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration

Convolutional Neural Networks (CNNs) have demonstrated exceptional performance in recent years. Compressing these models not only reduces storage requirements, making deployment to edge devices feasible, but also accelerates inference,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Boyao Wang , Volodymyr Kindratenko

A novel channel pruning method for deep neural network compression

In recent years, deep neural networks have achieved great success in the field of computer vision. However, it is still a big challenge to deploy these deep models on resource-constrained embedded devices such as mobile robots, smart phones…

Computer Vision and Pattern Recognition · Computer Science 2018-05-30 Yiming Hu , Siyang Sun , Jianquan Li , Xingang Wang , Qingyi Gu

Loss-Aware Automatic Selection of Structured Pruning Criteria for Deep Neural Network Acceleration

Structured pruning is a well-established technique for compressing neural networks, making it suitable for deployment in resource-limited edge devices. This paper presents an efficient Loss-Aware Automatic Selection of Structured Pruning…

Computer Vision and Pattern Recognition · Computer Science 2025-06-26 Deepak Ghimire , Kilho Lee , Seong-heum Kim

Manifold Regularized Dynamic Network Pruning

Neural network pruning is an essential approach for reducing the computational complexity of deep models so that they can be well deployed on resource-limited devices. Compared with conventional methods, the recently developed dynamic…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Yehui Tang , Yunhe Wang , Yixing Xu , Yiping Deng , Chao Xu , Dacheng Tao , Chang Xu

DMCP: Differentiable Markov Channel Pruning for Neural Networks

Recent works imply that the channel pruning can be regarded as searching optimal sub-structure from unpruned networks. However, existing works based on this observation require training and evaluating a large number of structures, which…

Computer Vision and Pattern Recognition · Computer Science 2020-05-11 Shaopeng Guo , Yujie Wang , Quanquan Li , Junjie Yan

Feature Flow Regularization: Improving Structured Sparsity in Deep Neural Networks

Pruning is a model compression method that removes redundant parameters in deep neural networks (DNNs) while maintaining accuracy. Most available filter pruning methods require complex treatments such as iterative pruning, features…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Yue Wu , Yuan Lan , Luchan Zhang , Yang Xiang

Structured Pruning of Deep Convolutional Neural Networks

Real time application of deep learning algorithms is often hindered by high computational complexity and frequent memory accesses. Network pruning is a promising technique to solve this problem. However, pruning usually results in irregular…

Neural and Evolutionary Computing · Computer Science 2015-12-31 Sajid Anwar , Kyuyeon Hwang , Wonyong Sung

Data-Efficient Structured Pruning via Submodular Optimization

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods do not provide any performance guarantees, and…

Machine Learning · Computer Science 2023-02-14 Marwa El Halabi , Suraj Srinivas , Simon Lacoste-Julien

Dynamic Neural Network Channel Execution for Efficient Training

Existing methods for reducing the computational burden of neural networks at run-time, such as parameter pruning or dynamic computational path selection, focus solely on improving computational efficiency during inference. On the other…

Machine Learning · Computer Science 2019-05-17 Simeon E. Spasov , Pietro Lio

Towards Optimal Structured CNN Pruning via Generative Adversarial Learning

Structured pruning of filters or neurons has received increased focus for compressing convolutional neural networks. Most existing methods rely on multi-stage optimizations in a layer-wise manner for iteratively pruning and retraining which…

Computer Vision and Pattern Recognition · Computer Science 2019-03-25 Shaohui Lin , Rongrong Ji , Chenqian Yan , Baochang Zhang , Liujuan Cao , Qixiang Ye , Feiyue Huang , David Doermann

REPrune: Channel Pruning via Kernel Representative Selection

Channel pruning is widely accepted to accelerate modern convolutional neural networks (CNNs). The resulting pruned model benefits from its immediate deployment on general-purpose software and hardware resources. However, its large pruning…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Mincheol Park , Dongjin Kim , Cheonjun Park , Yuna Park , Gyeong Eun Gong , Won Woo Ro , Suhyun Kim

Structure-Aware Automatic Channel Pruning by Searching with Graph Embedding

Channel pruning is a powerful technique to reduce the computational overhead of deep neural networks, enabling efficient deployment on resource-constrained devices. However, existing pruning methods often rely on local heuristics or…

Artificial Intelligence · Computer Science 2025-06-16 Zifan Liu , Yuan Cao , Yanwei Yu , Heng Qi , Jie Gui

Dynamic Probabilistic Pruning: A general framework for hardware-constrained pruning at different granularities

Unstructured neural network pruning algorithms have achieved impressive compression rates. However, the resulting - typically irregular - sparse matrices hamper efficient hardware implementations, leading to additional memory usage and…

Machine Learning · Computer Science 2021-05-27 Lizeth Gonzalez-Carabarin , Iris A. M. Huijben , Bastiaan S. Veeling , Alexandre Schmid , Ruud J. G. van Sloun