Related papers: Structured Probabilistic Pruning for Convolutional…

LAPP: Layer Adaptive Progressive Pruning for Compressing CNNs from Scratch

Structured pruning is a commonly used convolutional neural network (CNN) compression approach. Pruning rate setting is a fundamental problem in structured pruning. Most existing works introduce too many additional learnable parameters to…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Pucheng Zhai , Kailing Guo , Fang Liu , Xiaofen Xing , Xiangmin Xu

StructADMM: A Systematic, High-Efficiency Framework of Structured Weight Pruning for DNNs

Weight pruning methods of DNNs have been demonstrated to achieve a good model pruning rate without loss of accuracy, thereby alleviating the significant computation/storage requirements of large-scale DNNs. Structured weight pruning methods…

Neural and Evolutionary Computing · Computer Science 2019-03-28 Tianyun Zhang , Shaokai Ye , Kaiqi Zhang , Xiaolong Ma , Ning Liu , Linfeng Zhang , Jian Tang , Kaisheng Ma , Xue Lin , Makan Fardad , Yanzhi Wang

Loss-Aware Automatic Selection of Structured Pruning Criteria for Deep Neural Network Acceleration

Structured pruning is a well-established technique for compressing neural networks, making it suitable for deployment in resource-limited edge devices. This paper presents an efficient Loss-Aware Automatic Selection of Structured Pruning…

Computer Vision and Pattern Recognition · Computer Science 2025-06-26 Deepak Ghimire , Kilho Lee , Seong-heum Kim

Dynamic Probabilistic Pruning: A general framework for hardware-constrained pruning at different granularities

Unstructured neural network pruning algorithms have achieved impressive compression rates. However, the resulting - typically irregular - sparse matrices hamper efficient hardware implementations, leading to additional memory usage and…

Machine Learning · Computer Science 2021-05-27 Lizeth Gonzalez-Carabarin , Iris A. M. Huijben , Bastiaan S. Veeling , Alexandre Schmid , Ruud J. G. van Sloun

Channel Pruning for Accelerating Very Deep Neural Networks

In this paper, we introduce a new channel pruning method to accelerate very deep convolutional neural networks.Given a trained CNN model, we propose an iterative two-step algorithm to effectively prune each layer, by a LASSO regression…

Computer Vision and Pattern Recognition · Computer Science 2017-08-22 Yihui He , Xiangyu Zhang , Jian Sun

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

This paper proposed a Soft Filter Pruning (SFP) method to accelerate the inference procedure of deep Convolutional Neural Networks (CNNs). Specifically, the proposed SFP enables the pruned filters to be updated when training the model after…

Computer Vision and Pattern Recognition · Computer Science 2018-08-22 Yang He , Guoliang Kang , Xuanyi Dong , Yanwei Fu , Yi Yang

Accelerating DNN Training with Structured Data Gradient Pruning

Weight pruning is a technique to make Deep Neural Network (DNN) inference more computationally efficient by reducing the number of model parameters over the course of training. However, most weight pruning techniques generally does not…

Machine Learning · Computer Science 2022-02-03 Bradley McDanel , Helia Dinh , John Magallanes

Slimmable Pruned Neural Networks

Slimmable Neural Networks (S-Net) is a novel network which enabled to select one of the predefined proportions of channels (sub-network) dynamically depending on the current computational resource availability. The accuracy of each…

Computer Vision and Pattern Recognition · Computer Science 2022-12-08 Hideaki Kuratsu , Atsuyoshi Nakamura

Leveraging Structured Pruning of Convolutional Neural Networks

Structured pruning is a popular method to reduce the cost of convolutional neural networks, that are the state of the art in many computer vision tasks. However, depending on the architecture, pruning introduces dimensional discrepancies…

Neural and Evolutionary Computing · Computer Science 2022-12-13 Hugo Tessier , Vincent Gripon , Mathieu Léonardon , Matthieu Arzel , David Bertrand , Thomas Hannagan

Towards Higher Ranks via Adversarial Weight Pruning

Convolutional Neural Networks (CNNs) are hard to deploy on edge devices due to its high computation and storage complexities. As a common practice for model compression, network pruning consists of two major categories: unstructured and…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Yuchuan Tian , Hanting Chen , Tianyu Guo , Chao Xu , Yunhe Wang

SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks

We introduce a pruning algorithm that provably sparsifies the parameters of a trained model in a way that approximately preserves the model's predictive accuracy. Our algorithm uses a small batch of input points to construct a data-informed…

Machine Learning · Computer Science 2021-03-16 Cenk Baykal , Lucas Liebenwein , Igor Gilitschenski , Dan Feldman , Daniela Rus

Three Dimensional Convolutional Neural Network Pruning with Regularization-Based Method

Despite enjoying extensive applications in video analysis, three-dimensional convolutional neural networks (3D CNNs)are restricted by their massive computation and storage consumption. To solve this problem, we propose a threedimensional…

Machine Learning · Computer Science 2019-05-21 Yuxin Zhang , Huan Wang , Yang Luo , Lu Yu , Haoji Hu , Hangguan Shan , Tony Q. S. Quek

A One-step Pruning-recovery Framework for Acceleration of Convolutional Neural Networks

Acceleration of convolutional neural network has received increasing attention during the past several years. Among various acceleration techniques, filter pruning has its inherent merit by effectively reducing the number of convolution…

Computer Vision and Pattern Recognition · Computer Science 2019-06-19 Dong Wang , Lei Zhou , Xiao Bai , Jun Zhou

Structured Pruning of Recurrent Neural Networks through Neuron Selection

Recurrent neural networks (RNNs) have recently achieved remarkable successes in a number of applications. However, the huge sizes and computational burden of these models make it difficult for their deployment on edge devices. A practically…

Machine Learning · Computer Science 2019-12-10 Liangjian Wen , Xuanyang Zhang , Haoli Bai , Zenglin Xu

Automatic Complementary Separation Pruning Toward Lightweight CNNs

In this paper, we present Automatic Complementary Separation Pruning (ACSP), a novel and fully automated pruning method for convolutional neural networks. ACSP integrates the strengths of both structured pruning and activation-based…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 David Levin , Gonen Singer

Pushing the Efficiency Limit Using Structured Sparse Convolutions

Weight pruning is among the most popular approaches for compressing deep convolutional neural networks. Recent work suggests that in a randomly initialized deep neural network, there exist sparse subnetworks that achieve performance…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Vinay Kumar Verma , Nikhil Mehta , Shijing Si , Ricardo Henao , Lawrence Carin

ACP: Automatic Channel Pruning via Clustering and Swarm Intelligence Optimization for CNN

As the convolutional neural network (CNN) gets deeper and wider in recent years, the requirements for the amount of data and hardware resources have gradually increased. Meanwhile, CNN also reveals salient redundancy in several tasks. The…

Computer Vision and Pattern Recognition · Computer Science 2021-01-19 Jingfei Chang , Yang Lu , Ping Xue , Yiqun Xu , Zhen Wei

Parameterized Structured Pruning for Deep Neural Networks

As a result of the growing size of Deep Neural Networks (DNNs), the gap to hardware capabilities in terms of memory and compute increases. To effectively compress DNNs, quantization and connection pruning are usually considered. However,…

Machine Learning · Computer Science 2019-06-13 Guenther Schindler , Wolfgang Roth , Franz Pernkopf , Holger Froening

Speeding up convolutional networks pruning with coarse ranking

Channel-based pruning has achieved significant successes in accelerating deep convolutional neural network, whose pipeline is an iterative three-step procedure: ranking, pruning and fine-tuning. However, this iterative procedure is…

Computer Vision and Pattern Recognition · Computer Science 2019-02-19 Zi Wang , Chengcheng Li , Dali Wang , Xiangyang Wang , Hairong Qi

Stochastic Subnetwork Annealing: A Regularization Technique for Fine Tuning Pruned Subnetworks

Pruning methods have recently grown in popularity as an effective way to reduce the size and computational complexity of deep neural networks. Large numbers of parameters can be removed from trained models with little discernible loss in…

Machine Learning · Computer Science 2024-01-18 Tim Whitaker , Darrell Whitley