Related papers: Channel Pruning via Optimal Thresholding

PruneNet: Channel Pruning via Global Importance

Channel pruning is one of the predominant approaches for accelerating deep neural networks. Most existing pruning methods either train from scratch with a sparsity inducing term such as group lasso, or prune redundant channels in a…

Machine Learning · Computer Science 2020-05-25 Ashish Khetan , Zohar Karnin

Data-Efficient Structured Pruning via Submodular Optimization

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods do not provide any performance guarantees, and…

Machine Learning · Computer Science 2023-02-14 Marwa El Halabi , Suraj Srinivas , Simon Lacoste-Julien

Structured Directional Pruning via Perturbation Orthogonal Projection

Structured pruning is an effective compression technique to reduce the computation of neural networks, which is usually achieved by adding perturbations to reduce network parameters at the cost of slightly increasing training loss. A more…

Machine Learning · Computer Science 2021-10-22 Yinchuan Li , Xiaofeng Liu , Yunfeng Shao , Qing Wang , Yanhui Geng

New Pruning Method Based on DenseNet Network for Image Classification

Deep neural networks have made significant progress in the field of computer vision. Recent studies have shown that depth, width and shortcut connections of neural network architectures play a crucial role in their performance. One of the…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 Rui-Yang Ju , Ting-Yu Lin , Jen-Shiun Chiang

Towards Optimal Filter Pruning with Balanced Performance and Pruning Speed

Filter pruning has drawn more attention since resource constrained platform requires more compact model for deployment. However, current pruning methods suffer either from the inferior performance of one-shot methods, or the expensive time…

Computer Vision and Pattern Recognition · Computer Science 2020-10-15 Dong Li , Sitong Chen , Xudong Liu , Yunda Sun , Li Zhang

CPOT: Channel Pruning via Optimal Transport

Recent advances in deep neural networks (DNNs) lead to tremendously growing network parameters, making the deployments of DNNs on platforms with limited resources extremely difficult. Therefore, various pruning methods have been developed…

Machine Learning · Computer Science 2020-05-22 Yucong Shen , Li Shen , Hao-Zhi Huang , Xuan Wang , Wei Liu

OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks

Channel pruning can significantly accelerate and compress deep neural networks. Many channel pruning works utilize structured sparsity regularization to zero out all the weights in some channels and automatically obtain structure-sparse…

Computer Vision and Pattern Recognition · Computer Science 2019-07-02 Jiashi Li , Qi Qi , Jingyu Wang , Ce Ge , Yujian Li , Zhangzhang Yue , Haifeng Sun

Discrimination-aware Channel Pruning for Deep Neural Networks

Channel pruning is one of the predominant approaches for deep model compression. Existing pruning methods either train from scratch with sparsity constraints on channels, or minimize the reconstruction error between the pre-trained feature…

Computer Vision and Pattern Recognition · Computer Science 2019-01-15 Zhuangwei Zhuang , Mingkui Tan , Bohan Zhuang , Jing Liu , Yong Guo , Qingyao Wu , Junzhou Huang , Jinhui Zhu

Network Pruning via Annealing and Direct Sparsity Control

Artificial neural networks (ANNs) especially deep convolutional networks are very popular these days and have been proved to successfully offer quite reliable solutions to many vision problems. However, the use of deep neural networks is…

Machine Learning · Computer Science 2020-07-28 Yangzi Guo , Yiyuan She , Adrian Barbu

Learned Threshold Pruning

This paper presents a novel differentiable method for unstructured weight pruning of deep neural networks. Our learned-threshold pruning (LTP) method learns per-layer thresholds via gradient descent, unlike conventional methods where they…

Machine Learning · Computer Science 2021-03-22 Kambiz Azarian , Yash Bhalgat , Jinwon Lee , Tijmen Blankevoort

Pruning with Compensation: Efficient Channel Pruning for Deep Convolutional Neural Networks

Channel pruning is a promising technique to compress the parameters of deep convolutional neural networks(DCNN) and to speed up the inference. This paper aims to address the long-standing inefficiency of channel pruning. Most channel…

Computer Vision and Pattern Recognition · Computer Science 2021-09-01 Zhouyang Xie , Yan Fu , Shengzhao Tian , Junlin Zhou , Duanbing Chen

OTPrune: Distribution-Aligned Visual Token Pruning via Optimal Transport

Multi-modal large language models (MLLMs) achieve strong visual-language reasoning but suffer from high inference cost due to redundant visual tokens. Recent work explores visual token pruning to accelerate inference, while existing pruning…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Xiwen Chen , Wenhui Zhu , Gen Li , Xuanzhao Dong , Yujian Xiong , Hao Wang , Peijie Qiu , Qingquan Song , Zhipeng Wang , Shao Tang , Yalin Wang , Abolfazl Razi

Pruning from Scratch

Network pruning is an important research field aiming at reducing computational costs of neural networks. Conventional approaches follow a fixed paradigm which first trains a large and redundant network, and then determines which units…

Computer Vision and Pattern Recognition · Computer Science 2019-09-30 Yulong Wang , Xiaolu Zhang , Lingxi Xie , Jun Zhou , Hang Su , Bo Zhang , Xiaolin Hu

To Filter Prune, or to Layer Prune, That Is The Question

Recent advances in pruning of neural networks have made it possible to remove a large number of filters or weights without any perceptible drop in accuracy. The number of parameters and that of FLOPs are usually the reported metrics to…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Sara Elkerdawy , Mostafa Elhoushi , Abhineet Singh , Hong Zhang , Nilanjan Ray

PCAS: Pruning Channels with Attention Statistics for Deep Network Compression

Compression techniques for deep neural networks are important for implementing them on small embedded devices. In particular, channel-pruning is a useful technique for realizing compact networks. However, many conventional methods require…

Machine Learning · Statistics 2021-11-03 Kohei Yamamoto , Kurato Maeno

Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery

We propose a novel algorithm for combined unit and layer pruning of deep neural networks that functions during training and without requiring a pre-trained network to apply. Our algorithm optimally trades-off learning accuracy and pruning…

Machine Learning · Computer Science 2025-07-17 Valentin Frank Ingmar Guenter , Athanasios Sideris

A Closer Look at Structured Pruning for Neural Network Compression

Structured pruning is a popular method for compressing a neural network: given a large trained network, one alternates between removing channel connections and fine-tuning; reducing the overall width of the network. However, the efficacy of…

Machine Learning · Statistics 2019-06-10 Elliot J. Crowley , Jack Turner , Amos Storkey , Michael O'Boyle

Rethinking Class-Discrimination Based CNN Channel Pruning

Channel pruning has received ever-increasing focus on network compression. In particular, class-discrimination based channel pruning has made major headway, as it fits seamlessly with the classification objective of CNNs and provides good…

Computer Vision and Pattern Recognition · Computer Science 2020-05-01 Yuchen Liu , David Wentzlaff , S. Y. Kung

Gradual Channel Pruning while Training using Feature Relevance Scores for Convolutional Neural Networks

The enormous inference cost of deep neural networks can be scaled down by network compression. Pruning is one of the predominant approaches used for deep network compression. However, existing pruning techniques have one or more of the…

Machine Learning · Computer Science 2020-10-13 Sai Aparna Aketi , Sourjya Roy , Anand Raghunathan , Kaushik Roy

Efficient Inference of CNNs via Channel Pruning

The deployment of Convolutional Neural Networks (CNNs) on resource constrained platforms such as mobile devices and embedded systems has been greatly hindered by their high implementation cost, and thus motivated a lot research interest in…

Computer Vision and Pattern Recognition · Computer Science 2019-08-12 Boyu Zhang , Azadeh Davoodi , Yu Hen Hu