Related papers: COP: Customized Deep Model Compression via Regular…

Pruning by Explaining: A Novel Criterion for Deep Neural Network Pruning

The success of convolutional neural networks (CNNs) in various applications is accompanied by a significant increase in computation and parameter storage costs. Recent efforts to reduce these overheads involve pruning and compressing the…

Machine Learning · Computer Science 2021-03-15 Seul-Ki Yeom , Philipp Seegerer , Sebastian Lapuschkin , Alexander Binder , Simon Wiedemann , Klaus-Robert Müller , Wojciech Samek

Neural Network Compression via Effective Filter Analysis and Hierarchical Pruning

Network compression is crucial to making the deep networks to be more efficient, faster, and generalizable to low-end hardware. Current network compression methods have two open problems: first, there lacks a theoretical framework to…

Machine Learning · Computer Science 2022-06-09 Ziqi Zhou , Li Lian , Yilong Yin , Ze Wang

Stability Based Filter Pruning for Accelerating Deep CNNs

Convolutional neural networks (CNN) have achieved impressive performance on the wide variety of tasks (classification, detection, etc.) across multiple domains at the cost of high computational and memory requirements. Thus, leveraging CNNs…

Computer Vision and Pattern Recognition · Computer Science 2018-11-21 Pravendra Singh , Vinay Sameer Raja Kadi , Nikhil Verma , Vinay P. Namboodiri

Neural Network Pruning by Cooperative Coevolution

Neural network pruning is a popular model compression method which can significantly reduce the computing cost with negligible loss of accuracy. Recently, filters are often pruned directly by designing proper criteria or using auxiliary…

Neural and Evolutionary Computing · Computer Science 2022-05-10 Haopu Shang , Jia-Liang Wu , Wenjing Hong , Chao Qian

Integrating Pruning with Quantization for Efficient Deep Neural Networks Compression

Deep Neural Networks (DNNs) have achieved significant advances in a wide range of applications. However, their deployment on resource-constrained devices remains a challenge due to the large number of layers and parameters, which result in…

Neural and Evolutionary Computing · Computer Science 2025-09-05 Sara Makenali , Babak Rokh , Ali Azarpeyvand

Exploiting Channel Similarity for Accelerating Deep Convolutional Neural Networks

To address the limitations of existing magnitude-based pruning algorithms in cases where model weights or activations are of large and similar magnitude, we propose a novel perspective to discover parameter redundancy among channels and…

Machine Learning · Computer Science 2019-08-08 Yunxiang Zhang , Chenglong Zhao , Bingbing Ni , Jian Zhang , Haoran Deng

Conditional Automated Channel Pruning for Deep Neural Networks

Model compression aims to reduce the redundancy of deep networks to obtain compact models. Recently, channel pruning has become one of the predominant compression methods to deploy deep models on resource-constrained devices. Most channel…

Computer Vision and Pattern Recognition · Computer Science 2021-07-21 Yixin Liu , Yong Guo , Zichang Liu , Haohua Liu , Jingjie Zhang , Zejun Chen , Jing Liu , Jian Chen

Pruning with Compensation: Efficient Channel Pruning for Deep Convolutional Neural Networks

Channel pruning is a promising technique to compress the parameters of deep convolutional neural networks(DCNN) and to speed up the inference. This paper aims to address the long-standing inefficiency of channel pruning. Most channel…

Computer Vision and Pattern Recognition · Computer Science 2021-09-01 Zhouyang Xie , Yan Fu , Shengzhao Tian , Junlin Zhou , Duanbing Chen

Play and Prune: Adaptive Filter Pruning for Deep Model Compression

While convolutional neural networks (CNN) have achieved impressive performance on various classification/recognition tasks, they typically consist of a massive number of parameters. This results in significant memory requirement as well as…

Computer Vision and Pattern Recognition · Computer Science 2019-05-14 Pravendra Singh , Vinay Kumar Verma , Piyush Rai , Vinay P. Namboodiri

Dependency Aware Filter Pruning

Convolutional neural networks (CNNs) are typically over-parameterized, bringing considerable computational overhead and memory footprint in inference. Pruning a proportion of unimportant filters is an efficient way to mitigate the inference…

Computer Vision and Pattern Recognition · Computer Science 2020-05-07 Kai Zhao , Xin-Yu Zhang , Qi Han , Ming-Ming Cheng

CUP: Cluster Pruning for Compressing Deep Neural Networks

We propose Cluster Pruning (CUP) for compressing and accelerating deep neural networks. Our approach prunes similar filters by clustering them based on features derived from both the incoming and outgoing weight connections. With CUP, we…

Computer Vision and Pattern Recognition · Computer Science 2019-11-21 Rahul Duggal , Cao Xiao , Richard Vuduc , Jimeng Sun

Deep Model Compression based on the Training History

Deep Convolutional Neural Networks (DCNNs) have shown promising performances in several visual recognition problems which motivated the researchers to propose popular architectures such as LeNet, AlexNet, VGGNet, ResNet, and many more.…

Computer Vision and Pattern Recognition · Computer Science 2022-05-13 S. H. Shabbeer Basha , Mohammad Farazuddin , Viswanath Pulabaigari , Shiv Ram Dubey , Snehasis Mukherjee

Convolutional Neural Network Pruning with Structural Redundancy Reduction

Convolutional neural network (CNN) pruning has become one of the most successful network compression approaches in recent years. Existing works on network pruning usually focus on removing the least important filters in the network to…

Computer Vision and Pattern Recognition · Computer Science 2021-04-09 Zi Wang , Chengcheng Li , Xiangyang Wang

Exploring Linear Relationship in Feature Map Subspace for ConvNets Compression

While the research on convolutional neural networks (CNNs) is progressing quickly, the real-world deployment of these models is often limited by computing resources and memory constraints. In this paper, we address this issue by proposing a…

Computer Vision and Pattern Recognition · Computer Science 2018-03-16 Dong Wang , Lei Zhou , Xueni Zhang , Xiao Bai , Jun Zhou

Effective Layer Pruning Through Similarity Metric Perspective

Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the…

Machine Learning · Computer Science 2024-11-05 Ian Pons , Bruno Yamamoto , Anna H. Reali Costa , Artur Jordao

Compressing CNN models for resource-constrained systems by channel and layer pruning

Convolutional Neural Networks (CNNs) have achieved significant breakthroughs in various fields. However, these advancements have led to a substantial increase in the complexity and size of these networks. This poses a challenge when…

Machine Learning · Computer Science 2025-09-11 Ahmed Sadaqa , Di Liu

Interpreting Convolutional Neural Networks Through Compression

Convolutional neural networks (CNNs) achieve state-of-the-art performance in a wide variety of tasks in computer vision. However, interpreting CNNs still remains a challenge. This is mainly due to the large number of parameters in these…

Machine Learning · Statistics 2017-11-08 Reza Abbasi-Asl , Bin Yu

LAPP: Layer Adaptive Progressive Pruning for Compressing CNNs from Scratch

Structured pruning is a commonly used convolutional neural network (CNN) compression approach. Pruning rate setting is a fundamental problem in structured pruning. Most existing works introduce too many additional learnable parameters to…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Pucheng Zhai , Kailing Guo , Fang Liu , Xiaofen Xing , Xiangmin Xu

Structural Compression of Convolutional Neural Networks

Deep convolutional neural networks (CNNs) have been successful in many tasks in machine vision, however, millions of weights in the form of thousands of convolutional filters in CNNs makes them difficult for human intepretation or…

Computer Vision and Pattern Recognition · Computer Science 2020-03-26 Reza Abbasi-Asl , Bin Yu

COPS: Controlled Pruning Before Training Starts

State-of-the-art deep neural network (DNN) pruning techniques, applied one-shot before training starts, evaluate sparse architectures with the help of a single criterion -- called pruning score. Pruning weights based on a solitary score…

Machine Learning · Computer Science 2022-03-16 Paul Wimmer , Jens Mehnert , Alexandru Condurache