Related papers: Layer Pruning with Consensus: A Triple-Win Solutio…

To Filter Prune, or to Layer Prune, That Is The Question

Recent advances in pruning of neural networks have made it possible to remove a large number of filters or weights without any perceptible drop in accuracy. The number of parameters and that of FLOPs are usually the reported metrics to…

Computer Vision and Pattern Recognition · Computer Science · 2020-11-10 · Sara Elkerdawy, Mostafa Elhoushi, Abhineet Singh, Hong Zhang, Nilanjan Ray

Effective Layer Pruning Through Similarity Metric Perspective

Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the…

Machine Learning · Computer Science · 2024-11-05 · Ian Pons, Bruno Yamamoto, Anna H. Reali Costa, Artur Jordao

Layer-wise Model Pruning based on Mutual Information

The proposed pruning strategy offers merits over weight-based pruning techniques: (1) it avoids irregular memory access since representations and matrices can be squeezed into their smaller but dense counterparts, leading to greater…

Computation and Language · Computer Science · 2021-08-31 · Chun Fan, Jiwei Li, Xiang Ao, Fei Wu, Yuxian Meng, Xiaofei Sun

Bonsai: A Framework for Convolutional Neural Network Acceleration Using Criterion-Based Pruning

As the need for more accurate and powerful Convolutional Neural Networks (CNNs) increases, so too does the size, execution time, memory footprint, and power consumption. To overcome this, solutions such as pruning have been proposed with…

Artificial Intelligence · Computer Science · 2026-02-20 · Joseph Bingham, Sam Helmich

Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks

Resource-efficient convolution neural networks enable not only the intelligence on edge devices but also opportunities in system-level optimization such as scheduling. In this work, we aim to improve the performance of resource-constrained…

Computer Vision and Pattern Recognition · Computer Science · 2018-10-19 · Ting-Wu Chin, Cha Zhang, Diana Marculescu

Pruning by Explaining: A Novel Criterion for Deep Neural Network Pruning

The success of convolutional neural networks (CNNs) in various applications is accompanied by a significant increase in computation and parameter storage costs. Recent efforts to reduce these overheads involve pruning and compressing the…

Machine Learning · Computer Science · 2021-03-15 · Seul-Ki Yeom, Philipp Seegerer, Sebastian Lapuschkin, Alexander Binder, Simon Wiedemann, Klaus-Robert Müller, Wojciech Samek

GradMAP: Faster Layer Pruning with Gradient Metric and Projection Compensation

Large Language Models (LLMs) exhibit strong reasoning abilities, but their high computational costs limit their practical deployment. Recent studies reveal significant redundancy in LLMs layers, making layer pruning an active research…

Computation and Language · Computer Science · 2026-02-17 · Hao Liu, Guangyan Li, Wensheng Zhang, Yongqiang Tang

Towards Optimal Filter Pruning with Balanced Performance and Pruning Speed

Filter pruning has drawn more attention since resource constrained platform requires more compact model for deployment. However, current pruning methods suffer either from the inferior performance of one-shot methods, or the expensive time…

Computer Vision and Pattern Recognition · Computer Science · 2020-10-15 · Dong Li, Sitong Chen, Xudong Liu, Yunda Sun, Li Zhang

Blending Pruning Criteria for Convolutional Neural Networks

The advancement of convolutional neural networks (CNNs) on various vision applications has attracted lots of attention. Yet the majority of CNNs are unable to satisfy the strict requirement for real-world deployment. To overcome this, the…

Computer Vision and Pattern Recognition · Computer Science · 2021-07-13 · Wei He, Zhongzhan Huang, Mingfu Liang, Senwei Liang, Haizhao Yang

Progressive Correspondence Pruning by Consensus Learning

Correspondence selection aims to correctly select the consistent matches (inliers) from an initial set of putative correspondences. The selection is challenging since putative matches are typically extremely unbalanced, largely dominated by…

Computer Vision and Pattern Recognition · Computer Science · 2021-07-30 · Chen Zhao, Yixiao Ge, Feng Zhu, Rui Zhao, Hongsheng Li, Mathieu Salzmann

ThinResNet: A New Baseline for Structured Convolutional Networks Pruning

Pruning is a compression method which aims to improve the efficiency of neural networks by reducing their number of parameters while maintaining a good performance, thus enhancing the performance-to-cost ratio in nontrivial ways. Of…

Neural and Evolutionary Computing · Computer Science · 2023-09-25 · Hugo Tessier, Ghouti Boukli Hacene, Vincent Gripon

Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning

LLM-based recommender systems have made significant progress; however, the deployment cost associated with the large parameter volume of LLMs still hinders their real-world applications. This work explores parameter pruning to improve…

Information Retrieval · Computer Science · 2025-07-10 · Shanle Zheng, Keqin Bao, Jizhi Zhang, Yang Zhang, Fuli Feng, Xiangnan He

Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude Compensation

Layer pruning has emerged as a promising technique for compressing large language models (LLMs) while achieving acceleration proportional to the pruning ratio. In this work, we identify that removing any layer induces a significant…

Computation and Language · Computer Science · 2025-07-25 · Xinrui Chen, Hongxing Zhang, Fanyi Zeng, Yongxian Wei, Yizhi Wang, Xitong Ling, Guanghao Li, Chun Yuan

Filter Pruning via Filters Similarity in Consecutive Layers

Filter pruning is widely adopted to compress and accelerate the Convolutional Neural Networks (CNNs), but most previous works ignore the relationship between filters and channels in different layers. Processing each layer independently…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-27 · Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie

SimDiff: Depth Pruning via Similarity and Difference

Depth pruning improves the deployment efficiency of large language models (LLMs) by identifying and removing redundant layers. A widely accepted standard for this identification process is to measure the similarity between layers using…

Artificial Intelligence · Computer Science · 2026-04-22 · Yuli Chen, Shuhao Zhang, Fanshen Meng, Bo Cheng, Jiale Han, Qiang Tong, Xiulei Liu

Pruning Everything, Everywhere, All at Once

Deep learning stands as the modern paradigm for solving cognitive tasks. However, as the problem complexity increases, models grow deeper and computationally prohibitive, hindering advancements in real-world and resource-constrained…

Computer Vision and Pattern Recognition · Computer Science · 2025-06-06 · Gustavo Henrique do Nascimento, Ian Pons, Anna Helena Reali Costa, Artur Jordao

One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression

Pruning is a core technique for compressing neural networks to improve computational efficiency. This process is typically approached in two ways: one-shot pruning, which involves a single pass of training and pruning, and iterative…

Machine Learning · Computer Science · 2025-08-20 · Mikołaj Janusz, Tomasz Wojnar, Yawei Li, Luca Benini, Kamil Adamczewski

Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework

Most neural network pruning methods, such as filter-level and layer-level prunings, prune the network model along one dimension (depth, width, or resolution) solely to meet a computational budget. However, such a pruning policy often leads…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-16 · Wenxiao Wang, Minghao Chen, Shuai Zhao, Long Chen, Jinming Hu, Haifeng Liu, Deng Cai, Xiaofei He, Wei Liu

Convolutional Neural Network Simplification with Progressive Retraining

Kernel pruning methods have been proposed to speed up, simplify, and improve explanation of convolutional neural network (CNN) models. However, the effectiveness of a simplified model is often below the original one. In this letter, we…

Machine Learning · Computer Science · 2021-08-19 · D. Osaku, J. F. Gomes, A. X. Falcão

Layer Pruning via Fusible Residual Convolutional Block for Deep Neural Networks

In order to deploy deep convolutional neural networks (CNNs) on resource-limited devices, many model pruning methods for filters and weights have been developed, while only a few to layer pruning. However, compared with filter pruning and…

Computer Vision and Pattern Recognition · Computer Science · 2020-12-01 · Pengtao Xu, Jian Cao, Fanhua Shang, Wenyu Sun, Pu Li