Related papers: Adaptive Activation-based Structured Pruning

Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions

Pruning is a promising approach to compress deep learning models in order to deploy them on resource-constrained edge devices. However, many existing pruning solutions are based on unstructured pruning, which yields models that cannot…

Machine Learning · Computer Science 2023-03-16 Kaiqi Zhao, Animesh Jain, Ming Zhao

Iterative Activation-based Structured Pruning

Deploying complex deep learning models on edge devices is challenging because they have substantial compute and memory resource requirements, whereas edge devices' resource budget is limited. To solve this problem, extensive pruning…

Machine Learning · Computer Science 2022-01-26 Kaiqi Zhao, Animesh Jain, Ming Zhao

Data-Efficient Structured Pruning via Submodular Optimization

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods do not provide any performance guarantees, and…

Machine Learning · Computer Science 2023-02-14 Marwa El Halabi, Suraj Srinivas, Simon Lacoste-Julien

AutoDFP: Automatic Data-Free Pruning via Channel Similarity Reconstruction

Structured pruning methods are developed to bridge the gap between the massive scale of neural networks and the limited hardware resources. Most current structured pruning methods rely on training datasets to fine-tune the compressed model,…

Machine Learning · Computer Science 2024-03-14 Siqi Li, Jun Chen, Jingyang Xiang, Chengrui Zhu, Yong Liu

Loss-Aware Automatic Selection of Structured Pruning Criteria for Deep Neural Network Acceleration

Structured pruning is a well-established technique for compressing neural networks, making it suitable for deployment in resource-limited edge devices. This paper presents an efficient Loss-Aware Automatic Selection of Structured Pruning…

Computer Vision and Pattern Recognition · Computer Science 2025-06-26 Deepak Ghimire, Kilho Lee, Seong-heum Kim

Structured Pruning for Multi-Task Deep Neural Networks

Although multi-task deep neural network (DNN) models have computation and storage benefits over individual single-task DNN models, they can be further optimized via model compression. Numerous structured pruning methods are already…

Machine Learning · Computer Science 2023-04-17 Siddhant Garg, Lijun Zhang, Hui Guan

AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates

Structured weight pruning is a representative model compression technique of DNNs to reduce the storage and computation requirements and accelerate inference. An automatic hyperparameter determination process is necessary due to the large…

Machine Learning · Computer Science 2019-09-12 Ning Liu, Xiaolong Ma, Zhiyuan Xu, Yanzhi Wang, Jian Tang, Jieping Ye

Attention-Based Guided Structured Sparsity of Deep Neural Networks

Network pruning is aimed at imposing sparsity in a neural network architecture by increasing the portion of zero-valued weights for reducing its size regarding energy-efficiency consideration and increasing evaluation speed. In most of the…

Machine Learning · Computer Science 2018-07-17 Amirsina Torfi, Rouzbeh A. Shirvani, Sobhan Soleymani, Nasser M. Nasrabadi

Automatic Structured Pruning for Efficient Architecture in Federated Learning

In Federated Learning (FL), training is conducted on client devices, typically with limited computational resources and storage capacity. To address these constraints, we propose an automatic pruning scheme tailored for FL systems. Our…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Thai Vu Nguyen, Long Bao Le, Anderson Avila

Activation Density driven Energy-Efficient Pruning in Training

Neural network pruning with suitable retraining can yield networks with considerably fewer parameters than the original with comparable degrees of accuracy. Typical pruning methods require large, fully trained networks as a starting point…

Machine Learning · Computer Science 2020-10-13 Timothy Foldy-Porto, Yeshwanth Venkatesha, Priyadarshini Panda

Deep Neural Networks pruning via the Structured Perspective Regularization

In Machine Learning, Artificial Neural Networks (ANNs) are a very powerful tool, broadly used in many applications. Often, the selected (deep) architectures include many layers, and therefore a large amount of parameters, which makes…

Machine Learning · Computer Science 2022-06-29 Matteo Cacciola, Antonio Frangioni, Xinlin Li, Andrea Lodi

Adaptive Structured Pruning of Convolutional Neural Networks for Time Series Classification

Deep learning models for Time Series Classification (TSC) have achieved strong predictive performance but their high computational and memory requirements often limit deployment on resource-constrained devices. While structured pruning can…

Machine Learning · Computer Science 2026-02-16 Javidan Abdullayev, Maxime Devanne, Cyril Meyer, Ali Ismail-Fawaz, Jonathan Weber, Germain Forestier

Structured Pruning for Deep Convolutional Neural Networks: A survey

The remarkable performance of deep Convolutional neural networks (CNNs) is generally attributed to their deeper and wider architectures, which can come with significant computational costs. Pruning neural networks has thus gained interest…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Yang He, Lingao Xiao

Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks

In this paper, we propose a novel layer-adaptive weight-pruning approach for Deep Neural Networks (DNNs) that addresses the challenge of optimizing the output distortion minimization while adhering to a target pruning ratio constraint. Our…

Computer Vision and Pattern Recognition · Computer Science 2023-08-25 Kaixin Xu, Zhe Wang, Xue Geng, Jie Lin, Min Wu, Xiaoli Li, Weisi Lin

C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression

Neural network compression has gained increasing attention in recent years, particularly in computer vision applications, where the need for model reduction is crucial for overcoming deployment constraints. Pruning is a widely used…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Baptiste Bauvin, Loïc Baret, Ola Ahmad

Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance

Recent advancements have scaled neural networks to unprecedented sizes, achieving remarkable performance across a wide range of tasks. However, deploying these large-scale models on resource-constrained devices poses significant challenges…

Machine Learning · Computer Science 2024-10-22 Mostafa Hussien, Mahmoud Afifi, Kim Khoa Nguyen, Mohamed Cheriet

Instruction-Following Pruning for Large Language Models

With the rapid scaling of large language models (LLMs), structured pruning has become a widely used technique to learn efficient, smaller models from larger ones, delivering superior performance compared to training similarly sized models…

Computation and Language · Computer Science 2025-06-04 Bairu Hou, Qibin Chen, Jianyu Wang, Guoli Yin, Chong Wang, Nan Du, Ruoming Pang, Shiyu Chang, Tao Lei

Towards Optimal Structured CNN Pruning via Generative Adversarial Learning

Structured pruning of filters or neurons has received increased focus for compressing convolutional neural networks. Most existing methods rely on multi-stage optimizations in a layer-wise manner for iteratively pruning and retraining which…

Computer Vision and Pattern Recognition · Computer Science 2019-03-25 Shaohui Lin, Rongrong Ji, Chenqian Yan, Baochang Zhang, Liujuan Cao, Qixiang Ye, Feiyue Huang, David Doermann

AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning

As deep neural networks (DNNs) are increasingly deployed on edge devices, optimizing models for constrained computational resources is critical. Existing auto-pruning methods face challenges due to the diversity of DNN models, various…

Artificial Intelligence · Computer Science 2026-04-21 Lixian Jing, Jianpeng Qi, Junyu Dong, Yanwei Yu

Structured Directional Pruning via Perturbation Orthogonal Projection

Structured pruning is an effective compression technique to reduce the computation of neural networks, which is usually achieved by adding perturbations to reduce network parameters at the cost of slightly increasing training loss. A more…

Machine Learning · Computer Science 2021-10-22 Yinchuan Li, Xiaofeng Liu, Yunfeng Shao, Qing Wang, Yanhui Geng