Related papers: X-Pruner: eXplainable Pruning for Vision Transform…

Pruning by Explaining: A Novel Criterion for Deep Neural Network Pruning

The success of convolutional neural networks (CNNs) in various applications is accompanied by a significant increase in computation and parameter storage costs. Recent efforts to reduce these overheads involve pruning and compressing the…

Machine Learning · Computer Science 2021-03-15 Seul-Ki Yeom, Philipp Seegerer, Sebastian Lapuschkin, Alexander Binder, Simon Wiedemann, Klaus-Robert Müller, Wojciech Samek

Vision Transformer Pruning

Vision transformer has achieved competitive performance on a variety of computer vision applications. However, their storage, run-time memory, and computational demands are hindering the deployment to mobile devices. Here we present a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-17 Mingjian Zhu, Yehui Tang, Kai Han

Interpretability-Aware Pruning for Efficient Medical Image Analysis

Deep learning has driven significant advances in medical image analysis, yet its adoption in clinical practice remains constrained by the large size and lack of transparency in modern models. Advances in interpretability techniques such as…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Nikita Malik, Pratinav Seth, Neeraj Kumar Singh, Chintan Chitroda, Vinay Kumar Sankarapu

ViNNPruner: Visual Interactive Pruning for Deep Learning

Neural networks grow vastly in size to tackle more sophisticated tasks. In many cases, such large networks are not deployable on particular hardware and need to be reduced in size. Pruning techniques help to shrink deep neural networks to…

Machine Learning · Computer Science 2022-06-01 Udo Schlegel, Samuel Schiegg, Daniel A. Keim

Differentiable Mask for Pruning Convolutional and Recurrent Networks

Pruning is one of the most effective model reduction techniques. Deep networks require massive computation and such models need to be compressed to bring them on edge devices. Most existing pruning techniques are focused on vision-based…

Machine Learning · Computer Science 2020-04-30 Ramchalam Kinattinkara Ramakrishnan, Eyyüb Sari, Vahid Partovi Nia

Isomorphic Pruning for Vision Models

Structured pruning reduces the computational overhead of deep neural networks by removing redundant sub-structures. However, assessing the relative importance of different sub-structures remains a significant challenge, particularly in…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Gongfan Fang, Xinyin Ma, Michael Bi Mi, Xinchao Wang

Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers

To solve ever more complex problems, Deep Neural Networks are scaled to billions of parameters, leading to huge computational costs. An effective approach to reduce computational requirements and increase efficiency is to prune unnecessary…

Artificial Intelligence · Computer Science 2024-10-24 Sayed Mohammad Vakilzadeh Hatefi, Maximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin

C-SWAP: Explainability-Aware Structured Pruning for Efficient Neural Networks Compression

Neural network compression has gained increasing attention in recent years, particularly in computer vision applications, where the need for model reduction is crucial for overcoming deployment constraints. Pruning is a widely used…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Baptiste Bauvin, Loïc Baret, Ola Ahmad

Multi-Dimensional Model Compression of Vision Transformer

Vision transformers (ViT) have recently attracted considerable attentions, but the huge computational cost remains an issue for practical deployment. Previous ViT pruning methods tend to prune the model along one dimension solely, which may…

Computer Vision and Pattern Recognition · Computer Science 2022-01-04 Zejiang Hou, Sun-Yuan Kung

Utilizing Explainable AI for Quantization and Pruning of Deep Neural Networks

For many applications, utilizing DNNs (Deep Neural Networks) requires their implementation on a target architecture in an optimized manner concerning energy consumption, memory requirement, throughput, etc. DNN compression is used to reduce…

Computer Vision and Pattern Recognition · Computer Science 2020-08-21 Muhammad Sabih, Frank Hannig, Juergen Teich

A Unified Pruning Framework for Vision Transformers

Recently, vision transformer (ViT) and its variants have achieved promising performances in various computer vision tasks. Yet the high computational costs and training data requirements of ViTs limit their application in…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Hao Yu, Jianxin Wu

CNNPruner: Pruning Convolutional Neural Networks with Visual Analytics

Convolutional neural networks (CNNs) have demonstrated extraordinarily good performance in many computer vision tasks. The increasing size of CNN models, however, prevents them from being widely deployed to devices with limited…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Guan Li, Junpeng Wang, Han-Wei Shen, Kaixin Chen, Guihua Shan, Zhonghua Lu

Explainability-Inspired Layer-Wise Pruning of Deep Neural Networks for Efficient Object Detection

Deep neural networks (DNNs) have achieved remarkable success in object detection tasks, but their increasing complexity poses significant challenges for deployment on resource-constrained platforms. While model compression techniques such…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Abhinav Shukla, Nachiket Tapas

A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting

The current landscape in time-series forecasting is dominated by Transformer-based models. Their high parameter count and corresponding demand in computational resources pose a challenge to real-world deployment, especially for commercial…

Machine Learning · Computer Science 2024-12-18 Nicholas Kiefer, Arvid Weyrauch, Muhammed Öz, Achim Streit, Markus Götz, Charlotte Debus

Learnable Sparsity for Vision Generative Models

Diffusion models have achieved impressive advancements in various vision tasks. However, these gains often rely on increasing model size, which escalates computational complexity and memory demands, complicating deployment, raising…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Yang Zhang, Er Jin, Wenzhong Liang, Yanfei Dong, Ashkan Khakzar, Philip Torr, Johannes Stegmaier, Kenji Kawaguchi

EvoPruneDeepTL: An Evolutionary Pruning Model for Transfer Learning based Deep Neural Networks

In recent years, Deep Learning models have shown a great performance in complex optimization problems. They generally require large training datasets, which is a limitation in most practical cases. Transfer learning allows importing the…

Neural and Evolutionary Computing · Computer Science 2024-02-06 Javier Poyatos, Daniel Molina, Aritz. D. Martinez, Javier Del Ser, Francisco Herrera

Searching Intrinsic Dimensions of Vision Transformers

It has been shown by many researchers that transformers perform as well as convolutional neural networks in many computer vision tasks. Meanwhile, the large computational costs of its attention module hinder further studies and applications…

Computer Vision and Pattern Recognition · Computer Science 2022-04-19 Fanghui Xue, Biao Yang, Yingyong Qi, Jack Xin

MultiPruner: Balanced Structure Removal in Foundation Models

Recently, state-of-the-art approaches for pruning large pre-trained models (LPMs) have demonstrated that the training-free removal of non-critical residual blocks in Transformers is viable for reducing model size, achieving results that…

Machine Learning · Computer Science 2025-01-20 J. Pablo Muñoz, Jinjie Yuan, Nilesh Jain

SPViT: Enabling Faster Vision Transformers via Soft Token Pruning

Recently, Vision Transformer (ViT) has continuously established new milestones in the computer vision field, while the high computation and memory cost makes its propagation in industrial production difficult. Pruning, a traditional model…

Computer Vision and Pattern Recognition · Computer Science 2022-09-22 Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Mengshu Sun, Wei Niu, Xuan Shen, Geng Yuan, Bin Ren, Minghai Qin, Hao Tang, Yanzhi Wang

E$^3$-Pruner: Towards Efficient, Economical, and Effective Layer Pruning for Large Language Models

With the increasing size of large language models, layer pruning has gained increased attention as a hardware-friendly approach for model compression. However, existing layer pruning methods struggle to simultaneously address key practical…

Computation and Language · Computer Science 2025-11-24 Tao Yuan, Haoli Bai, Yinfei Pan, Xuyang Cao, Tianyu Zhang, Lu Hou, Ting Hu, Xianzhi Yu