Related papers: Vision Transformer Pruning

Vision Transformer Pruning Via Matrix Decomposition

This is a further development of Vision Transformer Pruning via matrix decomposition. The purpose of the Vision Transformer Pruning is to prune the dimension of the linear projection of the dataset by learning their associated importance…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Tianyi Sun

Searching Intrinsic Dimensions of Vision Transformers

It has been shown by many researchers that transformers perform as well as convolutional neural networks in many computer vision tasks. Meanwhile, the large computational costs of its attention module hinder further studies and applications…

Computer Vision and Pattern Recognition · Computer Science 2022-04-19 Fanghui Xue , Biao Yang , Yingyong Qi , Jack Xin

Multi-Dimensional Model Compression of Vision Transformer

Vision transformers (ViT) have recently attracted considerable attentions, but the huge computational cost remains an issue for practical deployment. Previous ViT pruning methods tend to prune the model along one dimension solely, which may…

Computer Vision and Pattern Recognition · Computer Science 2022-01-04 Zejiang Hou , Sun-Yuan Kung

Vision Transformer Compression with Structured Pruning and Low Rank Approximation

Transformer architecture has gained popularity due to its ability to scale with large dataset. Consequently, there is a need to reduce the model size and latency, especially for on-device deployment. We focus on vision transformer proposed…

Computer Vision and Pattern Recognition · Computer Science 2022-03-28 Ankur Kumar

Exploring Vision Neural Network Pruning via Screening Methodology

The remarkable performance of modern deep neural networks (DNNs) is largely driven by their massive scale, often comprising tens to hundreds of millions-or even billions-of parameters. However, such a scale incurs substantial storage and…

Machine Learning · Computer Science 2026-05-01 Mingyuan Wang , Yangzi Guo , Sida Liu , Yuhang Liu

Sparse then Prune: Toward Efficient Vision Transformers

The Vision Transformer architecture is a deep learning model inspired by the success of the Transformer model in Natural Language Processing. However, the self-attention mechanism, large number of parameters, and the requirement for a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Yogi Prasetyo , Novanto Yudistira , Agus Wahyu Widodo

ViNNPruner: Visual Interactive Pruning for Deep Learning

Neural networks grow vastly in size to tackle more sophisticated tasks. In many cases, such large networks are not deployable on particular hardware and need to be reduced in size. Pruning techniques help to shrink deep neural networks to…

Machine Learning · Computer Science 2022-06-01 Udo Schlegel , Samuel Schiegg , Daniel A. Keim

CNNPruner: Pruning Convolutional Neural Networks with Visual Analytics

Convolutional neural networks (CNNs) have demonstrated extraordinarily good performance in many computer vision tasks. The increasing size of CNN models, however, prevents them from being widely deployed to devices with limited…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Guan Li , Junpeng Wang , Han-Wei Shen , Kaixin Chen , Guihua Shan , Zhonghua Lu

To Filter Prune, or to Layer Prune, That Is The Question

Recent advances in pruning of neural networks have made it possible to remove a large number of filters or weights without any perceptible drop in accuracy. The number of parameters and that of FLOPs are usually the reported metrics to…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Sara Elkerdawy , Mostafa Elhoushi , Abhineet Singh , Hong Zhang , Nilanjan Ray

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Visual token compression is critical for Large Vision-Language Models (LVLMs) to efficiently process high-resolution inputs. Existing methods that typically adopt fixed compression ratios cannot adapt to scenes of varying complexity, often…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Quan-Sheng Zeng , Yunheng Li , Qilong Wang , Peng-Tao Jiang , Zuxuan Wu , Ming-Ming Cheng , Qibin Hou

Patch Slimming for Efficient Vision Transformers

This paper studies the efficiency problem for visual transformers by excavating redundant calculation in given networks. The recent transformer architecture has demonstrated its effectiveness for achieving excellent performance on a series…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Yehui Tang , Kai Han , Yunhe Wang , Chang Xu , Jianyuan Guo , Chao Xu , Dacheng Tao

Three things everyone should know about Vision Transformers

After their initial success in natural language processing, transformer architectures have rapidly gained traction in computer vision, providing state-of-the-art results for tasks such as image classification, detection, segmentation, and…

Computer Vision and Pattern Recognition · Computer Science 2022-03-21 Hugo Touvron , Matthieu Cord , Alaaeldin El-Nouby , Jakob Verbeek , Hervé Jégou

CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction

Vision transformer (ViT) has achieved competitive accuracy on a variety of computer vision applications, but its computational cost impedes the deployment on resource-limited mobile devices. We explore the sparsity in ViT and observe that…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Zhuoran Song , Yihong Xu , Zhezhi He , Li Jiang , Naifeng Jing , Xiaoyao Liang

When Does Pruning Benefit Vision Representations?

Pruning is widely used to reduce the complexity of deep learning models, but its effects on interpretability and representation learning remain poorly understood. This paper investigates how pruning influences vision models across three key…

Computer Vision and Pattern Recognition · Computer Science 2025-07-09 Enrico Cassano , Riccardo Renzulli , Andrea Bragagnolo , Marco Grangetto

Revisiting Token Pruning for Object Detection and Instance Segmentation

Vision Transformers (ViTs) have shown impressive performance in computer vision, but their high computational cost, quadratic in the number of tokens, limits their adoption in computation-constrained applications. However, this large number…

Computer Vision and Pattern Recognition · Computer Science 2023-12-14 Yifei Liu , Mathias Gehrig , Nico Messikommer , Marco Cannici , Davide Scaramuzza

Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

The rapid development of large-scale deep learning models questions the affordability of hardware platforms, which necessitates the pruning to reduce their computational and memory footprints. Sparse neural networks as the product, have…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Can Jin , Tianjin Huang , Yihua Zhang , Mykola Pechenizkiy , Sijia Liu , Shiwei Liu , Tianlong Chen

Pruning by Block Benefit: Exploring the Properties of Vision Transformer Blocks during Domain Adaptation

Vision Transformer have set new benchmarks in several tasks, but these models come with the lack of high computational costs which makes them impractical for resource limited hardware. Network pruning reduces the computational complexity by…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Patrick Glandorf , Bodo Rosenhahn

EAPruning: Evolutionary Pruning for Vision Transformers and CNNs

Structured pruning greatly eases the deployment of large neural networks in resource-constrained environments. However, current methods either involve strong domain expertise, require extra hyperparameter tuning, or are restricted only to a…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Qingyuan Li , Bo Zhang , Xiangxiang Chu

Adaptive MLP Pruning for Large Vision Transformers

Large vision transformers present impressive scalability, as their performance can be well improved with increased model capacity. Nevertheless, their cumbersome parameters results in exorbitant computational and memory demands. By…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Chengchao Shen

X-Pruner: eXplainable Pruning for Vision Transformers

Recently vision transformer models have become prominent models for a range of tasks. These models, however, usually suffer from intensive computational costs and heavy memory requirements, making them impractical for deployment on edge…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Lu Yu , Wei Xiang