Related papers: Streamlining Tensor and Network Pruning in PyTorch

200 papers

Neural network pruning is a popular technique used to reduce the inference costs of modern, potentially overparameterized, networks. Starting from a pre-trained network, the process is as follows: remove redundant parameters, retrain, and…

Machine Learning · Computer Science 2021-03-05 Lucas Liebenwein, Cenk Baykal, Brandon Carter, David Gifford, Daniela Rus

To solve ever more complex problems, Deep Neural Networks are scaled to billions of parameters, leading to huge computational costs. An effective approach to reduce computational requirements and increase efficiency is to prune unnecessary…

Convolutional Neural Networks (CNNs) have achieved significant breakthroughs in various fields. However, these advancements have led to a substantial increase in the complexity and size of these networks. This poses a challenge when…

Machine Learning · Computer Science 2025-09-11 Ahmed Sadaqa, Di Liu

Convolutional Neural Networks (CNNs) suffer from different issues, such as computational complexity and the number of parameters. In recent years pruning techniques are employed to reduce the number of operations and model size in CNNs.…

Computer Vision and Pattern Recognition · Computer Science 2020-01-14 Morteza Mousa Pasandi, Mohsen Hajabdollahi, Nader Karimi, Shadrokh Samavi

With the general trend of increasing Convolutional Neural Network (CNN) model sizes, model compression and acceleration techniques have become critical for the deployment of these models on edge devices. In this paper, we provide a…

Machine Learning · Computer Science 2020-05-12 Jiayi Liu, Samarth Tripathi, Unmesh Kurup, Mohak Shah

Model pruning seeks to induce sparsity in a deep neural network's various connection matrices, thereby reducing the number of nonzero-valued parameters in the model. Recent reports (Han et al., 2015; Narang et al., 2017) prune deep networks…

Machine Learning · Statistics 2017-11-15 Michael Zhu, Suyog Gupta

The rapid development of large-scale deep learning models questions the affordability of hardware platforms, which necessitates the pruning to reduce their computational and memory footprints. Sparse neural networks as the product, have…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Can Jin, Tianjin Huang, Yihua Zhang, Mykola Pechenizkiy, Sijia Liu, Shiwei Liu, Tianlong Chen

Structured pruning is a popular method for compressing a neural network: given a large trained network, one alternates between removing channel connections and fine-tuning; reducing the overall width of the network. However, the efficacy of…

Machine Learning · Statistics 2019-06-10 Elliot J. Crowley, Jack Turner, Amos Storkey, Michael O'Boyle

Neural networks have emerged as a powerful tool for solving complex tasks across various domains, but their increasing size and computational requirements have posed significant challenges in deploying them on resource-constrained devices.…

Machine Learning · Computer Science 2023-12-05 Evan Dogariu

Over the last century, deep learning models have become the state-of-the-art for solving complex computer vision problems. These modern computer vision models have millions of parameters, which presents two major challenges: (1) the…

Computer Vision and Pattern Recognition · Computer Science 2025-01-15 Florian Merkle, David Weber, Pascal Schöttle, Stephan Schlögl, Martin Nocker

Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage resources. To enable the deployment of modern models on resource-constrained…

Machine Learning · Computer Science 2024-08-12 Hongrong Cheng, Miao Zhang, Javen Qinfeng Shi

Network pruning is an important research field aiming at reducing computational costs of neural networks. Conventional approaches follow a fixed paradigm which first trains a large and redundant network, and then determines which units…

Computer Vision and Pattern Recognition · Computer Science 2019-09-30 Yulong Wang, Xiaolu Zhang, Lingxi Xie, Jun Zhou, Hang Su, Bo Zhang, Xiaolin Hu

Pruning is a compression method which aims to improve the efficiency of neural networks by reducing their number of parameters while maintaining a good performance, thus enhancing the performance-to-cost ratio in nontrivial ways. Of…

Neural and Evolutionary Computing · Computer Science 2023-09-25 Hugo Tessier, Ghouti Boukli Hacene, Vincent Gripon

When deploying pre-trained neural network models in real-world applications, model consumers often encounter resource-constraint platforms such as mobile and smart devices. They typically use the pruning technique to reduce the size and…

Machine Learning · Computer Science 2025-06-19 Mark Huasong Meng, Guangdong Bai, Sin Gee Teo, Jin Song Dong

Artificial neural network pruning is a method in which artificial neural network sizes can be reduced while attempting to preserve the predicting capabilities of the network. This is done to make the model smaller or faster during inference…

Machine Learning · Computer Science 2025-05-21 Alexandre Broggi, Nathaniel Bastian, Lance Fiondella, Gokhan Kul

Neural networks grow vastly in size to tackle more sophisticated tasks. In many cases, such large networks are not deployable on particular hardware and need to be reduced in size. Pruning techniques help to shrink deep neural networks to…

Machine Learning · Computer Science 2022-06-01 Udo Schlegel, Samuel Schiegg, Daniel A. Keim

Transformer models have revolutionized natural language processing with their unparalleled ability to grasp complex contextual relationships. However, the vast number of parameters in these models has raised concerns regarding computational…

Machine Learning · Computer Science 2023-10-10 Sia Gholami, Marwan Omar

Image restoration tasks have witnessed great performance improvement in recent years by developing large deep models. Despite the outstanding performance, the heavy computation demanded by the deep models has restricted the application of…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Junghun Oh, Heewon Kim, Seungjun Nah, Cheeun Hong, Jonghyun Choi, Kyoung Mu Lee

Network pruning is one of the most dominant methods for reducing the heavy inference cost of deep neural networks. Existing methods often iteratively prune networks to attain high compression ratio without incurring significant loss in…

Computer Vision and Pattern Recognition · Computer Science 2020-08-17 Duong H. Le, Trung-Nhan Vo, Nam Thoai

In this work we present a method to improve the pruning step of the current state-of-the-art methodology to compress neural networks. The novelty of the proposed pruning technique is in its differentiability, which allows pruning to be…

Computer Vision and Pattern Recognition · Computer Science 2019-01-08 Franco Manessi, Alessandro Rozza, Simone Bianco, Paolo Napoletano, Raimondo Schettini