Related papers: Smallify: Learning Network Size while Training

Federated Progressive Sparsification (Purge, Merge, Tune)+

To improve federated training of neural networks, we develop FedSparsify, a sparsification strategy based on progressive weight magnitude pruning. Our method has several benefits. First, since the size of the network becomes increasingly…

Machine Learning · Computer Science 2023-05-17 Dimitris Stripelis , Umang Gupta , Greg Ver Steeg , Jose Luis Ambite

Ps and Qs: Quantization-aware pruning for efficient low latency neural network inference

Efficient machine learning implementations optimized for inference in hardware have wide-ranging benefits, depending on the application, from lower inference latency to higher data throughput and reduced energy consumption. Two popular…

Machine Learning · Computer Science 2021-07-21 Benjamin Hawks , Javier Duarte , Nicholas J. Fraser , Alessandro Pappalardo , Nhan Tran , Yaman Umuroglu

Neural Network Compression for Reinforcement Learning Tasks

In real applications of Reinforcement Learning (RL), such as robotics, low latency and energy efficient inference is very desired. The use of sparsity and pruning for optimizing Neural Network inference, and particularly to improve energy…

Machine Learning · Computer Science 2024-05-14 Dmitry A. Ivanov , Denis A. Larionov , Oleg V. Maslennikov , Vladimir V. Voevodin

Towards Sparsified Federated Neuroimaging Models via Weight Pruning

Federated training of large deep neural networks can often be restrictive due to the increasing costs of communicating the updates with increasing model sizes. Various model pruning techniques have been designed in centralized settings to…

Machine Learning · Computer Science 2022-08-25 Dimitris Stripelis , Umang Gupta , Nikhil Dhinagar , Greg Ver Steeg , Paul Thompson , José Luis Ambite

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

The growing energy and performance costs of deep learning have driven the community to reduce the size of neural networks by selectively pruning components. Similarly to their biological counterparts, sparse networks generalize just as…

Machine Learning · Computer Science 2021-02-02 Torsten Hoefler , Dan Alistarh , Tal Ben-Nun , Nikoli Dryden , Alexandra Peste

Automated Pruning for Deep Neural Network Compression

In this work we present a method to improve the pruning step of the current state-of-the-art methodology to compress neural networks. The novelty of the proposed pruning technique is in its differentiability, which allows pruning to be…

Computer Vision and Pattern Recognition · Computer Science 2019-01-08 Franco Manessi , Alessandro Rozza , Simone Bianco , Paolo Napoletano , Raimondo Schettini

Exploring Sparsity in Recurrent Neural Networks

Recurrent Neural Networks (RNN) are widely used to solve a variety of problems and as the quantity of data and the amount of available compute have increased, so have model sizes. The number of parameters in recent state-of-the-art networks…

Machine Learning · Computer Science 2017-11-08 Sharan Narang , Erich Elsen , Gregory Diamos , Shubho Sengupta

SNIP: Single-shot Network Pruning based on Connection Sensitivity

Pruning large neural networks while maintaining their performance is often desirable due to the reduced space and time complexity. In existing methods, pruning is done within an iterative optimization procedure with either heuristically…

Computer Vision and Pattern Recognition · Computer Science 2019-02-26 Namhoon Lee , Thalaiyasingam Ajanthan , Philip H. S. Torr

Detecting Dead Weights and Units in Neural Networks

Deep Neural Networks are highly over-parameterized and the size of the neural networks can be reduced significantly after training without any decrease in performance. One can clearly see this phenomenon in a wide range of architectures…

Machine Learning · Computer Science 2018-06-19 Utku Evci

Reducing the Training Time of Neural Networks by Partitioning

This paper presents a new method for pre-training neural networks that can decrease the total training time for a neural network while maintaining the final performance, which motivates its use on deep neural networks. By partitioning the…

Neural and Evolutionary Computing · Computer Science 2016-01-05 Conrado S. Miranda , Fernando J. Von Zuben

Finding the Optimal Network Depth in Classification Tasks

We develop a fast end-to-end method for training lightweight neural networks using multiple classifier heads. By allowing the model to determine the importance of each head and rewarding the choice of a single shallow classifier, we are…

Machine Learning · Computer Science 2020-04-20 Bartosz Wójcik , Maciej Wołczyk , Klaudia Bałazy , Jacek Tabor

Robustness to Pruning Predicts Generalization in Deep Neural Networks

Existing generalization measures that aim to capture a model's simplicity based on parameter counts or norms fail to explain generalization in overparameterized deep neural networks. In this paper, we introduce a new, theoretically…

Machine Learning · Computer Science 2021-03-11 Lorenz Kuhn , Clare Lyle , Aidan N. Gomez , Jonas Rothfuss , Yarin Gal

Training Sparse Neural Networks using Compressed Sensing

Pruning the weights of neural networks is an effective and widely-used technique for reducing model size and inference complexity. We develop and test a novel method based on compressed sensing which combines the pruning and training into a…

Computer Vision and Pattern Recognition · Computer Science 2021-04-08 Jonathan W. Siegel , Jianhong Chen , Pengchuan Zhang , Jinchao Xu

Training Deep Neural Networks with Joint Quantization and Pruning of Weights and Activations

Quantization and pruning are core techniques used to reduce the inference costs of deep neural networks. State-of-the-art quantization techniques are currently applied to both the weights and activations; however, pruning is most often…

Machine Learning · Computer Science 2021-11-02 Xinyu Zhang , Ian Colbert , Ken Kreutz-Delgado , Srinjoy Das

Efficient and Sparse Neural Networks by Pruning Weights in a Multiobjective Learning Approach

Overparameterization and overfitting are common concerns when designing and training deep neural networks, that are often counteracted by pruning and regularization strategies. However, these strategies remain secondary to most learning…

Machine Learning · Computer Science 2020-09-01 Malena Reiners , Kathrin Klamroth , Michael Stiglmayr

Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance

Recent advancements have scaled neural networks to unprecedented sizes, achieving remarkable performance across a wide range of tasks. However, deploying these large-scale models on resource-constrained devices poses significant challenges…

Machine Learning · Computer Science 2024-10-22 Mostafa Hussien , Mahmoud Afifi , Kim Khoa Nguyen , Mohamed Cheriet

Neural Network Pruning Through Constrained Reinforcement Learning

Network pruning reduces the size of neural networks by removing (pruning) neurons such that the performance drop is minimal. Traditional pruning approaches focus on designing metrics to quantify the usefulness of a neuron which is often…

Computer Vision and Pattern Recognition · Computer Science 2021-11-01 Shehryar Malik , Muhammad Umair Haider , Omer Iqbal , Murtaza Taj

Structured Pruning of Neural Networks with Budget-Aware Regularization

Pruning methods have shown to be effective at reducing the size of deep neural networks while keeping accuracy almost intact. Among the most effective methods are those that prune a network while training it with a sparsity prior loss and…

Neural and Evolutionary Computing · Computer Science 2019-12-20 Carl Lemaire , Andrew Achkar , Pierre-Marc Jodoin

Activation Density driven Energy-Efficient Pruning in Training

Neural network pruning with suitable retraining can yield networks with considerably fewer parameters than the original with comparable degrees of accuracy. Typical pruning methods require large, fully trained networks as a starting point…

Machine Learning · Computer Science 2020-10-13 Timothy Foldy-Porto , Yeshwanth Venkatesha , Priyadarshini Panda

Net-Trim: Convex Pruning of Deep Neural Networks with Performance Guarantee

We introduce and analyze a new technique for model reduction for deep neural networks. While large networks are theoretically capable of learning arbitrarily complex models, overfitting and model redundancy negatively affects the prediction…

Machine Learning · Computer Science 2017-11-27 Alireza Aghasi , Afshin Abdi , Nam Nguyen , Justin Romberg