Related papers: DiSparse: Disentangled Sparsification for Multitas…

Structured Pruning for Multi-Task Deep Neural Networks

Although multi-task deep neural network (DNN) models have computation and storage benefits over individual single-task DNN models, they can be further optimized via model compression. Numerous structured pruning methods are already…

Machine Learning · Computer Science 2023-04-17 Siddhant Garg , Lijun Zhang , Hui Guan

To prune, or not to prune: exploring the efficacy of pruning for model compression

Model pruning seeks to induce sparsity in a deep neural network's various connection matrices, thereby reducing the number of nonzero-valued parameters in the model. Recent reports (Han et al., 2015; Narang et al., 2017) prune deep networks…

Machine Learning · Statistics 2017-11-15 Michael Zhu , Suyog Gupta

Balancing Act: Constraining Disparate Impact in Sparse Models

Model pruning is a popular approach to enable the deployment of large deep learning models on edge devices with restricted computational or storage capacities. Although sparse models achieve performance comparable to that of their dense…

Machine Learning · Computer Science 2024-03-11 Meraj Hashemizadeh , Juan Ramirez , Rohan Sukumaran , Golnoosh Farnadi , Simon Lacoste-Julien , Jose Gallego-Posada

Dynamic Model Pruning with Feedback

Deep neural networks often have millions of parameters. This can hinder their deployment to low-end devices, not only due to high memory requirements but also because of increased latency at inference. We propose a novel model compression…

Machine Learning · Computer Science 2020-06-15 Tao Lin , Sebastian U. Stich , Luis Barba , Daniil Dmitriev , Martin Jaggi

Differentiable Mask for Pruning Convolutional and Recurrent Networks

Pruning is one of the most effective model reduction techniques. Deep networks require massive computation and such models need to be compressed to bring them on edge devices. Most existing pruning techniques are focused on vision-based…

Machine Learning · Computer Science 2020-04-30 Ramchalam Kinattinkara Ramakrishnan , Eyyüb Sari , Vahid Partovi Nia

Distral: Robust Multitask Reinforcement Learning

Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data efficiency is multitask learning with shared neural network…

Machine Learning · Computer Science 2017-07-14 Yee Whye Teh , Victor Bapst , Wojciech Marian Czarnecki , John Quan , James Kirkpatrick , Raia Hadsell , Nicolas Heess , Razvan Pascanu

One-Shot Pruning for Fast-adapting Pre-trained Models on Devices

Large-scale pre-trained models have been remarkably successful in resolving downstream tasks. Nonetheless, deploying these models on low-capability devices still requires an effective approach, such as model pruning. However, pruning the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Haiyan Zhao , Guodong Long

Learnable Sparsity for Vision Generative Models

Diffusion models have achieved impressive advancements in various vision tasks. However, these gains often rely on increasing model size, which escalates computational complexity and memory demands, complicating deployment, raising…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Yang Zhang , Er Jin , Wenzhong Liang , Yanfei Dong , Ashkan Khakzar , Philip Torr , Johannes Stegmaier , Kenji Kawaguchi

Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a Survey

State-of-the-art deep learning models have a parameter count that reaches into the billions. Training, storing and transferring such models is energy and time consuming, thus costly. A big part of these costs is caused by training the…

Machine Learning · Computer Science 2023-05-26 Paul Wimmer , Jens Mehnert , Alexandru Paul Condurache

Group channel pruning and spatial attention distilling for object detection

Due to the over-parameterization of neural networks, many model compression methods based on pruning and quantization have emerged. They are remarkable in reducing the size, parameter number, and computational complexity of the model.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-05 Yun Chu , Pu Li , Yong Bai , Zhuhua Hu , Yongqing Chen , Jiafeng Lu

Task-specific Compression for Multi-task Language Models using Attribution-based Pruning

Multi-task language models show outstanding performance for various natural language understanding tasks with only a single model. However, these language models utilize an unnecessarily large number of model parameters, even when used only…

Computation and Language · Computer Science 2023-02-14 Nakyeong Yang , Yunah Jang , Hwanhee Lee , Seohyeong Jung , Kyomin Jung

Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers

We present a novel network pruning algorithm called Dynamic Sparse Training that can jointly find the optimal network parameters and sparse network structure in a unified optimization process with trainable pruning thresholds. These…

Machine Learning · Computer Science 2020-05-15 Junjie Liu , Zhe Xu , Runbin Shi , Ray C. C. Cheung , Hayden K. H. So

Transfer Learning for Structured Pruning under Limited Task Data

Large, pre-trained models are problematic to use in resource constrained applications. Fortunately, task-aware structured pruning methods offer a solution. These approaches reduce model size by dropping structural units like layers and…

Computation and Language · Computer Science 2023-11-14 Lucio Dery , David Grangier , Awni Hannun

Structural Pruning for Diffusion Models

Generative modeling has recently undergone remarkable advancements, primarily propelled by the transformative implications of Diffusion Probabilistic Models (DPMs). The impressive capability of these models, however, often entails…

Machine Learning · Computer Science 2023-10-03 Gongfan Fang , Xinyin Ma , Xinchao Wang

Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks

Deep networks are typically trained with many more parameters than the size of the training dataset. Recent empirical evidence indicates that the practice of overparameterization not only benefits training large models, but also assists -…

Machine Learning · Computer Science 2020-12-17 Xiangyu Chang , Yingcong Li , Samet Oymak , Christos Thrampoulidis

CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices

Multi-task learning has garnered widespread attention in the industry due to its efficient data utilization and strong generalization capabilities, making it particularly suitable for providing high-quality intelligent services to users.…

Machine Learning · Computer Science 2026-01-06 Jingxuan Zhou , Weidong Bao , Ji Wang , Zhengyi Zhong

Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection

It is widely acknowledged that large and sparse models have higher accuracy than small and dense models under the same model size constraints. This motivates us to train a large model and then remove its redundant neurons or weights by…

Computer Vision and Pattern Recognition · Computer Science 2024-01-12 Jianwei Li , Weizhi Gao , Qi Lei , Dongkuan Xu

Deep Neural Compression Via Concurrent Pruning and Self-Distillation

Pruning aims to reduce the number of parameters while maintaining performance close to the original network. This work proposes a novel \emph{self-distillation} based pruning strategy, whereby the representational similarity between the…

Machine Learning · Computer Science 2021-10-01 James O' Neill , Sourav Dutta , Haytham Assem

Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning

Although disentangled representations are often said to be beneficial for downstream tasks, current empirical and theoretical understanding is limited. In this work, we provide evidence that disentangled representations coupled with sparse…

Machine Learning · Computer Science 2023-06-08 Sébastien Lachapelle , Tristan Deleu , Divyat Mahajan , Ioannis Mitliagkas , Yoshua Bengio , Simon Lacoste-Julien , Quentin Bertrand

Model Sparsity Can Simplify Machine Unlearning

In response to recent data regulation requirements, machine unlearning (MU) has emerged as a critical process to remove the influence of specific examples from a given model. Although exact unlearning can be achieved through complete model…

Machine Learning · Computer Science 2024-01-30 Jinghan Jia , Jiancheng Liu , Parikshit Ram , Yuguang Yao , Gaowen Liu , Yang Liu , Pranay Sharma , Sijia Liu