English
Related papers

Related papers: DiSparse: Disentangled Sparsification for Multitas…

200 papers

Although multi-task deep neural network (DNN) models have computation and storage benefits over individual single-task DNN models, they can be further optimized via model compression. Numerous structured pruning methods are already…

Machine Learning · Computer Science 2023-04-17 Siddhant Garg , Lijun Zhang , Hui Guan

Model pruning seeks to induce sparsity in a deep neural network's various connection matrices, thereby reducing the number of nonzero-valued parameters in the model. Recent reports (Han et al., 2015; Narang et al., 2017) prune deep networks…

Machine Learning · Statistics 2017-11-15 Michael Zhu , Suyog Gupta

Model pruning is a popular approach to enable the deployment of large deep learning models on edge devices with restricted computational or storage capacities. Although sparse models achieve performance comparable to that of their dense…

Deep neural networks often have millions of parameters. This can hinder their deployment to low-end devices, not only due to high memory requirements but also because of increased latency at inference. We propose a novel model compression…

Machine Learning · Computer Science 2020-06-15 Tao Lin , Sebastian U. Stich , Luis Barba , Daniil Dmitriev , Martin Jaggi

Pruning is one of the most effective model reduction techniques. Deep networks require massive computation and such models need to be compressed to bring them on edge devices. Most existing pruning techniques are focused on vision-based…

Machine Learning · Computer Science 2020-04-30 Ramchalam Kinattinkara Ramakrishnan , Eyyüb Sari , Vahid Partovi Nia

Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data efficiency is multitask learning with shared neural network…

Large-scale pre-trained models have been remarkably successful in resolving downstream tasks. Nonetheless, deploying these models on low-capability devices still requires an effective approach, such as model pruning. However, pruning the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Haiyan Zhao , Guodong Long

Diffusion models have achieved impressive advancements in various vision tasks. However, these gains often rely on increasing model size, which escalates computational complexity and memory demands, complicating deployment, raising…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Yang Zhang , Er Jin , Wenzhong Liang , Yanfei Dong , Ashkan Khakzar , Philip Torr , Johannes Stegmaier , Kenji Kawaguchi

State-of-the-art deep learning models have a parameter count that reaches into the billions. Training, storing and transferring such models is energy and time consuming, thus costly. A big part of these costs is caused by training the…

Machine Learning · Computer Science 2023-05-26 Paul Wimmer , Jens Mehnert , Alexandru Paul Condurache

Due to the over-parameterization of neural networks, many model compression methods based on pruning and quantization have emerged. They are remarkable in reducing the size, parameter number, and computational complexity of the model.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-05 Yun Chu , Pu Li , Yong Bai , Zhuhua Hu , Yongqing Chen , Jiafeng Lu

Multi-task language models show outstanding performance for various natural language understanding tasks with only a single model. However, these language models utilize an unnecessarily large number of model parameters, even when used only…

Computation and Language · Computer Science 2023-02-14 Nakyeong Yang , Yunah Jang , Hwanhee Lee , Seohyeong Jung , Kyomin Jung

We present a novel network pruning algorithm called Dynamic Sparse Training that can jointly find the optimal network parameters and sparse network structure in a unified optimization process with trainable pruning thresholds. These…

Machine Learning · Computer Science 2020-05-15 Junjie Liu , Zhe Xu , Runbin Shi , Ray C. C. Cheung , Hayden K. H. So

Large, pre-trained models are problematic to use in resource constrained applications. Fortunately, task-aware structured pruning methods offer a solution. These approaches reduce model size by dropping structural units like layers and…

Computation and Language · Computer Science 2023-11-14 Lucio Dery , David Grangier , Awni Hannun

Generative modeling has recently undergone remarkable advancements, primarily propelled by the transformative implications of Diffusion Probabilistic Models (DPMs). The impressive capability of these models, however, often entails…

Machine Learning · Computer Science 2023-10-03 Gongfan Fang , Xinyin Ma , Xinchao Wang

Deep networks are typically trained with many more parameters than the size of the training dataset. Recent empirical evidence indicates that the practice of overparameterization not only benefits training large models, but also assists -…

Machine Learning · Computer Science 2020-12-17 Xiangyu Chang , Yingcong Li , Samet Oymak , Christos Thrampoulidis

Multi-task learning has garnered widespread attention in the industry due to its efficient data utilization and strong generalization capabilities, making it particularly suitable for providing high-quality intelligent services to users.…

Machine Learning · Computer Science 2026-01-06 Jingxuan Zhou , Weidong Bao , Ji Wang , Zhengyi Zhong

It is widely acknowledged that large and sparse models have higher accuracy than small and dense models under the same model size constraints. This motivates us to train a large model and then remove its redundant neurons or weights by…

Computer Vision and Pattern Recognition · Computer Science 2024-01-12 Jianwei Li , Weizhi Gao , Qi Lei , Dongkuan Xu

Pruning aims to reduce the number of parameters while maintaining performance close to the original network. This work proposes a novel \emph{self-distillation} based pruning strategy, whereby the representational similarity between the…

Machine Learning · Computer Science 2021-10-01 James O' Neill , Sourav Dutta , Haytham Assem

Although disentangled representations are often said to be beneficial for downstream tasks, current empirical and theoretical understanding is limited. In this work, we provide evidence that disentangled representations coupled with sparse…

In response to recent data regulation requirements, machine unlearning (MU) has emerged as a critical process to remove the influence of specific examples from a given model. Although exact unlearning can be achieved through complete model…

Machine Learning · Computer Science 2024-01-30 Jinghan Jia , Jiancheng Liu , Parikshit Ram , Yuguang Yao , Gaowen Liu , Yang Liu , Pranay Sharma , Sijia Liu
‹ Prev 1 2 3 10 Next ›