Task-specific Compression for Multi-task Language Models using Attribution-based Pruning

Nakyeong Yang; Yunah Jang; Hwanhee Lee; Seohyeong Jung; Kyomin Jung

Task-specific Compression for Multi-task Language Models using Attribution-based Pruning

Computation and Language 2023-02-14 v2 Artificial Intelligence

Authors: Nakyeong Yang , Yunah Jang , Hwanhee Lee , Seohyeong Jung , Kyomin Jung

Abstract

Multi-task language models show outstanding performance for various natural language understanding tasks with only a single model. However, these language models utilize an unnecessarily large number of model parameters, even when used only for a specific task. This paper proposes a novel training-free compression method for multi-task language models using a pruning method. Specifically, we use an attribution method to determine which neurons are essential for performing a specific task. We task-specifically prune unimportant neurons and leave only task-specific parameters. Furthermore, we extend our method to be applicable in low-resource and unsupervised settings. Since our compression method is training-free, it uses few computing resources and does not destroy the pre-trained knowledge of language models. Experimental results on the six widely-used datasets show that our proposed pruning method significantly outperforms baseline pruning methods. In addition, we demonstrate that our method preserves performance even in an unseen domain setting.

Keywords

model compression network pruning multi-task learning

Cite

@article{arxiv.2205.04157,
  title  = {Task-specific Compression for Multi-task Language Models using Attribution-based Pruning},
  author = {Nakyeong Yang and Yunah Jang and Hwanhee Lee and Seohyeong Jung and Kyomin Jung},
  journal= {arXiv preprint arXiv:2205.04157},
  year   = {2023}
}

Comments

11 pages, 4 figures

Task-specific Compression for Multi-task Language Models using Attribution-based Pruning

Abstract

Keywords

Cite

Comments

Related papers