English

Dynamic Network Surgery for Efficient DNNs

Neural and Evolutionary Computing 2016-11-11 v2 Computer Vision and Pattern Recognition Machine Learning

Abstract

Deep learning has become a ubiquitous technology to improve machine intelligence. However, most of the existing deep models are structurally very complex, making them difficult to be deployed on the mobile platforms with limited computational power. In this paper, we propose a novel network compression method called dynamic network surgery, which can remarkably reduce the network complexity by making on-the-fly connection pruning. Unlike the previous methods which accomplish this task in a greedy way, we properly incorporate connection splicing into the whole process to avoid incorrect pruning and make it as a continual network maintenance. The effectiveness of our method is proved with experiments. Without any accuracy loss, our method can efficiently compress the number of parameters in LeNet-5 and AlexNet by a factor of 108×\bm{108}\times and 17.7×\bm{17.7}\times respectively, proving that it outperforms the recent pruning method by considerable margins. Code and some models are available at https://github.com/yiwenguo/Dynamic-Network-Surgery.

Keywords

Cite

@article{arxiv.1608.04493,
  title  = {Dynamic Network Surgery for Efficient DNNs},
  author = {Yiwen Guo and Anbang Yao and Yurong Chen},
  journal= {arXiv preprint arXiv:1608.04493},
  year   = {2016}
}

Comments

Accepted by NIPS 2016