Dynamic Sparse Graph for Efficient Deep Learning

Liu Liu; Lei Deng; Xing Hu; Maohua Zhu; Guoqi Li; Yufei Ding; Yuan Xie

Dynamic Sparse Graph for Efficient Deep Learning

Machine Learning 2019-05-08 v2 Machine Learning

Authors: Liu Liu , Lei Deng , Xing Hu , Maohua Zhu , Guoqi Li , Yufei Ding , Yuan Xie

Abstract

We propose to execute deep neural networks (DNNs) with dynamic and sparse graph (DSG) structure for compressive memory and accelerative execution during both training and inference. The great success of DNNs motivates the pursuing of lightweight models for the deployment onto embedded devices. However, most of the previous studies optimize for inference while neglect training or even complicate it. Training is far more intractable, since (i) the neurons dominate the memory cost rather than the weights in inference; (ii) the dynamic activation makes previous sparse acceleration via one-off optimization on fixed weight invalid; (iii) batch normalization (BN) is critical for maintaining accuracy while its activation reorganization damages the sparsity. To address these issues, DSG activates only a small amount of neurons with high selectivity at each iteration via a dimension-reduction search (DRS) and obtains the BN compatibility via a double-mask selection (DMS). Experiments show significant memory saving (1.7-4.5x) and operation reduction (2.3-4.4x) with little accuracy loss on various benchmarks.

Keywords

sparse learning deep neural network acceleration deep neural network

Cite

@article{arxiv.1810.00859,
  title  = {Dynamic Sparse Graph for Efficient Deep Learning},
  author = {Liu Liu and Lei Deng and Xing Hu and Maohua Zhu and Guoqi Li and Yufei Ding and Yuan Xie},
  journal= {arXiv preprint arXiv:1810.00859},
  year   = {2019}
}

Comments

ICLR 2019

Dynamic Sparse Graph for Efficient Deep Learning

Abstract

Keywords

Cite

Comments

Related papers