Adjustable Bounded Rectifiers: Towards Deep Binary Representations

Zhirong Wu; Dahua Lin; Xiaoou Tang

Machine Learning, 2015-11-20, v1

Abstract

Binary representation is desirable for its memory efficiency, computation speed and robustness. In this paper, we propose adjustable bounded rectifiers to learn binary representations for deep neural networks. While hard-constraining representations across layers to be binary makes training unreasonably difficult, we softly encourage activations to diverge from real values to binary values by approximating step functions. Our final representation is completely binary. We test our approach on the MNIST, CIFAR10, and ILSVRC2012 datasets, and systematically study the training dynamics of the binarization process. Our approach can binarize the last-layer representation without loss of performance and binarize all the layers with reasonably small degradation. The memory that it saves may allow more sophisticated models to be deployed, thus compensating for the loss. To the best of our knowledge, this is the first work to report results on current deep network architectures using completely binary intermediate representations. Given the learned representations, we find that the firing or inhibition of a binary neuron is usually associated with a meaningful interpretation across different classes. This suggests that the semantic structure of a neural network may be manifested through a guided binarization process.
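The idea of approximating a step function with a bounded rectifier can be illustrated with a minimal sketch. The abstract does not give the paper's actual formulation, so everything below is an assumption made for illustration only: the clipped-linear form, the `slope` parameter, and the function names `bounded_rectifier`, `binarize`, and `max_gap_to_step` are all hypothetical. The sketch only shows that a rectifier clipped to [0, 1] moves toward a 0/1 step as its slope is increased.

```python
# Illustrative sketch only; NOT the paper's definition of an adjustable
# bounded rectifier. The clipped-linear form and `slope` are assumptions.

def bounded_rectifier(x, slope=1.0):
    """Rectifier with an upper bound: clip slope * x to the range [0, 1]."""
    return min(max(slope * x, 0.0), 1.0)


def binarize(x):
    """Hard step function: 1 for positive inputs, 0 otherwise."""
    return 1.0 if x > 0 else 0.0


def max_gap_to_step(xs, slope):
    """Largest difference between the bounded rectifier and the hard step."""
    return max(abs(bounded_rectifier(x, slope) - binarize(x)) for x in xs)


inputs = [-2.0, -0.5, 0.1, 0.5, 2.0]
for slope in (1.0, 10.0, 100.0):
    activations = [bounded_rectifier(x, slope) for x in inputs]
    print(slope, activations, max_gap_to_step(inputs, slope))
```

In this toy setting, raising `slope` plays the role of the "adjustable" knob: at a low slope the activations are real-valued between 0 and 1, and at a high slope they coincide with the binary step on the sample inputs. How the paper actually schedules or learns this adjustment is not stated in the abstract.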

Keywords

binary neural network; neural network; neural network training

Cite

@article{arxiv.1511.06201,
  title  = {Adjustable Bounded Rectifiers: Towards Deep Binary Representations},
  author = {Zhirong Wu and Dahua Lin and Xiaoou Tang},
  journal= {arXiv preprint arXiv:1511.06201},
  year   = {2015}
}

Comments

Under review as a conference paper at ICLR 2016
