Separable Layers Enable Structured Efficient Linear Substitutions

Machine Learning · 2019-06-04 · v1

Abstract

In response to the development of recent efficient dense layers, this paper shows that something as simple as replacing linear components in pointwise convolutions with structured linear decompositions also produces substantial gains in the efficiency/accuracy tradeoff. Pointwise convolutions are fully connected layers and are thus prepared for replacement by structured transforms. Networks using such layers are able to learn the same tasks as those using standard convolutions, and provide Pareto-optimal benefits in efficiency/accuracy, both in terms of computation (mult-adds) and parameter count (and hence memory). Code is available at https://github.com/BayesWatch/deficient-efficient.
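The substitution described in the abstract can be illustrated with a minimal sketch. It assumes a rank-r factorization W ≈ UV as the structured decomposition, which is only one possible choice and is not necessarily the one the paper evaluates. All names here (`pointwise_conv`, `low_rank_pointwise_conv`, `RANK`, and so on) are hypothetical and are not taken from the linked repository. The sketch shows that a pointwise (1x1) convolution applies the same channel-mixing matrix at every pixel, and that factorizing this matrix reduces both the parameter count and the mult-adds.

```python
import random

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def pointwise_conv(W, fmap):
    """1x1 convolution: apply the same C_OUT x C_IN matrix W at every pixel.

    fmap is indexed as fmap[row][col][channel].
    """
    return [[matvec(W, pixel) for pixel in row] for row in fmap]

def low_rank_pointwise_conv(U, V, fmap):
    """Assumed structured substitute: W is replaced by the product U @ V."""
    return [[matvec(U, matvec(V, pixel)) for pixel in row] for row in fmap]

def random_matrix(rows, cols, rng):
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

# Hypothetical sizes, chosen only for illustration.
C_IN, C_OUT, RANK, HEIGHT, WIDTH = 8, 8, 2, 3, 3
rng = random.Random(0)

U = random_matrix(C_OUT, RANK, rng)   # C_OUT x RANK
V = random_matrix(RANK, C_IN, rng)    # RANK x C_IN
fmap = [[[rng.uniform(-1, 1) for _ in range(C_IN)]
         for _ in range(WIDTH)] for _ in range(HEIGHT)]

# A dense layer whose weight happens to equal U @ V gives the same output
# as the factorized layer, up to floating-point rounding.
dense_out = pointwise_conv(matmul(U, V), fmap)
low_rank_out = low_rank_pointwise_conv(U, V, fmap)
max_diff = max(abs(a - b)
               for row_d, row_l in zip(dense_out, low_rank_out)
               for px_d, px_l in zip(row_d, row_l)
               for a, b in zip(px_d, px_l))

# Cost comparison: parameters, and mult-adds over the whole feature map.
dense_params = C_OUT * C_IN
low_rank_params = RANK * (C_IN + C_OUT)
dense_mult_adds = HEIGHT * WIDTH * dense_params
low_rank_mult_adds = HEIGHT * WIDTH * low_rank_params

print(dense_params, low_rank_params, dense_mult_adds, low_rank_mult_adds)
```

In this sketch the factorized layer is cheaper whenever `RANK * (C_IN + C_OUT) < C_IN * C_OUT`. A trained dense layer is generally not exactly low rank, so in practice the structured layer would be trained directly rather than obtained by factorizing an existing weight.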

Cite

@article{arxiv.1906.00859,
  title  = {Separable Layers Enable Structured Efficient Linear Substitutions},
  author = {Gavin Gray and Elliot J. Crowley and Amos Storkey},
  journal= {arXiv preprint arXiv:1906.00859},
  year   = {2019}
}