Separable Layers Enable Structured Efficient Linear Substitutions

Gavin Gray; Elliot J. Crowley; Amos Storkey

Separable Layers Enable Structured Efficient Linear Substitutions

Machine Learning 2019-06-04 v1 Machine Learning

Authors: Gavin Gray , Elliot J. Crowley , Amos Storkey

Abstract

In response to the development of recent efficient dense layers, this paper shows that something as simple as replacing linear components in pointwise convolutions with structured linear decompositions also produces substantial gains in the efficiency/accuracy tradeoff. Pointwise convolutions are fully connected layers and are thus prepared for replacement by structured transforms. Networks using such layers are able to learn the same tasks as those using standard convolutions, and provide Pareto-optimal benefits in efficiency/accuracy, both in terms of computation (mult-adds) and parameter count (and hence memory). Code is available at https://github.com/BayesWatch/deficient-efficient.

Separable Layers Enable Structured Efficient Linear Substitutions

Abstract

Keywords

Cite

Related papers