English

Structured Transforms for Small-Footprint Deep Learning

Machine Learning 2015-10-07 v1 Computer Vision and Pattern Recognition Machine Learning

Abstract

We consider the task of building compact deep learning pipelines suitable for deployment on storage and power constrained mobile devices. We propose a unified framework to learn a broad family of structured parameter matrices that are characterized by the notion of low displacement rank. Our structured transforms admit fast function and gradient evaluation, and span a rich range of parameter sharing configurations whose statistical modeling capacity can be explicitly tuned along a continuum from structured to unstructured. Experimental results show that these transforms can significantly accelerate inference and forward/backward passes during training, and offer superior accuracy-compactness-speed tradeoffs in comparison to a number of existing techniques. In keyword spotting applications in mobile speech recognition, our methods are much more effective than standard linear low-rank bottleneck layers and nearly retain the performance of state of the art models, while providing more than 3.5-fold compression.

Keywords

Cite

@article{arxiv.1510.01722,
  title  = {Structured Transforms for Small-Footprint Deep Learning},
  author = {Vikas Sindhwani and Tara N. Sainath and Sanjiv Kumar},
  journal= {arXiv preprint arXiv:1510.01722},
  year   = {2015}
}

Comments

To appear in NIPS 2015; 9 pages

R2 v1 2026-06-22T11:14:15.485Z