English

Stateful ODE-Nets using Basis Function Expansions

Machine Learning 2021-11-09 v2 Machine Learning

Abstract

The recently-introduced class of ordinary differential equation networks (ODE-Nets) establishes a fruitful connection between deep learning and dynamical systems. In this work, we reconsider formulations of the weights as continuous-in-depth functions using linear combinations of basis functions which enables us to leverage parameter transformations such as function projections. In turn, this view allows us to formulate a novel stateful ODE-Block that handles stateful layers. The benefits of this new ODE-Block are twofold: first, it enables incorporating meaningful continuous-in-depth batch normalization layers to achieve state-of-the-art performance; second, it enables compressing the weights through a change of basis, without retraining, while maintaining near state-of-the-art performance and reducing both inference time and memory footprint. Performance is demonstrated by applying our stateful ODE-Block to (a) image classification tasks using convolutional units and (b) sentence-tagging tasks using transformer encoder units.

Keywords

Cite

@article{arxiv.2106.10820,
  title  = {Stateful ODE-Nets using Basis Function Expansions},
  author = {Alejandro Queiruga and N. Benjamin Erichson and Liam Hodgkinson and Michael W. Mahoney},
  journal= {arXiv preprint arXiv:2106.10820},
  year   = {2021}
}

Comments

Accepted at 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

R2 v1 2026-06-24T03:24:30.216Z