English

Learning Canonical Transformations

Computer Vision and Pattern Recognition 2020-11-18 v1 Artificial Intelligence

Abstract

Humans understand a set of canonical geometric transformations (such as translation and rotation) that support generalization by being untethered to any specific object. We explore inductive biases that help a neural network model learn these transformations in pixel space in a way that can generalize out-of-domain. Specifically, we find that high training set diversity is sufficient for the extrapolation of translation to unseen shapes and scales, and that an iterative training scheme achieves significant extrapolation of rotation in time.

Keywords

Cite

@article{arxiv.2011.08822,
  title  = {Learning Canonical Transformations},
  author = {Zachary Dulberg and Jonathan Cohen},
  journal= {arXiv preprint arXiv:2011.08822},
  year   = {2020}
}

Comments

NeurIPS 2020 Workshop on BabyMind

R2 v1 2026-06-23T20:19:26.772Z