English

Analyzing Visual Representations in Embodied Navigation Tasks

Computer Vision and Pattern Recognition 2020-03-16 v1 Artificial Intelligence Machine Learning

Abstract

Recent advances in deep reinforcement learning require a large amount of training data and generally result in representations that are often over specialized to the target task. In this work, we present a methodology to study the underlying potential causes for this specialization. We use the recently proposed projection weighted Canonical Correlation Analysis (PWCCA) to measure the similarity of visual representations learned in the same environment by performing different tasks. We then leverage our proposed methodology to examine the task dependence of visual representations learned on related but distinct embodied navigation tasks. Surprisingly, we find that slight differences in task have no measurable effect on the visual representation for both SqueezeNet and ResNet architectures. We then empirically demonstrate that visual representations learned on one task can be effectively transferred to a different task.

Keywords

Cite

@article{arxiv.2003.05993,
  title  = {Analyzing Visual Representations in Embodied Navigation Tasks},
  author = {Erik Wijmans and Julian Straub and Dhruv Batra and Irfan Essa and Judy Hoffman and Ari Morcos},
  journal= {arXiv preprint arXiv:2003.05993},
  year   = {2020}
}
R2 v1 2026-06-23T14:13:17.856Z