English

Bayesian Relational Memory for Semantic Visual Navigation

Computer Vision and Pattern Recognition 2019-09-11 v1 Machine Learning Robotics

Abstract

We introduce a new memory architecture, Bayesian Relational Memory (BRM), to improve the generalization ability for semantic visual navigation agents in unseen environments, where an agent is given a semantic target to navigate towards. BRM takes the form of a probabilistic relation graph over semantic entities (e.g., room types), which allows (1) capturing the layout prior from training environments, i.e., prior knowledge, (2) estimating posterior layout at test time, i.e., memory update, and (3) efficient planning for navigation, altogether. We develop a BRM agent consisting of a BRM module for producing sub-goals and a goal-conditioned locomotion module for control. When testing in unseen environments, the BRM agent outperforms baselines that do not explicitly utilize the probabilistic relational memory structure

Keywords

Cite

@article{arxiv.1909.04306,
  title  = {Bayesian Relational Memory for Semantic Visual Navigation},
  author = {Yi Wu and Yuxin Wu and Aviv Tamar and Stuart Russell and Georgia Gkioxari and Yuandong Tian},
  journal= {arXiv preprint arXiv:1909.04306},
  year   = {2019}
}

Comments

Accepted at ICCV 2019

R2 v1 2026-06-23T11:10:40.651Z