Related papers: Egocentric Spatial Memory

End-to-End Egospheric Spatial Memory

Spatial memory, or the ability to remember and recall specific locations and objects, is central to autonomous agents' ability to carry out tasks in real environments. However, most existing artificial memory modules are not very adept at…

Robotics · Computer Science 2021-02-18 Daniel Lenton , Stephen James , Ronald Clark , Andrew J. Davison

ECO: Egocentric Cognitive Mapping

We present a new method to localize a camera within a previously unseen environment perceived from an egocentric point of view. Although this is, in general, an ill-posed problem, humans can effortlessly and efficiently determine their…

Computer Vision and Pattern Recognition · Computer Science 2018-12-04 Jayant Sharma , Zixing Wang , Alberto Speranzon , Vijay Venkataraman , Hyun Soo Park

Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views

We study the task of semantic mapping - specifically, an embodied agent (a robot or an egocentric AI assistant) is given a tour of a new environment and asked to build an allocentric top-down semantic map ("what is where?") from egocentric…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Vincent Cartillier , Zhile Ren , Neha Jain , Stefan Lee , Irfan Essa , Dhruv Batra

EgoMap: Projective mapping and structured egocentric memory for Deep RL

Tasks involving localization, memorization and planning in partially observable 3D environments are an ongoing challenge in Deep Reinforcement Learning. We present EgoMap, a spatially structured neural memory architecture. EgoMap augments a…

Machine Learning · Computer Science 2020-02-10 Edward Beeching , Christian Wolf , Jilles Dibangoye , Olivier Simonin

Building spatial world models from sparse transitional episodic memories

Many animals possess a remarkable capacity to rapidly construct flexible cognitive maps of their environments. These maps are crucial for ethologically relevant behaviors such as navigation, exploration, and planning. Existing computational…

Artificial Intelligence · Computer Science 2026-02-04 Zizhan He , Maxime Daigle , Pouya Bashivan

Neural SLAM: Learning to Explore with External Memory

We present an approach for agents to learn representations of a global map from sensor data, to aid their exploration in new environments. To achieve this, we embed procedures mimicking that of traditional Simultaneous Localization and…

Machine Learning · Computer Science 2021-01-01 Jingwei Zhang , Lei Tai , Ming Liu , Joschka Boedecker , Wolfram Burgard

Short-Term Prediction and Multi-Camera Fusion on Semantic Grids

An environment representation (ER) is a substantial part of every autonomous system. It introduces a common interface between perception and other system components, such as decision making, and allows downstream algorithms to deal with…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Lukas Hoyer , Patrick Kesper , Anna Khoreva , Volker Fischer

Entropic associative memory for real world images

The entropic associative memory (EAM) is a computational model of natural memory incorporating some of its putative properties of being associative, distributed, declarative, abstractive and constructive. Previous experiments satisfactorily…

Machine Learning · Computer Science 2024-05-22 Noé Hernández , Rafael Morales , Luis A. Pineda

SpatialMem: Metric-Aligned Long-Horizon Video Memory for Language Grounding and QA

We present SpatialMem, a memory-centric system for long-horizon, language-grounded retrieval and QA from egocentric video, where metric 3D serves as an interpretable indexing scaffold rather than an explicit mapping objective. Starting from…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Xinyi Zheng , Yunze Liu , Chi-Hao Wu , Fan Zhang , Hao Zheng , Wenqi Zhou , Walterio W. Mayol-Cuevas , Junxiao Shen

Learning Navigational Visual Representations with Semantic Map Supervision

Being able to perceive the semantics and the spatial structure of the environment is essential for visual navigation of a household robot. However, most existing works only employ visual backbones pre-trained either with independent images…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Yicong Hong , Yang Zhou , Ruiyi Zhang , Franck Dernoncourt , Trung Bui , Stephen Gould , Hao Tan

EMPNet: Neural Localisation and Mapping Using Embedded Memory Points

Continuously estimating an agent's state space and a representation of its surroundings has proven vital towards full autonomy. A shared common ground among systems which successfully achieve this feat is the integration of previously…

Computer Vision and Pattern Recognition · Computer Science 2019-08-05 Gil Avraham , Yan Zuo , Thanuja Dharmasiri , Tom Drummond

SEM: Enhancing Spatial Understanding for Robust Robot Manipulation

A key challenge in robot manipulation lies in developing policy models with strong spatial understanding, the ability to reason about 3D geometry, object relations, and robot embodiment. Existing methods often fall short: 3D point cloud…

Robotics · Computer Science 2025-09-25 Xuewu Lin , Tianwei Lin , Lichao Huang , Hongyu Xie , Yiwei Jin , Keyu Li , Zhizhong Su

Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition

In this paper we propose an end-to-end trainable deep neural network model for egocentric activity recognition. Our model is built on the observation that egocentric activities are highly characterized by the objects and their locations in…

Computer Vision and Pattern Recognition · Computer Science 2018-08-01 Swathikiran Sudhakaran , Oswald Lanz

Empirical Capacity Model for Self-Attention Neural Networks

Large pretrained self-attention neural networks, or transformers, have been very successful in various tasks recently. The performance of a model on a given task depends on its ability to memorize and generalize the training data. Large…

Machine Learning · Computer Science 2024-08-01 Aki Härmä , Marcin Pietrasik , Anna Wilbik

Superposed Episodic and Semantic Memory via Sparse Distributed Representation

The abilities to perceive, learn, and use generalities, similarities, classes, i.e., semantic memory (SM), is central to cognition. Machine learning (ML), neural network, and AI research has been primarily driven by tasks requiring such…

Neural and Evolutionary Computing · Computer Science 2017-10-24 Rod Rinkus , Jasmin Leveille

Energy-Regularized Spatial Masking: A Novel Approach to Enhancing Robustness and Interpretability in Vision Models

Deep convolutional neural networks achieve remarkable performance by exhaustively processing dense spatial feature maps, yet this brute-force strategy introduces significant computational redundancy and encourages reliance on spurious…

Computer Vision and Pattern Recognition · Computer Science 2026-04-15 Tom Devynck , Bilal Faye , Djamel Bouchaffra , Nadjib Lazaar , Hanane Azzag , Mustapha Lebbah

Memory based neural networks for end-to-end autonomous driving

Recent works in end-to-end control for autonomous driving have investigated the use of vision-based exteroceptive perception. Inspired by such results, we propose a new end-to-end memory-based neural architecture for robot steering and…

Robotics · Computer Science 2022-05-25 Sergio Paniego Blanco , Sakshay Mahna , Utkarsh A. Mishra , JoseMaria Canas

EgoExoMem: Cross-View Memory Reasoning over Synchronized Egocentric and Exocentric Videos

Egocentric memory is widely used in embodied intelligence, but it may be insufficient for comprehensive spatial-temporal reasoning. Inspired by human recall from both field and observer perspectives, we introduce EgoExoMem, the first…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Ruiping Liu , Junwei Zheng , Yufan Chen , Di Wen , Shaofang Quan , Chengzhi Wu , Jiaming Zhang , Kailun Yang , Kunyu Peng , Rainer Stiefelhagen

An Integrated Approach to Autonomous Environment Modeling

In this paper, we present an integrated solution to memory-efficient environment modeling by an autonomous mobile robot equipped with a laser range-finder. Majority of nowadays approaches to autonomous environment modeling, called…

Robotics · Computer Science 2019-01-23 Miroslav Kulich , Viktor Kozák , Libor Přeučil

What to Do Next? Memorizing skills from Egocentric Instructional Video

Learning to perform activities through demonstration requires extracting meaningful information about the environment from observations. In this research, we investigate the challenge of planning high-level goal-oriented actions in a…

Machine Learning · Computer Science 2025-07-08 Jing Bi , Chenliang Xu