Related papers: Simultaneous Mapping and Target Driven Navigation

Visual Representations for Semantic Target Driven Navigation

What is a good visual representation for autonomous agents? We address this question in the context of semantic visual navigation, which is the problem of a robot finding its way through a complex environment to a target object, e.g. go to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Arsalan Mousavian , Alexander Toshev , Marek Fiser , Jana Kosecka , Ayzaan Wahid , James Davidson

Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning

We present a target-driven navigation system to improve mapless visual navigation in indoor scenes. Our method takes a multi-view observation of a robot and a target as inputs at each time step to provide a sequence of actions that move the…

Robotics · Computer Science 2022-05-10 Qiaoyun Wu , Xiaoxi Gong , Kai Xu , Dinesh Manocha , Jingxuan Dong , Jun Wang

Learning to Map for Active Semantic Goal Navigation

We consider the problem of object goal navigation in unseen environments. Solving this problem requires learning of contextual semantic priors, a challenging endeavour given the spatial and semantic variability of indoor environments.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Georgios Georgakis , Bernadette Bucher , Karl Schmeckpeper , Siddharth Singh , Kostas Daniilidis

Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation

In the context of visual navigation, the capacity to map a novel environment is necessary for an agent to exploit its observation history in the considered place and efficiently reach known goals. This ability can be associated with spatial…

Computer Vision and Pattern Recognition · Computer Science 2023-04-26 Pierre Marza , Laetitia Matignon , Olivier Simonin , Christian Wolf

Multi-Object Navigation with dynamically learned neural implicit representations

Understanding and mapping a new environment are core abilities of any autonomously navigating agent. While classical robotics usually estimates maps in a stand-alone manner with SLAM variants, which maintain a topological or metric…

Computer Vision and Pattern Recognition · Computer Science 2023-09-28 Pierre Marza , Laetitia Matignon , Olivier Simonin , Christian Wolf

SGoLAM: Simultaneous Goal Localization and Mapping for Multi-Object Goal Navigation

We present SGoLAM, short for simultaneous goal localization and mapping, which is a simple and efficient algorithm for Multi-Object Goal navigation. Given an agent equipped with an RGB-D camera and a GPS/Compass sensor, our objective is to…

Computer Vision and Pattern Recognition · Computer Science 2021-10-15 Junho Kim , Eun Sun Lee , Mingi Lee , Donsu Zhang , Young Min Kim

Semantic Mapping in Indoor Embodied AI -- A Survey on Advances, Challenges, and Future Directions

Intelligent embodied agents (e.g. robots) need to perform complex semantic tasks in unfamiliar environments. Among many skills that the agents need to possess, building and maintaining a semantic map of the environment is most crucial in…

Robotics · Computer Science 2025-08-13 Sonia Raychaudhuri , Angel X. Chang

Visual Navigation with Spatial Attention

This work focuses on object goal visual navigation, aiming at finding the location of an object from a given class, where in each step the agent is provided with an egocentric RGB image of the scene. We propose to learn the agent's policy…

Computer Vision and Pattern Recognition · Computer Science 2021-04-21 Bar Mayo , Tamir Hazan , Ayellet Tal

Online Object-Oriented Semantic Mapping and Map Updating

Creating and maintaining an accurate representation of the environment is an essential capability for every service robot. Especially for household robots acting in indoor environments, semantic information is important. In this paper, we…

Robotics · Computer Science 2025-01-09 Nils Dengler , Tobias Zaenker , Francesco Verdoja , Maren Bennewitz

Image-Goal Navigation in Complex Environments via Modular Learning

We present a novel approach for image-goal navigation, where an agent navigates with a goal image rather than accurate target information, which is more challenging. Our goal is to decouple the learning of navigation goal planning,…

Robotics · Computer Science 2022-02-23 Qiaoyun Wu , Jun Wang , Jing Liang , Xiaoxi Gong , Dinesh Manocha

Learning to Navigate in Complex Environments

Learning to navigate in complex environments with dynamic elements is an important milestone in developing AI agents. In this work we formulate the navigation question as a reinforcement learning problem and show that data efficiency and…

Artificial Intelligence · Computer Science 2017-01-16 Piotr Mirowski , Razvan Pascanu , Fabio Viola , Hubert Soyer , Andrew J. Ballard , Andrea Banino , Misha Denil , Ross Goroshin , Laurent Sifre , Koray Kavukcuoglu , Dharshan Kumaran , Raia Hadsell

Hybrid Decision Making for Scalable Multi-Agent Navigation: Integrating Semantic Maps, Discrete Coordination, and Model Predictive Control

This paper presents a framework for multi-agent navigation in structured but dynamic environments, integrating three key components: a shared semantic map encoding metric and semantic environmental knowledge, a claim policy for coordinating…

Robotics · Computer Science 2024-10-17 Koen de Vos , Elena Torta , Herman Bruyninckx , Cesar Lopez Martinez , Rene van de Molengraft

Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation

We introduce a learning-based approach for room navigation using semantic maps. Our proposed architecture learns to predict top-down belief maps of regions that lie beyond the agent's field of view while modeling architectural and stylistic…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Medhini Narasimhan , Erik Wijmans , Xinlei Chen , Trevor Darrell , Dhruv Batra , Devi Parikh , Amanpreet Singh

Towards Navigation by Reasoning over Spatial Configurations

We deal with the navigation problem where the agent follows natural language instructions while observing the environment. Focusing on language understanding, we show the importance of spatial semantics in grounding navigation instructions…

Computation and Language · Computer Science 2021-05-17 Yue Zhang , Quan Guo , Parisa Kordjamshidi

MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour , Kowshik Thopalli , Niluthpol Mithun , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar

Look, Listen, and Act: Towards Audio-Visual Embodied Navigation

A crucial ability of mobile intelligent agents is to integrate the evidence from multiple sensory inputs in an environment and to make a sequence of actions to reach their goals. In this paper, we attempt to approach the problem of…

Computer Vision and Pattern Recognition · Computer Science 2020-03-10 Chuang Gan , Yiwei Zhang , Jiajun Wu , Boqing Gong , Joshua B. Tenenbaum

Mapping High-level Semantic Regions in Indoor Environments without Object Recognition

Robots require a semantic understanding of their surroundings to operate in an efficient and explainable way in human environments. In the literature, there has been an extensive focus on object labeling and exhaustive scene graph…

Robotics · Computer Science 2024-04-16 Roberto Bigazzi , Lorenzo Baraldi , Shreyas Kousik , Rita Cucchiara , Marco Pavone

GraphMapper: Efficient Visual Navigation by Scene Graph Generation

Understanding the geometric relationships between objects in a scene is a core capability in enabling both humans and autonomous agents to navigate in new environments. A sparse, unified representation of the scene topology will allow…

Computer Vision and Pattern Recognition · Computer Science 2022-05-18 Zachary Seymour , Niluthpol Chowdhury Mithun , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar

Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation

Navigating complex indoor environments requires a deep understanding of the space the robotic agent is acting into to correctly inform the navigation process of the agent towards the goal location. In recent learning-based navigation…

Robotics · Computer Science 2023-10-05 Marco Rosano , Antonino Furnari , Luigi Gulino , Corrado Santoro , Giovanni Maria Farinella

MemoNav: Working Memory Model for Visual Navigation

Image-goal navigation is a challenging task that requires an agent to navigate to a goal indicated by an image in unfamiliar environments. Existing methods utilizing diverse scene memories suffer from inefficient exploration since they use…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Hongxin Li , Zeyu Wang , Xu Yang , Yuran Yang , Shuqi Mei , Zhaoxiang Zhang