Related papers: MultiON: Benchmarking Semantic Map Memory using Mu…

Sequence-Agnostic Multi-Object Navigation

The Multi-Object Navigation (MultiON) task requires a robot to localize an instance (each) of multiple object classes. It is a fundamental task for an assistive robot in a home or a factory. Existing methods for MultiON have viewed this as…

Robotics · Computer Science 2023-05-11 Nandiraju Gireesh , Ayush Agrawal , Ahana Datta , Snehasis Banerjee , Mohan Sridharan , Brojeshwar Bhowmick , Madhava Krishna

Multi-Object Navigation in real environments using hybrid policies

Navigation has been classically solved in robotics through the combination of SLAM and planning. More recently, beyond waypoint planning, problems involving significant components of (visual) high-level reasoning have been explored in…

Robotics · Computer Science 2024-01-26 Assem Sadek , Guillaume Bono , Boris Chidlovskii , Atilla Baskurt , Christian Wolf

Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation

In the context of visual navigation, the capacity to map a novel environment is necessary for an agent to exploit its observation history in the considered place and efficiently reach known goals. This ability can be associated with spatial…

Computer Vision and Pattern Recognition · Computer Science 2023-04-26 Pierre Marza , Laetitia Matignon , Olivier Simonin , Christian Wolf

MemoNav: Working Memory Model for Visual Navigation

Image-goal navigation is a challenging task that requires an agent to navigate to a goal indicated by an image in unfamiliar environments. Existing methods utilizing diverse scene memories suffer from inefficient exploration since they use…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Hongxin Li , Zeyu Wang , Xu Yang , Yuran Yang , Shuqi Mei , Zhaoxiang Zhang

MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour , Kowshik Thopalli , Niluthpol Mithun , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar

Auxiliary Tasks and Exploration Enable ObjectNav

ObjectGoal Navigation (ObjectNav) is an embodied task wherein agents are to navigate to an object instance in an unseen environment. Prior works have shown that end-to-end ObjectNav agents that use vanilla visual and recurrent modules, e.g.…

Computer Vision and Pattern Recognition · Computer Science 2021-08-04 Joel Ye , Dhruv Batra , Abhishek Das , Erik Wijmans

Collaborative Visual Navigation

As a fundamental problem for Artificial Intelligence, multi-agent system (MAS) is making rapid progress, mainly driven by multi-agent reinforcement learning (MARL) techniques. However, previous MARL methods largely focused on grid-world…

Computer Vision and Pattern Recognition · Computer Science 2021-07-21 Haiyang Wang , Wenguan Wang , Xizhou Zhu , Jifeng Dai , Liwei Wang

Learning to Navigate in Complex Environments

Learning to navigate in complex environments with dynamic elements is an important milestone in developing AI agents. In this work we formulate the navigation question as a reinforcement learning problem and show that data efficiency and…

Artificial Intelligence · Computer Science 2017-01-16 Piotr Mirowski , Razvan Pascanu , Fabio Viola , Hubert Soyer , Andrew J. Ballard , Andrea Banino , Misha Denil , Ross Goroshin , Laurent Sifre , Koray Kavukcuoglu , Dharshan Kumaran , Raia Hadsell

Multimodal Perception for Goal-oriented Navigation: A Survey

Goal-oriented navigation presents a fundamental challenge for autonomous systems, requiring agents to navigate complex environments to reach designated targets. This survey offers a comprehensive analysis of multimodal navigation approaches…

Robotics · Computer Science 2025-04-23 I-Tak Ieong , Hao Tang

MemoNav: Selecting Informative Memories for Visual Navigation

Image-goal navigation is a challenging task, as it requires the agent to navigate to a target indicated by an image in a previously unseen scene. Current methods introduce diverse memory mechanisms which save navigation history to solve…

Computer Vision and Pattern Recognition · Computer Science 2022-08-23 Hongxin Li , Xu Yang , Yuran Yang , Shuqi Mei , Zhaoxiang Zhang

Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge

In visual semantic navigation, the robot navigates to a target object with egocentric visual observations and the class label of the target is given. It is a meaningful task inspiring a surge of relevant research. However, most of the…

Artificial Intelligence · Computer Science 2021-09-21 Xinzhu Liu , Di Guo , Huaping Liu , Fuchun Sun

Object Memory Transformer for Object Goal Navigation

This paper presents a reinforcement learning method for object goal navigation (ObjNav) where an agent navigates in 3D indoor environments to reach a target object based on long-term observations of objects and scenes. To this end, we…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Rui Fukushima , Kei Ota , Asako Kanezaki , Yoko Sasaki , Yusuke Yoshiyasu

Learning to Map for Active Semantic Goal Navigation

We consider the problem of object goal navigation in unseen environments. Solving this problem requires learning of contextual semantic priors, a challenging endeavour given the spatial and semantic variability of indoor environments.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Georgios Georgakis , Bernadette Bucher , Karl Schmeckpeper , Siddharth Singh , Kostas Daniilidis

UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents

Aerial navigation is a fundamental yet underexplored capability in embodied intelligence, enabling agents to operate in large-scale, unstructured environments where traditional navigation paradigms fall short. However, most existing…

Robotics · Computer Science 2025-08-25 Jianqiang Xiao , Yuexuan Sun , Yixin Shao , Boxi Gan , Rongqiang Liu , Yanjing Wu , Weili Guan , Xiang Deng

Object Goal Navigation using Goal-Oriented Semantic Exploration

This work studies the problem of object goal navigation which involves navigating to an instance of the given object category in unseen environments. End-to-end learning-based navigation methods struggle at this task as they are ineffective…

Computer Vision and Pattern Recognition · Computer Science 2020-07-03 Devendra Singh Chaplot , Dhiraj Gandhi , Abhinav Gupta , Ruslan Salakhutdinov

Object Goal Navigation with Recursive Implicit Maps

Object goal navigation aims to navigate an agent to locations of a given object category in unseen environments. Classical methods explicitly build maps of environments and require extensive engineering while lacking semantic information…

Computer Vision and Pattern Recognition · Computer Science 2023-08-11 Shizhe Chen , Thomas Chabal , Ivan Laptev , Cordelia Schmid

Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation Learning

Mobile agents that can leverage help from humans can potentially accomplish more complex tasks than they could entirely on their own. We develop "Help, Anna!" (HANNA), an interactive photo-realistic simulator in which an agent fulfills…

Human-Computer Interaction · Computer Science 2019-11-25 Khanh Nguyen , Hal Daumé

Image-Goal Navigation in Complex Environments via Modular Learning

We present a novel approach for image-goal navigation, where an agent navigates with a goal image rather than accurate target information, which is more challenging. Our goal is to decouple the learning of navigation goal planning,…

Robotics · Computer Science 2022-02-23 Qiaoyun Wu , Jun Wang , Jing Liang , Xiaoxi Gong , Dinesh Manocha

Simultaneous Mapping and Target Driven Navigation

This work presents a modular architecture for simultaneous mapping and target driven navigation in indoors environments. The semantic and appearance stored in 2.5D map is distilled from RGB images, semantic segmentation and outputs of…

Computer Vision and Pattern Recognition · Computer Science 2019-11-20 Georgios Georgakis , Yimeng Li , Jana Kosecka

ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings

We present a scalable approach for learning open-world object-goal navigation (ObjectNav) -- the task of asking a virtual robot (agent) to find any instance of an object in an unexplored environment (e.g., "find a sink"). Our approach is…

Computer Vision and Pattern Recognition · Computer Science 2023-10-16 Arjun Majumdar , Gunjan Aggarwal , Bhavika Devnani , Judy Hoffman , Dhruv Batra