Related papers: Object-oriented Targets for Visual Navigation usin…

Navigating to Objects in Unseen Environments by Distance Prediction

The Object Goal Navigation (ObjectNav) task is to navigate an agent to an object category in unseen environments without a pre-built map. In this paper, we solve this task by predicting the distance to the target using semantically-related…

Robotics · Computer Science 2022-07-14 Minzhao Zhu, Binglei Zhao, Tao Kong

Visual Navigation with Spatial Attention

This work focuses on object goal visual navigation, aiming at finding the location of an object from a given class, where in each step the agent is provided with an egocentric RGB image of the scene. We propose to learn the agent's policy…

Computer Vision and Pattern Recognition · Computer Science 2021-04-21 Bar Mayo, Tamir Hazan, Ayellet Tal

Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge

In visual semantic navigation, the robot navigates to a target object with egocentric visual observations and the class label of the target is given. It is a meaningful task inspiring a surge of relevant research. However, most of the…

Artificial Intelligence · Computer Science 2021-09-21 Xinzhu Liu, Di Guo, Huaping Liu, Fuchun Sun

Visual Representations for Semantic Target Driven Navigation

What is a good visual representation for autonomous agents? We address this question in the context of semantic visual navigation, which is the problem of a robot finding its way through a complex environment to a target object, e.g. go to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Arsalan Mousavian, Alexander Toshev, Marek Fiser, Jana Kosecka, Ayzaan Wahid, James Davidson

Exploiting Scene-specific Features for Object Goal Navigation

Can the intrinsic relation between an object and the room in which it is usually located help agents in the Visual Navigation Task? We study this question in the context of Object Navigation, a problem in which an agent has to reach an…

Computer Vision and Pattern Recognition · Computer Science 2020-08-24 Tommaso Campari, Paolo Eccher, Luciano Serafini, Lamberto Ballan

VTNet: Visual Transformer Network for Object Goal Navigation

Object goal navigation aims to steer an agent towards a target object based on observations of the agent. It is of pivotal importance to design effective visual representations of the observed scene in determining navigation actions. In…

Computer Vision and Pattern Recognition · Computer Science 2021-05-21 Heming Du, Xin Yu, Liang Zheng

Learning to Map for Active Semantic Goal Navigation

We consider the problem of object goal navigation in unseen environments. Solving this problem requires learning of contextual semantic priors, a challenging endeavour given the spatial and semantic variability of indoor environments.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Georgios Georgakis, Bernadette Bucher, Karl Schmeckpeper, Siddharth Singh, Kostas Daniilidis

SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

This paper addresses the problem of object-goal navigation in autonomous inspections in real-world environments. Object-goal navigation is crucial to enable effective inspections in various settings, often requiring the robot to identify…

Robotics · Computer Science 2024-11-19 Muhammad Fadhil Ginting, Sung-Kyun Kim, David D. Fan, Matteo Palieri, Mykel J. Kochenderfer, Ali-akbar Agha-Mohammadi

Towards Navigation by Reasoning over Spatial Configurations

We deal with the navigation problem where the agent follows natural language instructions while observing the environment. Focusing on language understanding, we show the importance of spatial semantics in grounding navigation instructions…

Computation and Language · Computer Science 2021-05-17 Yue Zhang, Quan Guo, Parisa Kordjamshidi

Embodied Learning for Lifelong Visual Perception

We study lifelong visual perception in an embodied setup, where we develop new models and compare various agents that navigate in buildings and occasionally request annotations which, in turn, are used to refine their visual perception…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 David Nilsson, Aleksis Pirinen, Erik Gärtner, Cristian Sminchisescu

Self-supervised Visual Reinforcement Learning with Object-centric Representations

Autonomous agents need large repertoires of skills to act reasonably on new tasks that they have not seen before. However, acquiring these skills using only a stream of high-dimensional, unstructured, and unlabeled observations is a tricky…

Machine Learning · Computer Science 2021-02-09 Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius

Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation

In the context of visual navigation, the capacity to map a novel environment is necessary for an agent to exploit its observation history in the considered place and efficiently reach known goals. This ability can be associated with spatial…

Computer Vision and Pattern Recognition · Computer Science 2023-04-26 Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

Learning Object Relation Graph and Tentative Policy for Visual Navigation

Target-driven visual navigation aims at navigating an agent towards a given target based on the observation of the agent. In this task, it is critical to learn informative visual representation and robust navigation policy. Aiming to…

Computer Vision and Pattern Recognition · Computer Science 2020-07-23 Heming Du, Xin Yu, Liang Zheng

Neural Topological SLAM for Visual Navigation

This paper studies the problem of image-goal navigation which involves navigating to the location indicated by a goal image in a novel previously unseen environment. To tackle this problem, we design topological representations for space…

Computer Vision and Pattern Recognition · Computer Science 2020-06-01 Devendra Singh Chaplot, Ruslan Salakhutdinov, Abhinav Gupta, Saurabh Gupta

Where to Fetch: Extracting Visual Scene Representation from Large Pre-Trained Models for Robotic Goal Navigation

To complete a complex task where a robot navigates to a goal object and fetches it, the robot needs to have a good understanding of the instructions and the surrounding environment. Large pre-trained models have shown capabilities to…

Robotics · Computer Science 2024-08-21 Yu Li, Dayou Li, Chenkun Zhao, Ruifeng Wang, Ran Song, Wei Zhang

Instance-Level Semantic Maps for Vision Language Navigation

Humans have a natural ability to perform semantic associations with the surrounding objects in the environment. This allows them to create a mental map of the environment, allowing them to navigate on-demand when given linguistic…

Robotics · Computer Science 2023-11-21 Laksh Nanwani, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Monis, Aditya Mathur, Krishna Murthy, Abdul Hafez, Vineet Gandhi, K. Madhava Krishna

Structured Exploration Through Instruction Enhancement for Object Navigation

Finding an object of a specific class in an unseen environment remains an unsolved navigation problem. Hence, we propose a hierarchical learning-based method for object navigation. The top-level is capable of high-level planning, and…

Artificial Intelligence · Computer Science 2022-11-17 Matthias Hutsebaut-Buysse, Kevin Mets, Tom De Schepper, Steven Latré

SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation

Natural language instructions for visual navigation often use scene descriptions (e.g., "bedroom") and object references (e.g., "green chairs") to provide a breadcrumb trail to a goal location. This work presents a transformer-based…

Computer Vision and Pattern Recognition · Computer Science 2021-10-28 Abhinav Moudgil, Arjun Majumdar, Harsh Agrawal, Stefan Lee, Dhruv Batra

Visual Semantic Navigation using Scene Priors

How do humans navigate to target objects in novel scenes? Do we use the semantic/functional priors we have built over years to efficiently search and navigate? For example, to search for mugs, we search cabinets near the coffee machine and…

Computer Vision and Pattern Recognition · Computer Science 2018-10-16 Wei Yang, Xiaolong Wang, Ali Farhadi, Abhinav Gupta, Roozbeh Mottaghi

Object Goal Navigation using Data Regularized Q-Learning

Object Goal Navigation requires a robot to find and navigate to an instance of a target object class in a previously unseen environment. Our framework incrementally builds a semantic map of the environment over time, and then repeatedly…

Robotics · Computer Science 2022-08-30 Nandiraju Gireesh, D. A. Sasi Kiran, Snehasis Banerjee, Mohan Sridharan, Brojeshwar Bhowmick, Madhava Krishna