Related papers: Visual Semantic Navigation using Scene Priors

Using Image Priors to Improve Scene Understanding

Semantic segmentation algorithms that can robustly segment objects across multiple camera viewpoints are crucial for assuring navigation and safety in emerging applications such as autonomous driving. Existing algorithms treat each image in…

Computer Vision and Pattern Recognition · Computer Science 2019-10-04 Brigit Schroeder , Hanlin Tang , Alexandre Alahi

Learning to Map for Active Semantic Goal Navigation

We consider the problem of object goal navigation in unseen environments. Solving this problem requires learning of contextual semantic priors, a challenging endeavour given the spatial and semantic variability of indoor environments.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Georgios Georgakis , Bernadette Bucher , Karl Schmeckpeper , Siddharth Singh , Kostas Daniilidis

Learning Embeddings that Capture Spatial Semantics for Indoor Navigation

Incorporating domain-specific priors in search and navigation tasks has shown promising results in improving generalization and sample complexity over end-to-end trained policies. In this work, we study how object embeddings that capture…

Robotics · Computer Science 2021-08-03 Vidhi Jain , Prakhar Agarwal , Shishir Patil , Katia Sycara

Object-oriented Targets for Visual Navigation using Rich Semantic Representations

When searching for an object humans navigate through a scene using semantic information and spatial relationships. We look for an object using our knowledge of its attributes and relationships with other objects to infer the probable…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Jean-Benoit Delbrouck , Stéphane Dupont

Frontier Semantic Exploration for Visual Target Navigation

This work focuses on the problem of visual target navigation, which is very important for autonomous robots as it is closely related to high-level tasks. To find a special object in unknown environments, classical and learning-based…

Robotics · Computer Science 2023-12-27 Bangguo Yu , Hamidreza Kasaei , Ming Cao

Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge

In visual semantic navigation, the robot navigates to a target object with egocentric visual observations and the class label of the target is given. It is a meaningful task inspiring a surge of relevant research. However, most of the…

Artificial Intelligence · Computer Science 2021-09-21 Xinzhu Liu , Di Guo , Huaping Liu , Fuchun Sun

Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection

When given two similar images, humans identify their differences by comparing the appearance (e.g., color, texture) with the help of semantics (e.g., objects, relations). However, mainstream binary change detection models adopt a supervised…

Computer Vision and Pattern Recognition · Computer Science 2025-09-10 Yuhang Gan , Wenjie Xuan , Zhiming Luo , Lei Fang , Zengmao Wang , Juhua Liu , Bo Du

Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning

Two less addressed issues of deep reinforcement learning are (1) lack of generalization capability to new target goals, and (2) data inefficiency i.e., the model requires several (and often costly) episodes of trial and error to converge,…

Computer Vision and Pattern Recognition · Computer Science 2016-09-19 Yuke Zhu , Roozbeh Mottaghi , Eric Kolve , Joseph J. Lim , Abhinav Gupta , Li Fei-Fei , Ali Farhadi

Learning to Navigate Using Mid-Level Visual Priors

How much does having visual priors about the world (e.g. the fact that the world is 3D) assist in learning to perform downstream motor tasks (e.g. navigating a complex environment)? What are the consequences of not utilizing such visual…

Computer Vision and Pattern Recognition · Computer Science 2019-12-25 Alexander Sax , Jeffrey O. Zhang , Bradley Emi , Amir Zamir , Silvio Savarese , Leonidas Guibas , Jitendra Malik

Vision-based Navigation Using Deep Reinforcement Learning

Deep reinforcement learning (RL) has been successfully applied to a variety of game-like environments. However, the application of deep RL to visual navigation with realistic environments is a challenging task. We propose a novel learning…

Robotics · Computer Science 2019-11-12 Jonáš Kulhánek , Erik Derner , Tim de Bruin , Robert Babuška

Visual Semantic Planning using Deep Successor Representations

A crucial capability of real-world intelligent agents is their ability to plan a sequence of actions to achieve their goals in the visual world. In this work, we address the problem of visual semantic planning: the task of predicting a…

Computer Vision and Pattern Recognition · Computer Science 2017-08-17 Yuke Zhu , Daniel Gordon , Eric Kolve , Dieter Fox , Li Fei-Fei , Abhinav Gupta , Roozbeh Mottaghi , Ali Farhadi

Utilizing Semantic Visual Landmarks for Precise Vehicle Navigation

This paper presents a new approach for integrating semantic information for vision-based vehicle navigation. Although vision-based vehicle navigation systems using pre-mapped visual landmarks are capable of achieving submeter level accuracy…

Computer Vision and Pattern Recognition · Computer Science 2018-01-04 Varun Murali , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar

Semantic-Based Active Perception for Humanoid Visual Tasks with Foveal Sensors

The aim of this work is to establish how accurately a recent semantic-based foveal active perception model is able to complete visual tasks that are regularly performed by humans, namely, scene exploration and visual search. This model…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 João Luzio , Alexandre Bernardino , Plinio Moreno

SSCNav: Confidence-Aware Semantic Scene Completion for Visual Semantic Navigation

This paper focuses on visual semantic navigation, the task of producing actions for an active agent to navigate to a specified target object category in an unknown environment. To complete this task, the algorithm should simultaneously…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Yiqing Liang , Boyuan Chen , Shuran Song

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour , Kowshik Thopalli , Niluthpol Mithun , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar

SEEK: Semantic Reasoning for Object Goal Navigation in Real World Inspection Tasks

This paper addresses the problem of object-goal navigation in autonomous inspections in real-world environments. Object-goal navigation is crucial to enable effective inspections in various settings, often requiring the robot to identify…

Robotics · Computer Science 2024-11-19 Muhammad Fadhil Ginting , Sung-Kyun Kim , David D. Fan , Matteo Palieri , Mykel J. Kochenderfer , Ali-akbar Agha-Mohammadi

Learning Occupancy Priors of Human Motion from Semantic Maps of Urban Environments

Understanding and anticipating human activity is an important capability for intelligent systems in mobile robotics, autonomous driving, and video surveillance. While learning from demonstrations with on-site collected trajectory data is a…

Robotics · Computer Science 2021-02-18 Andrey Rudenko , Luigi Palmieri , Johannes Doellinger , Achim J. Lilienthal , Kai O. Arras

Where are the Keys? -- Learning Object-Centric Navigation Policies on Semantic Maps with Graph Convolutional Networks

Emerging object-based SLAM algorithms can build a graph representation of an environment comprising nodes for robot poses and object landmarks. However, while this map will contain static objects such as furniture or appliances, many…

Machine Learning · Computer Science 2021-01-22 Niko Sünderhauf

Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation

Generalisation to unseen contexts remains a challenge for embodied navigation agents. In the context of semantic audio-visual navigation (SAVi) tasks, the notion of generalisation should include both generalising to unseen indoor visual…

Robotics · Computer Science 2022-12-23 Gyan Tatiya , Jonathan Francis , Luca Bondi , Ingrid Navarro , Eric Nyberg , Jivko Sinapov , Jean Oh

Context-aware Human Motion Prediction

The problem of predicting human motion given a sequence of past observations is at the core of many applications in robotics and computer vision. Current state-of-the-art formulate this problem as a sequence-to-sequence task, in which a…

Computer Vision and Pattern Recognition · Computer Science 2020-03-25 Enric Corona , Albert Pumarola , Guillem Alenyà , Francesc Moreno-Noguer