Related papers: Interactive Semantic Map Representation for Skill-…

Visual Representations for Semantic Target Driven Navigation

What is a good visual representation for autonomous agents? We address this question in the context of semantic visual navigation, which is the problem of a robot finding its way through a complex environment to a target object, e.g. go to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Arsalan Mousavian, Alexander Toshev, Marek Fiser, Jana Kosecka, Ayzaan Wahid, James Davidson

Mapping High-level Semantic Regions in Indoor Environments without Object Recognition

Robots require a semantic understanding of their surroundings to operate in an efficient and explainable way in human environments. In the literature, there has been an extensive focus on object labeling and exhaustive scene graph…

Robotics · Computer Science 2024-04-16 Roberto Bigazzi, Lorenzo Baraldi, Shreyas Kousik, Rita Cucchiara, Marco Pavone

Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation

Navigating complex indoor environments requires a deep understanding of the space the robotic agent is acting into to correctly inform the navigation process of the agent towards the goal location. In recent learning-based navigation…

Robotics · Computer Science 2023-10-05 Marco Rosano, Antonino Furnari, Luigi Gulino, Corrado Santoro, Giovanni Maria Farinella

Frontier Semantic Exploration for Visual Target Navigation

This work focuses on the problem of visual target navigation, which is very important for autonomous robots as it is closely related to high-level tasks. To find a special object in unknown environments, classical and learning-based…

Robotics · Computer Science 2023-12-27 Bangguo Yu, Hamidreza Kasaei, Ming Cao

Learning Navigational Visual Representations with Semantic Map Supervision

Being able to perceive the semantics and the spatial structure of the environment is essential for visual navigation of a household robot. However, most existing works only employ visual backbones pre-trained either with independent images…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Yicong Hong, Yang Zhou, Ruiyi Zhang, Franck Dernoncourt, Trung Bui, Stephen Gould, Hao Tan

STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation

Visual navigation requires the robot to reach a specified goal such as an image, based on a sequence of first-person visual observations. While recent learning-based approaches have made significant progress, they often focus on improving…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Hao Ren, Zetong Bi, Yiming Zeng, Zhaoliang Wan, Lu Qi, Hui Cheng

Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery

To autonomously navigate and plan interactions in real-world environments, robots require the ability to robustly perceive and map complex, unstructured surrounding scenes. Besides building an internal representation of the observed scene…

Robotics · Computer Science 2021-05-18 Margarita Grinvald, Fadri Furrer, Tonci Novkovic, Jen Jen Chung, Cesar Cadena, Roland Siegwart, Juan Nieto

Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge

In visual semantic navigation, the robot navigates to a target object with egocentric visual observations and the class label of the target is given. It is a meaningful task inspiring a surge of relevant research. However, most of the…

Artificial Intelligence · Computer Science 2021-09-21 Xinzhu Liu, Di Guo, Huaping Liu, Fuchun Sun

Efficient Manipulation-Enhanced Semantic Mapping With Uncertainty-Informed Action Selection

Service robots operating in cluttered human environments such as homes, offices, and schools cannot rely on predefined object arrangements and must continuously update their semantic and spatial estimates while dealing with possible…

Robotics · Computer Science 2025-09-03 Nils Dengler, Jesper Mücke, Rohit Menon, Maren Bennewitz

Multi-Object Navigation in real environments using hybrid policies

Navigation has been classically solved in robotics through the combination of SLAM and planning. More recently, beyond waypoint planning, problems involving significant components of (visual) high-level reasoning have been explored in…

Robotics · Computer Science 2024-01-26 Assem Sadek, Guillaume Bono, Boris Chidlovskii, Atilla Baskurt, Christian Wolf

Semantic Robot Programming for Goal-Directed Manipulation in Cluttered Scenes

We present the Semantic Robot Programming (SRP) paradigm as a convergence of robot programming by demonstration and semantic mapping. In SRP, a user can directly program a robot manipulator by demonstrating a snapshot of their intended goal…

Robotics · Computer Science 2018-10-22 Zhen Zeng, Zheming Zhou, Zhiqiang Sui, Odest Chadwicke Jenkins

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour, Kowshik Thopalli, Niluthpol Mithun, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator

Interactive robots navigating photo-realistic environments need to be trained to effectively leverage and handle the dynamic nature of dialogue in addition to the challenges underlying vision-and-language navigation (VLN). In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2022-03-17 Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gokhan Tür, Devi Parikh, Dilek Hakkani-Tür

Visual Semantic Navigation with Real Robots

Visual Semantic Navigation (VSN) is the ability of a robot to learn visual semantic information for navigating in unseen environments. These VSN models are typically tested in those virtual environments where they are trained, mainly using…

Robotics · Computer Science 2025-01-10 Carlos Gutiérrez-Álvarez, Pablo Ríos-Navarro, Rafael Flor-Rodríguez, Francisco Javier Acevedo-Rodríguez, Roberto J. López-Sastre

Imaginative World Modeling with Scene Graphs for Embodied Agent Navigation

Semantic navigation requires an agent to navigate toward a specified target in an unseen environment. Employing an imaginative navigation strategy that predicts future scenes before taking action can empower the agent to find target…

Robotics · Computer Science 2025-08-12 Yue Hu, Junzhe Wu, Ruihan Xu, Hang Liu, Avery Xi, Henry X. Liu, Ram Vasudevan, Maani Ghaffari

Active Scene Understanding via Online Semantic Reconstruction

We propose a novel approach to robot-operated active understanding of unknown indoor scenes, based on online RGBD reconstruction with semantic segmentation. In our method, the exploratory robot scanning is both driven by and targeting at…

Graphics · Computer Science 2022-01-14 Lintao Zheng, Chenyang Zhu, Jiazhao Zhang, Hang Zhao, Hui Huang, Matthias Niessner, Kai Xu

How To Not Train Your Dragon: Training-free Embodied Object Goal Navigation with Semantic Frontiers

Object goal navigation is an important problem in Embodied AI that involves guiding the agent to navigate to an instance of the object category in an unknown environment -- typically an indoor scene. Unfortunately, current state-of-the-art…

Computer Vision and Pattern Recognition · Computer Science 2023-05-29 Junting Chen, Guohao Li, Suryansh Kumar, Bernard Ghanem, Fisher Yu

SegmATRon: Embodied Adaptive Semantic Segmentation for Indoor Environment

This paper presents an adaptive transformer model named SegmATRon for embodied image semantic segmentation. Its distinctive feature is the adaptation of model weights during inference on several images using a hybrid multicomponent loss…

Computer Vision and Pattern Recognition · Computer Science 2023-10-19 Tatiana Zemskova, Margarita Kichik, Dmitry Yudin, Aleksei Staroverov, Aleksandr Panov

SGN-CIRL: Scene Graph-based Navigation with Curriculum, Imitation, and Reinforcement Learning

The 3D scene graph models spatial relationships between objects, enabling the agent to efficiently navigate in a partially observable environment and predict the location of the target object. This paper proposes an original framework named…

Robotics · Computer Science 2025-06-06 Nikita Oskolkov, Huzhenyu Zhang, Dmitry Makarov, Dmitry Yudin, Aleksandr Panov

Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning

We present a target-driven navigation system to improve mapless visual navigation in indoor scenes. Our method takes a multi-view observation of a robot and a target as inputs at each time step to provide a sequence of actions that move the…

Robotics · Computer Science 2022-05-10 Qiaoyun Wu, Xiaoxi Gong, Kai Xu, Dinesh Manocha, Jingxuan Dong, Jun Wang