English
Related papers

Related papers: Interactive Semantic Map Representation for Skill-…

200 papers

What is a good visual representation for autonomous agents? We address this question in the context of semantic visual navigation, which is the problem of a robot finding its way through a complex environment to a target object, e.g. go to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Arsalan Mousavian , Alexander Toshev , Marek Fiser , Jana Kosecka , Ayzaan Wahid , James Davidson

Robots require a semantic understanding of their surroundings to operate in an efficient and explainable way in human environments. In the literature, there has been an extensive focus on object labeling and exhaustive scene graph…

Robotics · Computer Science 2024-04-16 Roberto Bigazzi , Lorenzo Baraldi , Shreyas Kousik , Rita Cucchiara , Marco Pavone

Navigating complex indoor environments requires a deep understanding of the space the robotic agent is acting into to correctly inform the navigation process of the agent towards the goal location. In recent learning-based navigation…

Robotics · Computer Science 2023-10-05 Marco Rosano , Antonino Furnari , Luigi Gulino , Corrado Santoro , Giovanni Maria Farinella

This work focuses on the problem of visual target navigation, which is very important for autonomous robots as it is closely related to high-level tasks. To find a special object in unknown environments, classical and learning-based…

Robotics · Computer Science 2023-12-27 Bangguo Yu , Hamidreza Kasaei , Ming Cao

Being able to perceive the semantics and the spatial structure of the environment is essential for visual navigation of a household robot. However, most existing works only employ visual backbones pre-trained either with independent images…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Yicong Hong , Yang Zhou , Ruiyi Zhang , Franck Dernoncourt , Trung Bui , Stephen Gould , Hao Tan

Visual navigation requires the robot to reach a specified goal such as an image, based on a sequence of first-person visual observations. While recent learning-based approaches have made significant progress, they often focus on improving…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Hao Ren , Zetong Bi , Yiming Zeng , Zhaoliang Wan , Lu Qi , Hui Cheng

To autonomously navigate and plan interactions in real-world environments, robots require the ability to robustly perceive and map complex, unstructured surrounding scenes. Besides building an internal representation of the observed scene…

In visual semantic navigation, the robot navigates to a target object with egocentric visual observations and the class label of the target is given. It is a meaningful task inspiring a surge of relevant research. However, most of the…

Artificial Intelligence · Computer Science 2021-09-21 Xinzhu Liu , Di Guo , Huaping Liu , Fuchun Sun

Service robots operating in cluttered human environments such as homes, offices, and schools cannot rely on predefined object arrangements and must continuously update their semantic and spatial estimates while dealing with possible…

Robotics · Computer Science 2025-09-03 Nils Dengler , Jesper Mücke , Rohit Menon , Maren Bennewitz

Navigation has been classically solved in robotics through the combination of SLAM and planning. More recently, beyond waypoint planning, problems involving significant components of (visual) high-level reasoning have been explored in…

Robotics · Computer Science 2024-01-26 Assem Sadek , Guillaume Bono , Boris Chidlovskii , Atilla Baskurt , Christian Wolf

We present the Semantic Robot Programming (SRP) paradigm as a convergence of robot programming by demonstration and semantic mapping. In SRP, a user can directly program a robot manipulator by demonstrating a snapshot of their intended goal…

Robotics · Computer Science 2018-10-22 Zhen Zeng , Zheming Zhou , Zhiqiang Sui , Odest Chadwicke Jenkins

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour , Kowshik Thopalli , Niluthpol Mithun , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar

Interactive robots navigating photo-realistic environments need to be trained to effectively leverage and handle the dynamic nature of dialogue in addition to the challenges underlying vision-and-language navigation (VLN). In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2022-03-17 Ayush Shrivastava , Karthik Gopalakrishnan , Yang Liu , Robinson Piramuthu , Gokhan Tür , Devi Parikh , Dilek Hakkani-Tür

Visual Semantic Navigation (VSN) is the ability of a robot to learn visual semantic information for navigating in unseen environments. These VSN models are typically tested in those virtual environments where they are trained, mainly using…

Semantic navigation requires an agent to navigate toward a specified target in an unseen environment. Employing an imaginative navigation strategy that predicts future scenes before taking action, can empower the agent to find target…

Robotics · Computer Science 2025-08-12 Yue Hu , Junzhe Wu , Ruihan Xu , Hang Liu , Avery Xi , Henry X. Liu , Ram Vasudevan , Maani Ghaffari

We propose a novel approach to robot-operated active understanding of unknown indoor scenes, based on online RGBD reconstruction with semantic segmentation. In our method, the exploratory robot scanning is both driven by and targeting at…

Graphics · Computer Science 2022-01-14 Lintao Zheng , Chenyang Zhu , Jiazhao Zhang , Hang Zhao , Hui Huang , Matthias Niessner , Kai Xu

Object goal navigation is an important problem in Embodied AI that involves guiding the agent to navigate to an instance of the object category in an unknown environment -- typically an indoor scene. Unfortunately, current state-of-the-art…

Computer Vision and Pattern Recognition · Computer Science 2023-05-29 Junting Chen , Guohao Li , Suryansh Kumar , Bernard Ghanem , Fisher Yu

This paper presents an adaptive transformer model named SegmATRon for embodied image semantic segmentation. Its distinctive feature is the adaptation of model weights during inference on several images using a hybrid multicomponent loss…

Computer Vision and Pattern Recognition · Computer Science 2023-10-19 Tatiana Zemskova , Margarita Kichik , Dmitry Yudin , Aleksei Staroverov , Aleksandr Panov

The 3D scene graph models spatial relationships between objects, enabling the agent to efficiently navigate in a partially observable environment and predict the location of the target object.This paper proposes an original framework named…

Robotics · Computer Science 2025-06-06 Nikita Oskolkov , Huzhenyu Zhang , Dmitry Makarov , Dmitry Yudin , Aleksandr Panov

We present a target-driven navigation system to improve mapless visual navigation in indoor scenes. Our method takes a multi-view observation of a robot and a target as inputs at each time step to provide a sequence of actions that move the…

Robotics · Computer Science 2022-05-10 Qiaoyun Wu , Xiaoxi Gong , Kai Xu , Dinesh Manocha , Jingxuan Dong , Jun Wang
‹ Prev 1 2 3 10 Next ›