Related papers: Active Semantic Perception

Active Perception using Neural Radiance Fields

We study active perception from first principles to argue that an autonomous agent performing active perception should maximize the mutual information that past observations posses about future ones. Doing so requires (a) a representation…

Robotics · Computer Science 2024-05-07 Siming He , Christopher D. Hsu , Dexter Ong , Yifei Simon Shao , Pratik Chaudhari

Mapping High-level Semantic Regions in Indoor Environments without Object Recognition

Robots require a semantic understanding of their surroundings to operate in an efficient and explainable way in human environments. In the literature, there has been an extensive focus on object labeling and exhaustive scene graph…

Robotics · Computer Science 2024-04-16 Roberto Bigazzi , Lorenzo Baraldi , Shreyas Kousik , Rita Cucchiara , Marco Pavone

Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs

This paper addresses the high demand in advanced intelligent robot navigation for a more holistic understanding of spatial environments, by introducing a novel system that harnesses the capabilities of Large Language Models (LLMs) to…

Robotics · Computer Science 2025-03-20 Yao Cheng , Zhe Han , Fengyang Jiang , Huaizhen Wang , Fengyu Zhou , Qingshan Yin , Lei Wei

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

A comprehensive semantic understanding of a scene is important for many applications - but in what space should diverse semantic information (e.g., objects, scene categories, material types, texture, etc.) be grounded and what should be its…

Computer Vision and Pattern Recognition · Computer Science 2019-10-08 Iro Armeni , Zhi-Yang He , JunYoung Gwak , Amir R. Zamir , Martin Fischer , Jitendra Malik , Silvio Savarese

Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation

We introduce a learning-based approach for room navigation using semantic maps. Our proposed architecture learns to predict top-down belief maps of regions that lie beyond the agent's field of view while modeling architectural and stylistic…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Medhini Narasimhan , Erik Wijmans , Xinlei Chen , Trevor Darrell , Dhruv Batra , Devi Parikh , Amanpreet Singh

Learning Spatial and Temporal Hierarchies: Hierarchical Active Inference for navigation in Multi-Room Maze Environments

Cognitive maps play a crucial role in facilitating flexible behaviour by representing spatial and conceptual relationships within an environment. The ability to learn and infer the underlying structure of the environment is crucial for…

Artificial Intelligence · Computer Science 2023-09-20 Daria de Tinguy , Toon Van de Maele , Tim Verbelen , Bart Dhoedt

Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

We introduce the task of predicting functional 3D scene graphs for real-world indoor environments from posed RGB-D images. Unlike traditional 3D scene graphs that focus on spatial relationships of objects, functional 3D scene graphs capture…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Chenyangguang Zhang , Alexandros Delitzas , Fangjinhua Wang , Ruida Zhang , Xiangyang Ji , Marc Pollefeys , Francis Engelmann

Context Aware Robot Navigation using Interactively Built Semantic Maps

We discuss the process of building semantic maps, how to interactively label entities in them, and how to use them to enable context-aware navigation behaviors in human environments. We utilize planar surfaces, such as walls and tables, and…

Robotics · Computer Science 2018-08-15 Akansel Cosgun , Henrik Christensen

3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding

A 3D scene graph represents a compact scene model by capturing both the objects present and the semantic relationships between them, making it a promising structure for robotic applications. To effectively interact with users, an embodied…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Tatiana Zemskova , Dmitry Yudin

SemanticPaint: A Framework for the Interactive Segmentation of 3D Scenes

We present an open-source, real-time implementation of SemanticPaint, a system for geometric reconstruction, object-class segmentation and learning of 3D scenes. Using our system, a user can walk into a room wearing a depth camera and a…

Computer Vision and Pattern Recognition · Computer Science 2017-09-05 Stuart Golodetz , Michael Sapienza , Julien P. C. Valentin , Vibhav Vineet , Ming-Ming Cheng , Anurag Arnab , Victor A. Prisacariu , Olaf Kähler , Carl Yuheng Ren , David W. Murray , Shahram Izadi , Philip H. S. Torr

Semantic-Based Active Perception for Humanoid Visual Tasks with Foveal Sensors

The aim of this work is to establish how accurately a recent semantic-based foveal active perception model is able to complete visual tasks that are regularly performed by humans, namely, scene exploration and visual search. This model…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 João Luzio , Alexandre Bernardino , Plinio Moreno

3D Dynamic Scene Graphs: Actionable Spatial Perception with Places, Objects, and Humans

We present a unified representation for actionable spatial perception: 3D Dynamic Scene Graphs. Scene graphs are directed graphs where nodes represent entities in the scene (e.g. objects, walls, rooms), and edges represent relations (e.g.…

Robotics · Computer Science 2020-06-18 Antoni Rosinol , Arjun Gupta , Marcus Abate , Jingnan Shi , Luca Carlone

Active Metric-Semantic Mapping by Multiple Aerial Robots

Traditional approaches for active mapping focus on building geometric maps. For most real-world applications, however, actionable information is related to semantically meaningful objects in the environment. We propose an approach to the…

Robotics · Computer Science 2023-08-15 Xu Liu , Ankit Prabhu , Fernando Cladera , Ian D. Miller , Lifeng Zhou , Camillo J. Taylor , Vijay Kumar

Understanding while Exploring: Semantics-driven Active Mapping

Effective robotic autonomy in unknown environments demands proactive exploration and precise understanding of both geometry and semantics. In this paper, we propose ActiveSGM, an active semantic mapping framework designed to predict the…

Robotics · Computer Science 2025-11-14 Liyan Chen , Huangying Zhan , Hairong Yin , Yi Xu , Philippos Mordohai

Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning

Representing and understanding 3D environments in a structured manner is crucial for autonomous agents to navigate and reason about their surroundings. While traditional Simultaneous Localization and Mapping (SLAM) methods generate metric…

Robotics · Computer Science 2026-02-03 Albert Gassol Puigjaner , Angelos Zacharia , Kostas Alexis

Indoor Semantic Scene Understanding using Multi-modality Fusion

Seamless Human-Robot Interaction is the ultimate goal of developing service robotic systems. For this, the robotic agents have to understand their surroundings to better complete a given task. Semantic scene understanding allows a robotic…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Muraleekrishna Gopinathan , Giang Truong , Jumana Abu-Khalaf

Relational Semantic Reasoning on 3D Scene Graphs for Open World Interactive Object Search

Open-world interactive object search in household environments requires understanding semantic relationships between objects and their surrounding context to guide exploration efficiently. Prior methods either rely on vision-language…

Robotics · Computer Science 2026-05-28 Imen Mahdi , Matteo Cassinelli , Fabien Despinoy , Tim Welschehold , Abhinav Valada

Robust Graph Matching through Semantic Relationship Generation for SLAM

Graph-based representations such as Scene Graphs enable localization in structured indoor environments by matching a locally observed graph, constructed from sensor data, to a prior map. This process is particularly challenging in…

Robotics · Computer Science 2026-04-29 David Perez-Saura , Jose Andres Millan-Romera , Miguel Fernandez-Cortizas , Holger Voos , Pascual Campoy , Jose Luis Sanchez-Lopez

Semantic Interaction in Augmented Reality Environments for Microsoft HoloLens

Augmented Reality is a promising technique for human-machine interaction. Especially in robotics, which always considers systems in their environment, it is highly beneficial to display visualizations and receive user input directly in…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Peer Schüett , Max Schwarz , Sven Behnke

Language-guided Active Sensing of Confined, Cluttered Environments via Object Rearrangement Planning

Language-guided active sensing is a robotics subtask where a robot with an onboard sensor interacts efficiently with the environment via object manipulation to maximize perceptual information, following given language instructions. These…

Robotics · Computer Science 2024-02-06 Weihan Chen , Hanwen Ren , Ahmed H. Qureshi