Related papers: Active Semantic Perception

200 papers

We study active perception from first principles to argue that an autonomous agent performing active perception should maximize the mutual information that past observations possess about future ones. Doing so requires (a) a representation…

Robotics · Computer Science 2024-05-07 Siming He, Christopher D. Hsu, Dexter Ong, Yifei Simon Shao, Pratik Chaudhari

Robots require a semantic understanding of their surroundings to operate in an efficient and explainable way in human environments. In the literature, there has been an extensive focus on object labeling and exhaustive scene graph…

Robotics · Computer Science 2024-04-16 Roberto Bigazzi, Lorenzo Baraldi, Shreyas Kousik, Rita Cucchiara, Marco Pavone

This paper addresses the high demand in advanced intelligent robot navigation for a more holistic understanding of spatial environments, by introducing a novel system that harnesses the capabilities of Large Language Models (LLMs) to…

Robotics · Computer Science 2025-03-20 Yao Cheng, Zhe Han, Fengyang Jiang, Huaizhen Wang, Fengyu Zhou, Qingshan Yin, Lei Wei

A comprehensive semantic understanding of a scene is important for many applications - but in what space should diverse semantic information (e.g., objects, scene categories, material types, texture, etc.) be grounded and what should be its…

Computer Vision and Pattern Recognition · Computer Science 2019-10-08 Iro Armeni, Zhi-Yang He, JunYoung Gwak, Amir R. Zamir, Martin Fischer, Jitendra Malik, Silvio Savarese

We introduce a learning-based approach for room navigation using semantic maps. Our proposed architecture learns to predict top-down belief maps of regions that lie beyond the agent's field of view while modeling architectural and stylistic…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh

Cognitive maps play a crucial role in facilitating flexible behaviour by representing spatial and conceptual relationships within an environment. The ability to learn and infer the underlying structure of the environment is crucial for…

Artificial Intelligence · Computer Science 2023-09-20 Daria de Tinguy, Toon Van de Maele, Tim Verbelen, Bart Dhoedt

We introduce the task of predicting functional 3D scene graphs for real-world indoor environments from posed RGB-D images. Unlike traditional 3D scene graphs that focus on spatial relationships of objects, functional 3D scene graphs capture…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Chenyangguang Zhang, Alexandros Delitzas, Fangjinhua Wang, Ruida Zhang, Xiangyang Ji, Marc Pollefeys, Francis Engelmann

We discuss the process of building semantic maps, how to interactively label entities in them, and how to use them to enable context-aware navigation behaviors in human environments. We utilize planar surfaces, such as walls and tables, and…

Robotics · Computer Science 2018-08-15 Akansel Cosgun, Henrik Christensen

A 3D scene graph represents a compact scene model by capturing both the objects present and the semantic relationships between them, making it a promising structure for robotic applications. To effectively interact with users, an embodied…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Tatiana Zemskova, Dmitry Yudin

We present an open-source, real-time implementation of SemanticPaint, a system for geometric reconstruction, object-class segmentation and learning of 3D scenes. Using our system, a user can walk into a room wearing a depth camera and a…

The aim of this work is to establish how accurately a recent semantic-based foveal active perception model is able to complete visual tasks that are regularly performed by humans, namely, scene exploration and visual search. This model…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 João Luzio, Alexandre Bernardino, Plinio Moreno

We present a unified representation for actionable spatial perception: 3D Dynamic Scene Graphs. Scene graphs are directed graphs where nodes represent entities in the scene (e.g. objects, walls, rooms), and edges represent relations (e.g.…

Robotics · Computer Science 2020-06-18 Antoni Rosinol, Arjun Gupta, Marcus Abate, Jingnan Shi, Luca Carlone

Traditional approaches for active mapping focus on building geometric maps. For most real-world applications, however, actionable information is related to semantically meaningful objects in the environment. We propose an approach to the…

Robotics · Computer Science 2023-08-15 Xu Liu, Ankit Prabhu, Fernando Cladera, Ian D. Miller, Lifeng Zhou, Camillo J. Taylor, Vijay Kumar

Effective robotic autonomy in unknown environments demands proactive exploration and precise understanding of both geometry and semantics. In this paper, we propose ActiveSGM, an active semantic mapping framework designed to predict the…

Robotics · Computer Science 2025-11-14 Liyan Chen, Huangying Zhan, Hairong Yin, Yi Xu, Philippos Mordohai

Representing and understanding 3D environments in a structured manner is crucial for autonomous agents to navigate and reason about their surroundings. While traditional Simultaneous Localization and Mapping (SLAM) methods generate metric…

Robotics · Computer Science 2026-02-03 Albert Gassol Puigjaner, Angelos Zacharia, Kostas Alexis

Seamless Human-Robot Interaction is the ultimate goal of developing service robotic systems. For this, the robotic agents have to understand their surroundings to better complete a given task. Semantic scene understanding allows a robotic…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Muraleekrishna Gopinathan, Giang Truong, Jumana Abu-Khalaf

Open-world interactive object search in household environments requires understanding semantic relationships between objects and their surrounding context to guide exploration efficiently. Prior methods either rely on vision-language…

Robotics · Computer Science 2026-05-28 Imen Mahdi, Matteo Cassinelli, Fabien Despinoy, Tim Welschehold, Abhinav Valada

Graph-based representations such as Scene Graphs enable localization in structured indoor environments by matching a locally observed graph, constructed from sensor data, to a prior map. This process is particularly challenging in…

Augmented Reality is a promising technique for human-machine interaction. Especially in robotics, which always considers systems in their environment, it is highly beneficial to display visualizations and receive user input directly in…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Peer Schüett, Max Schwarz, Sven Behnke

Language-guided active sensing is a robotics subtask where a robot with an onboard sensor interacts efficiently with the environment via object manipulation to maximize perceptual information, following given language instructions. These…

Robotics · Computer Science 2024-02-06 Weihan Chen, Hanwen Ren, Ahmed H. Qureshi