Related papers: GraphMapper: Efficient Visual Navigation by Scene …

3-D Scene Graph: A Sparse and Semantic Representation of Physical Environments for Intelligent Agents

Intelligent agents gather information and perceive semantics within the environments before taking on given tasks. The agents store the collected information in the form of environment models that compactly represent the surrounding…

Computer Vision and Pattern Recognition · Computer Science 2019-08-15 Ue-Hwan Kim, Jin-Man Park, Taek-Jin Song, Jong-Hwan Kim

Hierarchical Representations and Explicit Memory: Learning Effective Navigation Policies on 3D Scene Graphs using Graph Neural Networks

Representations are crucial for a robot to learn effective navigation policies. Recent work has shown that mid-level perceptual abstractions, such as depth estimates or 2D semantic segmentation, lead to more effective policies when provided…

Robotics · Computer Science 2022-05-09 Zachary Ravichandran, Lisa Peng, Nathan Hughes, J. Daniel Griffith, Luca Carlone

A Behavioral Approach to Visual Navigation with Graph Localization Networks

Inspired by research in psychology, we introduce a behavioral approach for visual navigation using topological maps. Our goal is to enable a robot to navigate from one location to another, relying only on its visual input and the…

Computer Vision and Pattern Recognition · Computer Science 2019-03-04 Kevin Chen, Juan Pablo de Vicente, Gabriel Sepulveda, Fei Xia, Alvaro Soto, Marynel Vazquez, Silvio Savarese

Scene Graph Generation by Iterative Message Passing

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

Graph2Nav: 3D Object-Relation Graph Generation to Robot Navigation

We propose Graph2Nav, a real-time 3D object-relation graph generation framework, for autonomous navigation in the real world. Our framework fully generates and exploits both 3D objects and a rich set of semantic relationships among objects…

Robotics · Computer Science 2025-04-24 Tixiao Shan, Abhinav Rajvanshi, Niluthpol Mithun, Han-Pang Chiu

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour, Kowshik Thopalli, Niluthpol Mithun, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

Predicting Topological Maps for Visual Navigation in Unexplored Environments

We propose a robotic learning system for autonomous exploration and navigation in unexplored environments. We are motivated by the idea that even an unseen environment may be familiar from previous experiences in similar environments. The…

Robotics · Computer Science 2022-11-24 Huangying Zhan, Hamid Rezatofighi, Ian Reid

STAGE: Scalable and Traversability-Aware Graph based Exploration Planner for Dynamically Varying Environments

In this article, we propose a novel navigation framework that leverages a two layered graph representation of the environment for efficient large-scale exploration, while it integrates a novel uncertainty awareness scheme to handle dynamic…

Robotics · Computer Science 2024-02-07 Akash Patel, Mario A V Saucedo, Christoforos Kanellakis, George Nikolakopoulos

Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning

Recent approaches on visual scene understanding attempt to build a scene graph -- a computational representation of objects and their pairwise relationships. Such rich semantic representation is very appealing, yet difficult to obtain from…

Computer Vision and Pattern Recognition · Computer Science 2018-11-08 Paul Gay, Stuart James, Alessio Del Bue

Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs

Controllable scene synthesis consists of generating 3D information that satisfy underlying specifications. Thereby, these specifications should be abstract, i.e. allowing easy user interaction, whilst providing enough interface for detailed…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 Helisa Dhamo, Fabian Manhardt, Nassir Navab, Federico Tombari

Scenes and Surroundings: Scene Graph Generation using Relation Transformer

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner, Poulami Sinhamahapatra, Volker Tresp

Mapping High-level Semantic Regions in Indoor Environments without Object Recognition

Robots require a semantic understanding of their surroundings to operate in an efficient and explainable way in human environments. In the literature, there has been an extensive focus on object labeling and exhaustive scene graph…

Robotics · Computer Science 2024-04-16 Roberto Bigazzi, Lorenzo Baraldi, Shreyas Kousik, Rita Cucchiara, Marco Pavone

Trajectory Forecasting on Temporal Graphs

Predicting future locations of agents in the scene is an important problem in self-driving. In recent years, there has been a significant progress in representing the scene and the agents in it. The interactions of agents with the scene and…

Computer Vision and Pattern Recognition · Computer Science 2022-07-04 Görkay Aydemir, Adil Kaan Akan, Fatma Güney

GraphPad: Inference-Time 3D Scene Graph Updates for Embodied Question Answering

Structured scene representations are a core component of embodied agents, helping to consolidate raw sensory streams into readable, modular, and searchable formats. Due to their high computational overhead, many approaches build such…

Artificial Intelligence · Computer Science 2025-06-03 Muhammad Qasim Ali, Saeejith Nair, Alexander Wong, Yuchen Cui, Yuhao Chen

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu, Sathyanarayanan N. Aakur

TopoNav: Topological Graphs as a Key Enabler for Advanced Object Navigation

Object Navigation (ObjectNav) has made great progress with large language models (LLMs), but still faces challenges in memory management, especially in long-horizon tasks and dynamic scenes. To address this, we propose TopoNav, a new…

Robotics · Computer Science 2025-09-03 Peiran Liu, Qiang Zhang, Daojie Peng, Lingfeng Zhang, Yihao Qin, Hang Zhou, Jun Ma, Renjing Xu, Yiding Ji

Visuomotor Understanding for Representation Learning of Driving Scenes

Dashboard cameras capture a tremendous amount of driving scene video each day. These videos are purposefully coupled with vehicle sensing data, such as from the speedometer and inertial sensors, providing an additional sensing modality for…

Computer Vision and Pattern Recognition · Computer Science 2019-09-17 Seokju Lee, Junsik Kim, Tae-Hyun Oh, Yongseop Jeong, Donggeun Yoo, Stephen Lin, In So Kweon

Scene Graph Generation with Geometric Context

Scene Graph Generation has gained much attention in computer vision research with the growing demand in image understanding projects like visual question answering, image captioning, self-driving cars, crowd behavior analysis, activity…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Vishal Kumar, Albert Mundu, Satish Kumar Singh

Graph Canvas for Controllable 3D Scene Generation

Spatial intelligence is foundational to AI systems that interact with the physical world, particularly in 3D scene generation and spatial comprehension. Current methodologies for 3D scene generation often rely heavily on predefined…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Libin Liu, Shen Chen, Sen Jia, Jingzhe Shi, Zhongyu Jiang, Can Jin, Wu Zongkai, Jenq-Neng Hwang, Lei Li

Multiview Scene Graph

A proper scene representation is central to the pursuit of spatial intelligence where agents can robustly reconstruct and efficiently understand 3D scenes. A scene representation is either metric, such as landmark maps in 3D reconstruction,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Juexiao Zhang, Gao Zhu, Sihang Li, Xinhao Liu, Haorui Song, Xinran Tang, Chen Feng