Related papers: Differentiable Scene Graphs

A Scene Graph Backed Approach to Open Set Semantic Mapping

While Open Set Semantic Mapping and 3D Semantic Scene Graphs (3DSSGs) are established paradigms in robotic perception, deploying them effectively to support high-level reasoning in large-scale, real-world environments remains a significant…

Robotics · Computer Science 2026-02-04 Martin Günther , Felix Igelbrink , Oscar Lima , Lennart Niecksch , Marian Renz , Martin Atzmueller

Dependency-Aware Discrete Diffusion for Scene Graph Generation

Scene graphs (SGs) represent objects and their relationships as structured graphs, enabling applications in image generation, robotics, and 3D understanding. Recent work suggests that conditioning image generation on scene graphs improves…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Rajalaxmi Rajagopalan , Romit Roy Choudhury

Visual Relationship Detection using Scene Graphs: A Survey

Understanding a scene by decoding the visual relationships depicted in an image has been a long studied problem. While the recent advances in deep learning and the usage of deep neural networks have achieved near human accuracy on many…

Computer Vision and Pattern Recognition · Computer Science 2020-05-19 Aniket Agarwal , Ayush Mangal , Vipul

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Zeeshan Hayder , Xuming He

Seeing Beyond the Scene: Enhancing Vision-Language Models with Interactional Reasoning

Traditional scene graphs primarily focus on spatial relationships, limiting vision-language models' (VLMs) ability to reason about complex interactions in visual scenes. This paper addresses two key challenges: (1) conventional…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Dayong Liang , Changmeng Zheng , Zhiyuan Wen , Yi Cai , Xiao-Yong Wei , Qing Li

Image Semantic Relation Generation

Scene graphs provide structured semantic understanding beyond images. For downstream tasks, such as image retrieval, visual question answering, visual relationship detection, and even autonomous vehicle technology, scene graphs can not only…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Mingzhe Du

Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis

Scene understanding is a critical problem in computer vision. In this paper, we propose a 3D point-based scene graph generation ($\mathbf{SGG_{point}}$) framework to effectively bridge perception and reasoning to achieve scene understanding…

Computer Vision and Pattern Recognition · Computer Science 2021-04-01 Chaoyi Zhang , Jianhui Yu , Yang Song , Weidong Cai

FDSG: Forecasting Dynamic Scene Graphs

Dynamic scene graph generation extends scene graph generation from images to videos by modeling entity relationships and their temporal evolution. However, existing methods either generate scene graphs from observed frames without…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Yi Yang , Yuren Cong , Hao Cheng , Bodo Rosenhahn , Michael Ying Yang

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur

Can We Build Scene Graphs, Not Classify Them? FlowSG: Progressive Image-Conditioned Scene Graph Generation with Flow Matching

Scene Graph Generation (SGG) unifies object localization and visual relationship reasoning by predicting boxes and subject-predicate-object triples. Yet most pipelines treat SGG as a one-shot, deterministic classification problem rather…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Xin Hu , Ke Qin , Wen Yin , Yuan-Fang Li , Ming Li , Tao He

Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation

Learning to compose visual relationships from raw images in the form of scene graphs is a highly challenging task due to contextual dependencies, but it is essential in computer vision applications that depend on scene understanding.…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Neau Maëlic , Paulo E. Santos , Anne-Gwenn Bosser , Cédric Buche

Object-Centric Representation Learning for Enhanced 3D Semantic Scene Graph Prediction

3D Semantic Scene Graph Prediction aims to detect objects and their semantic relationships in 3D scenes, and has emerged as a crucial technology for robotics and AR/VR applications. While previous research has addressed dataset limitations…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 KunHo Heo , GiHyun Kim , SuYeon Kim , MyeongAh Cho

Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions

Scene understanding has been of high interest in computer vision. It encompasses not only identifying objects in a scene, but also their relationships within the given context. With this goal, a recent line of works tackles 3D semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Johanna Wald , Helisa Dhamo , Nassir Navab , Federico Tombari

Scene Graph Generation: A Comprehensive Survey

Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Guangming Zhu , Liang Zhang , Youliang Jiang , Yixuan Dang , Haoran Hou , Peiyi Shen , Mingtao Feng , Xia Zhao , Qiguang Miao , Syed Afaq Ali Shah , Mohammed Bennamoun

SGG-R$^{\rm 3}$: From Next-Token Prediction to End-to-End Unbiased Scene Graph Generation

Scene Graph Generation (SGG) structures visual scenes as graphs of objects and their relations. While Multimodal Large Language Models (MLLMs) have advanced end-to-end SGG, current methods are hindered by both a lack of task-specific…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Jiaye Feng , Qixiang Yin , Yuankun Liu , Tong Mo , Weiping Li

DGSG-Mind: Dynamic 3D Gaussian Scene Graphs for Long-Term Scene Understanding and Grounding

Integrating open-vocabulary semantic information into dynamic 3D scene representations is essential for long-term embodied scene understanding. However, existing methods often suffer from fragile instance association due to incomplete…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Luzhou Ge , Xiangyu Zhu , Jinyan Liu , Xuesong Li

3D Dynamic Scene Graphs: Actionable Spatial Perception with Places, Objects, and Humans

We present a unified representation for actionable spatial perception: 3D Dynamic Scene Graphs. Scene graphs are directed graphs where nodes represent entities in the scene (e.g. objects, walls, rooms), and edges represent relations (e.g.…

Robotics · Computer Science 2020-06-18 Antoni Rosinol , Arjun Gupta , Marcus Abate , Jingnan Shi , Luca Carlone

Learning Pixel Representations for Generic Segmentation

Deep learning approaches to generic (non-semantic) segmentation have so far been indirect and relied on edge detection. This is in contrast to semantic segmentation, where DNNs are applied directly. We propose an alternative approach called…

Computer Vision and Pattern Recognition · Computer Science 2019-09-27 Oran Shayer , Michael Lindenbaum

KeySG: Hierarchical Keyframe-Based 3D Scene Graphs

In recent years, 3D scene graphs have emerged as a powerful world representation, offering both geometric accuracy and semantic richness. Combining 3D scene graphs with large language models enables robots to reason, plan, and navigate in…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Abdelrhman Werby , Dennis Rotondi , Fabio Scaparro , Kai O. Arras

Generative Visual Commonsense Answering and Explaining with Generative Scene Graph Constructing

Visual Commonsense Reasoning, which is regarded as one challenging task to pursue advanced visual scene comprehension, has been used to diagnose the reasoning ability of AI systems. However, reliable reasoning requires a good grasp of the…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Fan Yuan , Xiaoyuan Fang , Rong Quan , Jing Li , Wei Bi , Xiaogang Xu , Piji Li