Related papers: DDS: Decoupled Dynamic Scene-Graph Generation Netw…

Scene Graph Generation from Objects, Phrases and Region Captions

Object detection, scene graph generation and region captioning, which are three scene understanding tasks at different semantic levels, are tied together: scene graphs are generated on top of objects detected in an image with their pairwise…

Computer Vision and Pattern Recognition · Computer Science 2017-09-18 Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, Xiaogang Wang

Dependency-Aware Discrete Diffusion for Scene Graph Generation

Scene graphs (SGs) represent objects and their relationships as structured graphs, enabling applications in image generation, robotics, and 3D understanding. Recent work suggests that conditioning image generation on scene graphs improves…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Rajalaxmi Rajagopalan, Romit Roy Choudhury

Scenes and Surroundings: Scene Graph Generation using Relation Transformer

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner, Poulami Sinhamahapatra, Volker Tresp

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Zeeshan Hayder, Xuming He

Generating Triples with Adversarial Networks for Scene Graph Construction

Driven by successes in deep learning, computer vision research has begun to move beyond object detection and image classification to more sophisticated tasks like image captioning or visual question answering. Motivating such endeavors is…

Computer Vision and Pattern Recognition · Computer Science 2018-02-09 Matthew Klawonn, Eric Heim

Unconditional Scene Graph Generation

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari

Inter-object Discriminative Graph Modeling for Indoor Scene Recognition

Variable scene layouts and coexisting objects across scenes make indoor scene recognition still a challenging task. Leveraging object information within scenes to enhance the distinguishability of feature representations has emerged as a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-22 Chuanxin Song, Hanbo Wu, Xin Ma

FDSG: Forecasting Dynamic Scene Graphs

Dynamic scene graph generation extends scene graph generation from images to videos by modeling entity relationships and their temporal evolution. However, existing methods either generate scene graphs from observed frames without…

Computer Vision and Pattern Recognition · Computer Science 2025-07-21 Yi Yang, Yuren Cong, Hao Cheng, Bodo Rosenhahn, Michael Ying Yang

Semantic Scene Graph Generation Based on an Edge Dual Scene Graph and Message Passing Neural Network

Along with generative AI, interest in scene graph generation (SGG), which comprehensively captures the relationships and interactions between objects in an image and creates a structured graph-based representation, has significantly…

Computer Vision and Pattern Recognition · Computer Science 2023-11-03 Hyeongjin Kim, Sangwon Kim, Jong Taek Lee, Byoung Chul Ko

Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions

Scene understanding has been of high interest in computer vision. It encompasses not only identifying objects in a scene, but also their relationships within the given context. With this goal, a recent line of works tackles 3D semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Johanna Wald, Helisa Dhamo, Nassir Navab, Federico Tombari

Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Dynamic scene graph generation from a video is challenging due to the temporal dynamics of the scene and the inherent temporal fluctuations of predictions. We hypothesize that capturing long-term temporal dependencies is the key to…

Computer Vision and Pattern Recognition · Computer Science 2022-10-20 Shengyu Feng, Subarna Tripathi, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar

Differentiable Scene Graphs

Reasoning about complex visual scenes involves perception of entities and their relations. Scene graphs provide a natural representation for reasoning tasks, by assigning labels to both entities (nodes) and relations (edges). Unfortunately,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Moshiko Raboh, Roei Herzig, Gal Chechik, Jonathan Berant, Amir Globerson

Scene Synthesis via Uncertainty-Driven Attribute Synchronization

Developing deep neural networks to generate 3D scenes is a fundamental problem in neural synthesis with immediate applications in architectural CAD, computer graphics, as well as in generating virtual robot training environments. This task…

Computer Vision and Pattern Recognition · Computer Science 2021-09-02 Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, Qixing Huang

Attentive Relational Networks for Mapping Images to Scene Graphs

Scene graph generation refers to the task of automatically mapping an image into a semantic structural graph, which requires correctly labeling each extracted object and their interaction relationships. Despite the recent success in object…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Mengshi Qi, Weijian Li, Zhengyuan Yang, Yunhong Wang, Jiebo Luo

Visual Relationship Detection using Scene Graphs: A Survey

Understanding a scene by decoding the visual relationships depicted in an image has been a long studied problem. While the recent advances in deep learning and the usage of deep neural networks have achieved near human accuracy on many…

Computer Vision and Pattern Recognition · Computer Science 2020-05-19 Aniket Agarwal, Ayush Mangal, Vipul

Object-Centric Representation Learning for Enhanced 3D Semantic Scene Graph Prediction

3D Semantic Scene Graph Prediction aims to detect objects and their semantic relationships in 3D scenes, and has emerged as a crucial technology for robotics and AR/VR applications. While previous research has addressed dataset limitations…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 KunHo Heo, GiHyun Kim, SuYeon Kim, MyeongAh Cho

Context-Dependent Diffusion Network for Visual Relationship Detection

Visual relationship detection can bridge the gap between computer vision and natural language for scene understanding of images. Different from pure object recognition tasks, the relation triplets of subject-predicate-object lie on an…

Computer Vision and Pattern Recognition · Computer Science 2018-09-18 Zhen Cui, Chunyan Xu, Wenming Zheng, Jian Yang

Exploiting Relationship for Complex-scene Image Generation

The significant progress on Generative Adversarial Networks (GANs) has facilitated realistic single-object image generation based on language input. However, complex-scene generation (with various interactions among multiple objects) still…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Tianyu Hua, Hongdong Zheng, Yalong Bai, Wei Zhang, Xiao-Ping Zhang, Tao Mei

Scene Graph Generation: A Comprehensive Survey

Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun

LinkNet: Relational Embedding for Scene Graph

Objects and their relationships are critical contents for image understanding. A scene graph provides a structured description that captures these properties of an image. However, reasoning about the relationships between objects is very…

Computer Vision and Pattern Recognition · Computer Science 2018-11-16 Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon