Related papers: Image Semantic Relation Generation

Visual Relationship Detection using Scene Graphs: A Survey

Understanding a scene by decoding the visual relationships depicted in an image has been a long studied problem. While the recent advances in deep learning and the usage of deep neural networks have achieved near human accuracy on many…

Computer Vision and Pattern Recognition · Computer Science 2020-05-19 Aniket Agarwal, Ayush Mangal, Vipul

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Zeeshan Hayder, Xuming He

Seeing Beyond the Scene: Enhancing Vision-Language Models with Interactional Reasoning

Traditional scene graphs primarily focus on spatial relationships, limiting vision-language models' (VLMs) ability to reason about complex interactions in visual scenes. This paper addresses two key challenges: (1) conventional…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Dayong Liang, Changmeng Zheng, Zhiyuan Wen, Yi Cai, Xiao-Yong Wei, Qing Li

Scene Graph Generation from Objects, Phrases and Region Captions

Object detection, scene graph generation and region captioning, which are three scene understanding tasks at different semantic levels, are tied together: scene graphs are generated on top of objects detected in an image with their pairwise…

Computer Vision and Pattern Recognition · Computer Science 2017-09-18 Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, Xiaogang Wang

Scene Graph Generation by Iterative Message Passing

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Panoptic Scene Graph Generation (PSG) aims to segment objects and recognize their relations, enabling the structured understanding of an image. Previous methods focus on predicting predefined object and relation categories, hence limiting…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Zijian Zhou, Zheng Zhu, Holger Caesar, Miaojing Shi

A Comprehensive Survey of Scene Graphs: Generation and Application

Scene graph is a structured representation of a scene that can clearly express the objects, attributes, and relationships between objects in the scene. As computer vision technology continues to develop, people are no longer satisfied with…

Computer Vision and Pattern Recognition · Computer Science 2022-01-10 Xiaojun Chang, Pengzhen Ren, Pengfei Xu, Zhihui Li, Xiaojiang Chen, Alex Hauptmann

Learning to Generate Scene Graph from Natural Language Supervision

Learning from image-text data has demonstrated recent success for many recognition tasks, yet is currently limited to visual features or individual visual concepts such as objects. In this paper, we propose one of the first methods that…

Computer Vision and Pattern Recognition · Computer Science 2021-09-07 Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li

Adaptive Visual Scene Understanding: Incremental Scene Graph Generation

Scene graph generation (SGG) analyzes images to extract meaningful information about objects and their relationships. In the dynamic visual world, it is crucial for AI systems to continuously detect new objects and establish their…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Naitik Khandelwal, Xiao Liu, Mengmi Zhang

Panoptic Scene Graph Generation

Existing research addresses scene graph generation (SGG) -- a critical technology for scene understanding in images -- from a detection perspective, i.e., objects are detected using bounding boxes followed by prediction of their pairwise…

Computer Vision and Pattern Recognition · Computer Science 2022-07-25 Jingkang Yang, Yi Zhe Ang, Zujin Guo, Kaiyang Zhou, Wayne Zhang, Ziwei Liu

Scene Graph Generation: A Comprehensive Survey

Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun

Location-Free Scene Graph Generation

Scene Graph Generation (SGG) is a visual understanding task, aiming to describe a scene as a graph of entities and their relationships with each other. Existing works rely on location labels in form of bounding boxes or segmentation masks,…

Computer Vision and Pattern Recognition · Computer Science 2025-01-22 Ege Özsoy, Felix Holm, Mahdi Saleh, Tobias Czempiel, Chantal Pellegrini, Nassir Navab, Benjamin Busam

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu, Sathyanarayanan N. Aakur

Semantic Scene Graph Generation Based on an Edge Dual Scene Graph and Message Passing Neural Network

Along with generative AI, interest in scene graph generation (SGG), which comprehensively captures the relationships and interactions between objects in an image and creates a structured graph-based representation, has significantly…

Computer Vision and Pattern Recognition · Computer Science 2023-11-03 Hyeongjin Kim, Sangwon Kim, Jong Taek Lee, Byoung Chul Ko

Using Scene Graph Context to Improve Image Generation

Generating realistic images from scene graphs asks neural networks to be able to reason about object relationships and compositionality. As a relatively new task, how to properly ensure the generated images comply with scene graphs or how…

Computer Vision and Pattern Recognition · Computer Science 2019-01-17 Subarna Tripathi, Anahita Bhiwandiwalla, Alexei Bastidas, Hanlin Tang

Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval

Image-text retrieval of natural scenes has been a popular research topic. Since image and text are heterogeneous cross-modal data, one of the key challenges is how to learn comprehensive yet unified representations to express the…

Computer Vision and Pattern Recognition · Computer Science 2019-10-14 Sijin Wang, Ruiping Wang, Ziwei Yao, Shiguang Shan, Xilin Chen

Image Scene Graph Generation (SGG) Benchmark

There is a surge of interest in image scene graph generation (object, attribute and relationship detection) due to the need of building fine-grained image understanding models that go beyond object detection. Due to the lack of a good…

Computer Vision and Pattern Recognition · Computer Science 2021-07-28 Xiaotian Han, Jianwei Yang, Houdong Hu, Lei Zhang, Jianfeng Gao, Pengchuan Zhang

Scenes and Surroundings: Scene Graph Generation using Relation Transformer

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner, Poulami Sinhamahapatra, Volker Tresp

Open World Scene Graph Generation using Vision Language Models

Scene-Graph Generation (SGG) seeks to recognize objects in an image and distill their salient pairwise relationships. Most methods depend on dataset-specific supervision to learn the variety of interactions, restricting their usefulness in…

Computer Vision and Pattern Recognition · Computer Science 2025-06-11 Amartya Dutta, Kazi Sajeed Mehrab, Medha Sawhney, Abhilash Neog, Mridul Khurana, Sepideh Fatemi, Aanish Pradhan, M. Maruf, Ismini Lourentzou, Arka Daw, Anuj Karpatne

Differentiable Scene Graphs

Reasoning about complex visual scenes involves perception of entities and their relations. Scene graphs provide a natural representation for reasoning tasks, by assigning labels to both entities (nodes) and relations (edges). Unfortunately,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Moshiko Raboh, Roei Herzig, Gal Chechik, Jonathan Berant, Amir Globerson