Related papers: Structured Generative Models for Scene Understandi…

Compositional Scene Understanding through Inverse Generative Modeling

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang, Justin Dauwels, Yilun Du

A Comprehensive Survey of Scene Graphs: Generation and Application

Scene graph is a structured representation of a scene that can clearly express the objects, attributes, and relationships between objects in the scene. As computer vision technology continues to develop, people are no longer satisfied with…

Computer Vision and Pattern Recognition · Computer Science 2022-01-10 Xiaojun Chang, Pengzhen Ren, Pengfei Xu, Zhihui Li, Xiaojiang Chen, Alex Hauptmann

Scene Graph Modification as Incremental Structure Expanding

A scene graph is a semantic representation that expresses the objects, attributes, and relationships between objects in a scene. Scene graphs play an important role in many cross modality tasks, as they are able to capture the interactions…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Xuming Hu, Zhijiang Guo, Yu Fu, Lijie Wen, Philip S. Yu

Interpreting Context of Images using Scene Graphs

Understanding a visual scene incorporates objects, relationships, and context. Traditional methods working on an image mostly focus on object detection and fail to capture the relationship between the objects. Relationships can give rich…

Computer Vision and Pattern Recognition · Computer Science 2019-12-03 Himangi Mittal, Ajith Abraham, Anuja Arora

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

A comprehensive semantic understanding of a scene is important for many applications - but in what space should diverse semantic information (e.g., objects, scene categories, material types, texture, etc.) be grounded and what should be its…

Computer Vision and Pattern Recognition · Computer Science 2019-10-08 Iro Armeni, Zhi-Yang He, JunYoung Gwak, Amir R. Zamir, Martin Fischer, Jitendra Malik, Silvio Savarese

Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning

Recent approaches on visual scene understanding attempt to build a scene graph -- a computational representation of objects and their pairwise relationships. Such rich semantic representation is very appealing, yet difficult to obtain from…

Computer Vision and Pattern Recognition · Computer Science 2018-11-08 Paul Gay, Stuart James, Alessio Del Bue

Scene Graph Generation by Iterative Message Passing

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Michael Niemeyer, Andreas Geiger

Unconditional Scene Graph Generation

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu, Sathyanarayanan N. Aakur

Towards Lifelong Scene Graph Generation with Knowledge-ware In-context Prompt Learning

Scene graph generation (SGG) endeavors to predict visual relationships between pairs of objects within an image. Prevailing SGG methods traditionally assume a one-off learning process for SGG. This conventional paradigm may necessitate…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Tao He, Tongtong Wu, Dongyang Zhang, Guiduo Duan, Ke Qin, Yuan-Fang Li

Deep Generative Modeling for Scene Synthesis via Hybrid Representations

We present a deep generative scene modeling technique for indoor environments. Our goal is to train a generative model using a feed-forward neural network that maps a prior distribution (e.g., a normal distribution) to the distribution of…

Computer Vision and Pattern Recognition · Computer Science 2018-08-08 Zaiwei Zhang, Zhenpei Yang, Chongyang Ma, Linjie Luo, Alexander Huth, Etienne Vouga, Qixing Huang

An Interpretable Model for Scene Graph Generation

We propose an efficient and interpretable scene graph generator. We consider three types of features: visual, spatial and semantic, and we use a late fusion strategy such that each feature's contribution can be explicitly investigated. We…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Ji Zhang, Kevin Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal

Scene Graph Generation via Conditional Random Fields

Despite the great success object detection and segmentation models have achieved in recognizing individual objects in images, performance on cognitive tasks such as image caption, semantic image retrieval, and visual QA is far from…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Weilin Cong, William Wang, Wang-Chien Lee

Scene Graph Generation: A Comprehensive Survey

Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun

Multiview Scene Graph

A proper scene representation is central to the pursuit of spatial intelligence where agents can robustly reconstruct and efficiently understand 3D scenes. A scene representation is either metric, such as landmark maps in 3D reconstruction,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Juexiao Zhang, Gao Zhu, Sihang Li, Xinhao Liu, Haorui Song, Xinran Tang, Chen Feng

Learning Structured Representations of Visual Scenes

As the intermediate-level representations bridging the two levels, structured representations of visual scenes, such as visual relationships between pairwise objects, have been shown to not only benefit compositional models in learning to…

Computer Vision and Pattern Recognition · Computer Science 2022-07-12 Meng-Jiun Chiou

Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation

Manipulating images of complex scenes to reconstruct, insert and/or remove specific object instances is a challenging task. Complex scenes contain multiple semantics and objects, which are frequently cluttered or ambiguous, thus hampering…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Pierfrancesco Ardino, Yahui Liu, Elisa Ricci, Bruno Lepri, Marco De Nadai

Video Is Graph: Structured Graph Module for Video Action Recognition

In the field of action recognition, video clips are always treated as ordered frames for subsequent processing. To achieve spatio-temporal perception, existing approaches propose to embed adjacent temporal interaction in the convolutional…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Rongchang Li, Xiao-Jun Wu, Tianyang Xu

Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction

Machine understanding of complex images is a key goal of artificial intelligence. One challenge underlying this task is that visual scenes contain multiple inter-related objects, and that global context plays an important role in…

Machine Learning · Statistics 2018-11-05 Roei Herzig, Moshiko Raboh, Gal Chechik, Jonathan Berant, Amir Globerson