Related papers: Generative Compositional Augmentations for Scene G…

Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation

Scene graph generation (SGG) aims to predict graph-structured descriptions of input images, in the form of objects and relationships between them. This task is becoming increasingly useful for progress at the interface of vision and…

Computer Vision and Pattern Recognition · Computer Science 2020-08-19 Boris Knyazev , Harm de Vries , Cătălina Cangea , Graham W. Taylor , Aaron Courville , Eugene Belilovsky

Iterative Scene Graph Generation

The task of scene graph generation entails identifying object entities and their corresponding interaction predicates in a given image (or video). Due to the combinatorially large solution space, existing approaches to scene graph…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Siddhesh Khandelwal , Leonid Sigal

Compositional Scene Understanding through Inverse Generative Modeling

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang , Justin Dauwels , Yilun Du

Semantic Compositional Learning for Low-shot Scene Graph Generation

Scene graphs provide valuable information to many downstream tasks. Many scene graph generation (SGG) models solely use the limited annotated relation triples for training, leading to their underperformance on low-shot (few and zero)…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 Tao He , Lianli Gao , Jingkuan Song , Jianfei Cai , Yuan-Fang Li

Exploiting Relationship for Complex-scene Image Generation

The significant progress on Generative Adversarial Networks (GANs) has facilitated realistic single-object image generation based on language input. However, complex-scene generation (with various interactions among multiple objects) still…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Tianyu Hua , Hongdong Zheng , Yalong Bai , Wei Zhang , Xiao-Ping Zhang , Tao Mei

Compositional GAN: Learning Image-Conditional Binary Composition

Generative Adversarial Networks (GANs) can produce images of remarkable complexity and realism but are generally structured to sample from a single latent source ignoring the explicit spatial interaction between multiple entities that could…

Computer Vision and Pattern Recognition · Computer Science 2019-04-01 Samaneh Azadi , Deepak Pathak , Sayna Ebrahimi , Trevor Darrell

Iterative Scene Graph Generation with Generative Transformers

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur

Investigating Object Compositionality in Generative Adversarial Networks

Deep generative models seek to recover the process with which the observed data was generated. They may be used to synthesize new samples or to subsequently extract representations. Successful approaches in the domain of images are driven…

Computer Vision and Pattern Recognition · Computer Science 2020-07-27 Sjoerd van Steenkiste , Karol Kurach , Jürgen Schmidhuber , Sylvain Gelly

Dependency-Aware Discrete Diffusion for Scene Graph Generation

Scene graphs (SGs) represent objects and their relationships as structured graphs, enabling applications in image generation, robotics, and 3D understanding. Recent work suggests that conditioning image generation on scene graphs improves…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Rajalaxmi Rajagopalan , Romit Roy Choudhury

Compose by Focus: Scene Graph-based Atomic Skills

A key requirement for generalist robots is compositional generalization - the ability to combine atomic skills to solve complex, long-horizon tasks. While prior work has primarily focused on synthesizing a planner that sequences pre-learned…

Robotics · Computer Science 2026-03-10 Han Qi , Changhe Chen , Heng Yang

Unconditional Scene Graph Generation

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg , Helisa Dhamo , Azade Farshad , Sabrina Musatian , Nassir Navab , Federico Tombari

Relationship-Aware Spatial Perception Fusion for Realistic Scene Layout Generation

The significant progress on Generative Adversarial Networks (GANs) have made it possible to generate surprisingly realistic images for single object based on natural language descriptions. However, controlled generation of images for…

Computer Vision and Pattern Recognition · Computer Science 2019-11-14 Hongdong Zheng , Yalong Bai , Wei Zhang , Tao Mei

Scene Graph Generation by Iterative Message Passing

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu , Yuke Zhu , Christopher B. Choy , Li Fei-Fei

Scene Graph Generation with External Knowledge and Image Reconstruction

Scene graph generation has received growing attention with the advancements in image understanding tasks such as object detection, attributes and relationship prediction,~\etc. However, existing datasets are biased in terms of object and…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Jiuxiang Gu , Handong Zhao , Zhe Lin , Sheng Li , Jianfei Cai , Mingyang Ling

Semantic Palette: Guiding Scene Generation with Class Proportions

Despite the recent progress of generative adversarial networks (GANs) at synthesizing photo-realistic images, producing complex urban scenes remains a challenging problem. Previous works break down scene generation into two consecutive…

Computer Vision and Pattern Recognition · Computer Science 2021-06-04 Guillaume Le Moing , Tuan-Hung Vu , Himalaya Jain , Patrick Pérez , Matthieu Cord

Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs

Advancements in generative models have sparked significant interest in generating images while adhering to specific structural guidelines. Scene graph to image generation is one such task of generating images which are consistent with the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Rameshwar Mishra , A V Subramanyam

Learning Predicates as Functions to Enable Few-shot Scene Graph Prediction

Scene graph prediction --- classifying the set of objects and predicates in a visual scene --- requires substantial training data. However, most predicates only occur a handful of times making them difficult to learn. We introduce the first…

Computer Vision and Pattern Recognition · Computer Science 2019-12-09 Apoorva Dornadula , Austin Narcomey , Ranjay Krishna , Michael Bernstein , Li Fei-Fei

Adaptive Scene Category Discovery with Generative Learning and Compositional Sampling

This paper investigates a general framework to discover categories of unlabeled scene images according to their appearances (i.e., textures and structures). We jointly solve the two coupled tasks in an unsupervised manner: (i) classifying…

Computer Vision and Pattern Recognition · Computer Science 2015-02-03 Liang Lin , Ruimao Zhang , Xiaohua Duan

Unbiased Scene Graph Generation using Predicate Similarities

Scene Graphs are widely applied in computer vision as a graphical representation of relationships between objects shown in images. However, these applications have not yet reached a practical stage of development owing to biased training…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Misaki Ohashi , Yusuke Matsui

Scene Graph Prediction with Limited Labels

Visual knowledge bases such as Visual Genome power numerous applications in computer vision, including visual question answering and captioning, but suffer from sparse, incomplete relationships. All scene graph models to date are limited to…

Computer Vision and Pattern Recognition · Computer Science 2019-12-03 Vincent S. Chen , Paroma Varma , Ranjay Krishna , Michael Bernstein , Christopher Re , Li Fei-Fei