Related papers: Dependency-Aware Discrete Diffusion for Scene Grap…

200 papers

Diffusion models excel in image generation but lack detailed semantic control using text prompts. Additional techniques have been developed to address this limitation. However, conditioning diffusion models solely on text-based descriptions…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Frank Fundel

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari

Advancements in generative models have sparked significant interest in generating images while adhering to specific structural guidelines. Scene graph to image generation is one such task of generating images which are consistent with the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Rameshwar Mishra, A V Subramanyam

Learning the underlying distribution of molecular graphs and generating high-fidelity samples is a fundamental research problem in drug discovery and material science. However, accurately modeling distribution and rapidly generating novel…

Machine Learning · Computer Science 2023-05-24 Han Huang, Leilei Sun, Bowen Du, Weifeng Lv

We introduce a framework for joint grounded scene graph - image generation, a challenging task involving high-dimensional, multi-modal structured data. To effectively model this complex joint distribution, we adopt a factorized approach:…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Bicheng Xu, Qi Yan, Renjie Liao, Lele Wang, Leonid Sigal

Text-conditioned image generation has made significant progress in recent years with generative adversarial networks and, more recently, diffusion models. While diffusion models conditioned on text prompts have produced impressive and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-01 Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen, Björn Ommer, Nassir Navab

Diffusion models achieve state-of-the-art performance in generating realistic objects and have been successfully applied to images, text, and videos. Recent work has shown that diffusion can also be defined on graphs, including graph…

Machine Learning · Computer Science 2023-02-09 Alex M. Tseng, Nathaniel Diamant, Tommaso Biancalani, Gabriele Scalia

In this paper, we learn a diffusion model to generate 3D data on a scene-scale. Specifically, our model crafts a 3D scene consisting of multiple objects, while recent diffusion research has focused on a single object. To realize our goal,…

Computer Vision and Pattern Recognition · Computer Science 2023-01-03 Jumin Lee, Woobin Im, Sebin Lee, Sung-Eui Yoon

Text-to-image models are showcasing the impressive ability to create high-quality and diverse generative images. Nevertheless, the transition from freehand sketches to complex scene images remains challenging using diffusion models. In this…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Tianyu Zhang, Xiaoxuan Xie, Xusheng Du, Haoran Xie

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner, Poulami Sinhamahapatra, Volker Tresp

Graph is a prevalent discrete data structure, whose generation has wide applications such as drug discovery and circuit design. Diffusion generative models, as an emerging research focus, have been applied to graph generation tasks.…

Machine Learning · Computer Science 2024-11-05 Zhe Xu, Ruizhong Qiu, Yuzhong Chen, Huiyuan Chen, Xiran Fan, Menghai Pan, Zhichen Zeng, Mahashweta Das, Hanghang Tong

Generating images from graph-structured inputs, such as scene graphs, is uniquely challenging due to the difficulty of aligning nodes and connections in graphs with objects and their relations in images. Most existing methods address this…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Ling Yang, Zhilin Huang, Yang Song, Shenda Hong, Guohao Li, Wentao Zhang, Bin Cui, Bernard Ghanem, Ming-Hsuan Yang

There has been exciting progress in generating images from natural language or layout conditions. However, these methods struggle to faithfully reproduce complex scenes due to the insufficient modeling of multiple objects and their…

Computer Vision and Pattern Recognition · Computer Science 2024-10-02 Yunnan Wang, Ziqiang Li, Zequn Zhang, Wenyao Zhang, Baao Xie, Xihui Liu, Wenjun Zeng, Xin Jin

Reasoning about complex visual scenes involves perception of entities and their relations. Scene graphs provide a natural representation for reasoning tasks, by assigning labels to both entities (nodes) and relations (edges). Unfortunately,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Moshiko Raboh, Roei Herzig, Gal Chechik, Jonathan Berant, Amir Globerson

Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges…

Computer Vision and Pattern Recognition · Computer Science 2021-10-04 Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky

Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Michael Niemeyer, Andreas Geiger

Dynamic scene graph generation from a video is challenging due to the temporal dynamics of the scene and the inherent temporal fluctuations of predictions. We hypothesize that capturing long-term temporal dependencies is the key to…

Computer Vision and Pattern Recognition · Computer Science 2022-10-20 Shengyu Feng, Subarna Tripathi, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar

Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Zeeshan Hayder, Xuming He

Diffusion models have become a new generative paradigm for text generation. Considering the discrete categorical nature of text, in this paper, we propose GlyphDiffusion, a novel diffusion approach for text generation via text-guided image…

Computation and Language · Computer Science 2023-05-09 Junyi Li, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen

Scene Graph Generation (SGG) is a visual understanding task, aiming to describe a scene as a graph of entities and their relationships with each other. Existing works rely on location labels in the form of bounding boxes or segmentation masks,…

Computer Vision and Pattern Recognition · Computer Science 2025-01-22 Ege Özsoy, Felix Holm, Mahdi Saleh, Tobias Czempiel, Chantal Pellegrini, Nassir Navab, Benjamin Busam