Related papers: Interactive Image Generation Using Scene Graphs

200 papers

To truly understand the visual world our models should be able not only to recognize images but also generate them. To this end, there has been exciting recent progress on generating images from natural language descriptions. These methods…

Computer Vision and Pattern Recognition · Computer Science · 2018-04-06 · Justin Johnson, Agrim Gupta, Li Fei-Fei

Driven by successes in deep learning, computer vision research has begun to move beyond object detection and image classification to more sophisticated tasks like image captioning or visual question answering. Motivating such endeavors is…

Computer Vision and Pattern Recognition · Computer Science · 2018-02-09 · Matthew Klawonn, Eric Heim

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-01 · Sanjoy Kundu, Sathyanarayanan N. Aakur

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science · 2021-08-13 · Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-13 · Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

Generating images from scene graphs is a challenging task that attracted substantial interest recently. Prior works have approached this task by generating an intermediate layout description of the target image. However, the representation…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-12 · Maor Ivgi, Yaniv Benny, Avichai Ben-David, Jonathan Berant, Lior Wolf

We propose a novel scene graph generation model called Graph R-CNN, that is both effective and efficient at detecting objects and their relations in images. Our model contains a Relation Proposal Network (RePN) that efficiently deals with…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-02 · Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh

Generating images from semantic visual knowledge is a challenging task, that can be useful to condition the synthesis process in complex, subtle, and unambiguous ways, compared to alternatives such as class labels or text descriptions.…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-04 · Renato Sortino, Simone Palazzo, Concetto Spampinato

Generating realistic images from scene graphs asks neural networks to be able to reason about object relationships and compositionality. As a relatively new task, how to properly ensure the generated images comply with scene graphs or how…

Computer Vision and Pattern Recognition · Computer Science · 2019-01-17 · Subarna Tripathi, Anahita Bhiwandiwalla, Alexei Bastidas, Hanlin Tang

Graph-structured scene descriptions can be efficiently used in generative models to control the composition of the generated image. Previous approaches are based on the combination of graph convolutional networks and adversarial methods for…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-09 · Renato Sortino, Simone Palazzo, Concetto Spampinato

Advancements in generative models have sparked significant interest in generating images while adhering to specific structural guidelines. Scene graph to image generation is one such task of generating images which are consistent with the…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-23 · Rameshwar Mishra, A V Subramanyam

Text-to-image models are showcasing the impressive ability to create high-quality and diverse generative images. Nevertheless, the transition from freehand sketches to complex scene images remains challenging using diffusion models. In this…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-10 · Tianyu Zhang, Xiaoxuan Xie, Xusheng Du, Haoran Xie

The visual world we sense, interpret and interact everyday is a complex composition of interleaved physical entities. Therefore, it is a very challenging task to generate vivid scenes of similar complexity using computers. In this work, we…

Computer Vision and Pattern Recognition · Computer Science · 2019-02-05 · Mehmet Ozgur Turkoglu, William Thong, Luuk Spreeuwers, Berkay Kicanaoglu

Controllable scene synthesis consists of generating 3D information that satisfy underlying specifications. Thereby, these specifications should be abstract, i.e. allowing easy user interaction, whilst providing enough interface for detailed…

Computer Vision and Pattern Recognition · Computer Science · 2021-08-20 · Helisa Dhamo, Fabian Manhardt, Nassir Navab, Federico Tombari

Conditional text-to-image generation is an active area of research, with many possible applications. Existing research has primarily focused on generating a single image from available conditioning information in one step. One practical…

Computer Vision and Pattern Recognition · Computer Science · 2019-09-24 · Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor

Despite the great success object detection and segmentation models have achieved in recognizing individual objects in images, performance on cognitive tasks such as image caption, semantic image retrieval, and visual QA is far from…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-25 · Weilin Cong, William Wang, Wang-Chien Lee

The significant progress on Generative Adversarial Networks (GANs) has facilitated realistic single-object image generation based on language input. However, complex-scene generation (with various interactions among multiple objects) still…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-02 · Tianyu Hua, Hongdong Zheng, Yalong Bai, Wei Zhang, Xiao-Ping Zhang, Tao Mei

We propose a new recurrent generative model for generating images from text captions while attending on specific parts of text captions. Our model creates images by incrementally adding patches on a "canvas" while attending on words from…

Computer Vision and Pattern Recognition · Computer Science · 2018-10-09 · Amanpreet Singh, Sharan Agrawal

Recent advancements in text-to-image generation have been propelled by the development of diffusion models and multi-modality learning. However, since text is typically represented sequentially in these models, it often falls short in…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-27 · Guibao Shen, Luozhou Wang, Jiantao Lin, Wenhang Ge, Chaozhe Zhang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Guangyong Chen, Yijun Li, Ying-Cong Chen

Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-04 · Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky