Related papers: Visual-Relation Conscious Image Generation from St…

Relationship-Aware Spatial Perception Fusion for Realistic Scene Layout Generation

The significant progress on Generative Adversarial Networks (GANs) have made it possible to generate surprisingly realistic images for single object based on natural language descriptions. However, controlled generation of images for…

Computer Vision and Pattern Recognition · Computer Science 2019-11-14 Hongdong Zheng , Yalong Bai , Wei Zhang , Tao Mei

Exploiting Relationship for Complex-scene Image Generation

The significant progress on Generative Adversarial Networks (GANs) has facilitated realistic single-object image generation based on language input. However, complex-scene generation (with various interactions among multiple objects) still…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Tianyu Hua , Hongdong Zheng , Yalong Bai , Wei Zhang , Xiao-Ping Zhang , Tao Mei

Composition-aware Graphic Layout GAN for Visual-textual Presentation Designs

In this paper, we study the graphic layout generation problem of producing high-quality visual-textual presentation designs for given images. We note that image compositions, which contain not only global semantics but also spatial…

Computer Vision and Pattern Recognition · Computer Science 2022-07-14 Min Zhou , Chenchen Xu , Ye Ma , Tiezheng Ge , Yuning Jiang , Weiwei Xu

StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN)…

Computer Vision and Pattern Recognition · Computer Science 2018-06-29 Han Zhang , Tao Xu , Hongsheng Li , Shaoting Zhang , Xiaogang Wang , Xiaolei Huang , Dimitris Metaxas

Compositional GAN: Learning Image-Conditional Binary Composition

Generative Adversarial Networks (GANs) can produce images of remarkable complexity and realism but are generally structured to sample from a single latent source ignoring the explicit spatial interaction between multiple entities that could…

Computer Vision and Pattern Recognition · Computer Science 2019-04-01 Samaneh Azadi , Deepak Pathak , Sayna Ebrahimi , Trevor Darrell

A Layer-Based Sequential Framework for Scene Generation with GANs

The visual world we sense, interpret and interact everyday is a complex composition of interleaved physical entities. Therefore, it is a very challenging task to generate vivid scenes of similar complexity using computers. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2019-02-05 Mehmet Ozgur Turkoglu , William Thong , Luuk Spreeuwers , Berkay Kicanaoglu

Text to Image Synthesis using Stacked Conditional Variational Autoencoders and Conditional Generative Adversarial Networks

Synthesizing a realistic image from textual description is a major challenge in computer vision. Current text to image synthesis approaches falls short of producing a highresolution image that represent a text descriptor. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2022-08-16 Haileleol Tibebu , Aadil Malik , Varuna De Silva

Deep Structured Generative Models

Deep generative models have shown promising results in generating realistic images, but it is still non-trivial to generate images with complicated structures. The main reason is that most of the current generative models fail to explore…

Machine Learning · Computer Science 2018-07-12 Kun Xu , Haoyu Liang , Jun Zhu , Hang Su , Bo Zhang

Self-Supervised Image-to-Text and Text-to-Image Synthesis

A comprehensive understanding of vision and language and their interrelation are crucial to realize the underlying similarities and differences between these modalities and to learn more generalized, meaningful representations. In recent…

Computer Vision and Pattern Recognition · Computer Science 2021-12-10 Anindya Sundar Das , Sriparna Saha

VITAL: A Visual Interpretation on Text with Adversarial Learning for Image Labeling

In this paper, we propose a novel way to interpret text information by extracting visual feature presentation from multiple high-resolution and photo-realistic synthetic images generated by Text-to-image Generative Adversarial Network (GAN)…

Computer Vision and Pattern Recognition · Computer Science 2019-08-05 Tao Hu , Chengjiang Long , Leheng Zhang , Chunxia Xiao

DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis

Synthesizing high-quality realistic images from text descriptions is a challenging task. Existing text-to-image Generative Adversarial Networks generally employ a stacked architecture as the backbone yet still remain three flaws. First, the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Ming Tao , Hao Tang , Fei Wu , Xiao-Yuan Jing , Bing-Kun Bao , Changsheng Xu

Towards Realistic 3D Embedding via View Alignment

Recent advances in generative adversarial networks (GANs) have achieved great success in automated image composition that generates new images by embedding interested foreground objects into background images automatically. On the other…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Changgong Zhang , Fangneng Zhan , Shijian Lu , Feiying Ma , Xuansong Xie

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

Synthesizing high-quality images from text descriptions is a challenging problem in computer vision and has many practical applications. Samples generated by existing text-to-image approaches can roughly reflect the meaning of the given…

Computer Vision and Pattern Recognition · Computer Science 2017-08-08 Han Zhang , Tao Xu , Hongsheng Li , Shaoting Zhang , Xiaogang Wang , Xiaolei Huang , Dimitris Metaxas

Scene Graph Generation by Iterative Message Passing

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu , Yuke Zhu , Christopher B. Choy , Li Fei-Fei

PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding

Generating an image from a provided descriptive text is quite a challenging task because of the difficulty in incorporating perceptual information (object shapes, colors, and their interactions) along with providing high relevancy related…

Computer Vision and Pattern Recognition · Computer Science 2020-07-03 Kanish Garg , Ajeet kumar Singh , Dorien Herremans , Brejesh Lall

One-shot Scene Graph Generation

As a structured representation of the image content, the visual scene graph (visual relationship) acts as a bridge between computer vision and natural language processing. Existing models on the scene graph generation task notoriously…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Yuyu Guo , Jingkuan Song , Lianli Gao , Heng Tao Shen

SAC-GAN: Structure-Aware Image Composition

We introduce an end-to-end learning framework for image-to-image composition, aiming to plausibly compose an object represented as a cropped patch from an object image into a background scene image. As our approach emphasizes more on…

Computer Vision and Pattern Recognition · Computer Science 2022-12-05 Hang Zhou , Rui Ma , Ling-Xiao Zhang , Lin Gao , Ali Mahdavi-Amiri , Hao Zhang

MOC-GAN: Mixing Objects and Captions to Generate Realistic Images

Generating images with conditional descriptions gains increasing interests in recent years. However, existing conditional inputs are suffering from either unstructured forms (captions) or limited information and expensive labeling (scene…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Tao Ma , Yikang Li

Next Visual Granularity Generation

We propose a novel approach to image generation by decomposing an image into a structured sequence, where each element in the sequence shares the same spatial resolution but differs in the number of unique tokens used, capturing different…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Yikai Wang , Zhouxia Wang , Zhonghua Wu , Qingyi Tao , Kang Liao , Chen Change Loy

Image Generation from Scene Graphs

To truly understand the visual world our models should be able not only to recognize images but also generate them. To this end, there has been exciting recent progress on generating images from natural language descriptions. These methods…

Computer Vision and Pattern Recognition · Computer Science 2018-04-06 Justin Johnson , Agrim Gupta , Li Fei-Fei