Related papers: Improving Text to Image Generation using Mode-seek…

Evolving GAN Formulations for Higher Quality Image Synthesis

Generative Adversarial Networks (GANs) have extended deep learning to complex generation and translation tasks across different data modalities. However, GANs are notoriously difficult to train: Mode collapse and other instabilities in the…

Neural and Evolutionary Computing · Computer Science 2021-10-29 Santiago Gonzalez , Mohak Kant , Risto Miikkulainen

Adversarial nets with perceptual losses for text-to-image synthesis

Recent approaches in generative adversarial networks (GANs) can automatically synthesize realistic images from descriptive text. Despite the overall fair quality, the generated images often expose visible flaws that lack structural…

Computer Vision and Pattern Recognition · Computer Science 2017-08-31 Miriam Cha , Youngjune Gwon , H. T. Kung

Text to Image Synthesis Using Generative Adversarial Networks

Generating images from natural language is one of the primary applications of recent conditional generative models. Besides testing our ability to model conditional, highly dimensional distributions, text to image synthesis has many…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Cristian Bodnar

FA-GAN: Feature-Aware GAN for Text to Image Synthesis

Text-to-image synthesis aims to generate a photo-realistic image from a given natural language description. Previous works have made significant progress with Generative Adversarial Networks (GANs). Nonetheless, it is still hard to generate…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Eunyeong Jeon , Kunhee Kim , Daijin Kim

Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation

As a challenging task, text-to-image generation aims to generate photo-realistic and semantically consistent images according to the given text descriptions. Existing methods mainly extract the text information from only one sentence to…

Computer Vision and Pattern Recognition · Computer Science 2022-09-29 Xintian Wu , Hanbin Zhao , Liangli Zheng , Shouhong Ding , Xi Li

CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis

Typical methods for text-to-image synthesis seek to design effective generative architecture to model the text-to-image mapping directly. It is fairly arduous due to the cross-modality translation. In this paper we circumvent this problem…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Jiadong Liang , Wenjie Pei , Feng Lu

OptGAN: Optimizing and Interpreting the Latent Space of the Conditional Text-to-Image GANs

Text-to-image generation intends to automatically produce a photo-realistic image, conditioned on a textual description. It can be potentially employed in the field of art creation, data augmentation, photo-editing, etc. Although many…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Zhenxing Zhang , Lambert Schomaker

Adversarial Learning of Semantic Relevance in Text to Image Synthesis

We describe a new approach that improves the training of generative adversarial nets (GANs) for synthesizing diverse images from a text input. Our approach is based on the conditional version of GANs and expands on previous work leveraging…

Computer Vision and Pattern Recognition · Computer Science 2019-02-07 Miriam Cha , Youngjune L. Gwon , H. T. Kung

Text-to-Image-to-Text Translation using Cycle Consistent Adversarial Networks

Text-to-Image translation has been an active area of research in the recent past. The ability for a network to learn the meaning of a sentence and generate an accurate image that depicts the sentence shows ability of the model to think more…

Machine Learning · Computer Science 2018-08-15 Satya Krishna Gorti , Jeremy Ma

Generating Multimodal Images with GAN: Integrating Text, Image, and Style

In the field of computer vision, multimodal image generation has become a research hotspot, especially the task of integrating text, image, and style. In this study, we propose a multimodal image generation method based on Generative…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Chaoyi Tan , Wenqing Zhang , Zhen Qi , Kowei Shih , Xinshi Li , Ao Xiang

Adversarial Feature Matching for Text Generation

The Generative Adversarial Network (GAN) has achieved great success in generating realistic (real-valued) synthetic data. However, convergence issues and difficulties dealing with discrete data hinder the applicability of GAN to text. We…

Machine Learning · Statistics 2017-11-21 Yizhe Zhang , Zhe Gan , Kai Fan , Zhi Chen , Ricardo Henao , Dinghan Shen , Lawrence Carin

Text-To-Image with Generative Adversarial Networks

Generating realistic images from human texts is one of the most challenging problems in the field of computer vision (CV). The meaning of descriptions given can be roughly reflected by existing text-to-image approaches. In this paper, our…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Mehrshad Momen-Tayefeh

DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis

In this paper, we focus on generating realistic images from text descriptions. Current methods first generate an initial image with rough shape and color, and then refine the initial image to a high-resolution one. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Minfeng Zhu , Pingbo Pan , Wei Chen , Yi Yang

Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork

We develop an approach for text-to-image generation that embraces additional retrieval images, driven by a combination of implicit visual guidance loss and generative objectives. Unlike most existing text-to-image generation methods which…

Computer Vision and Pattern Recognition · Computer Science 2022-08-19 Xin Yuan , Zhe Lin , Jason Kuen , Jianming Zhang , John Collomosse

Style Generation: Image Synthesis based on Coarsely Matched Texts

Previous text-to-image synthesis algorithms typically use explicit textual instructions to generate/manipulate images accurately, but they have difficulty adapting to guidance in the form of coarsely matched texts. In this work, we attempt…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Mengyao Cui , Zhe Zhu , Shao-Ping Lu , Yulu Yang

Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis

Most conditional generation tasks expect diverse outputs given a single conditional context. However, conditional generative adversarial networks (cGANs) often focus on the prior conditional information and ignore the input noise vectors,…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Qi Mao , Hsin-Ying Lee , Hung-Yu Tseng , Siwei Ma , Ming-Hsuan Yang

Fine-grained Text to Image Synthesis

Fine-grained text to image synthesis involves generating images from texts that belong to different categories. In contrast to general text to image synthesis, in fine-grained synthesis there is high similarity between images of different…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Xu Ouyang , Ying Chen , Kaiyue Zhu , Gady Agam

Latent Space is Feature Space: Regularization Term for GANs Training on Limited Dataset

Generative Adversarial Networks (GAN) is currently widely used as an unsupervised image generation method. Current state-of-the-art GANs can generate photorealistic images with high resolution. However, a large amount of data is required,…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Pengwei Wang

Exploring Generative Adversarial Networks for Text-to-Image Generation with Evolution Strategies

In the context of generative models, text-to-image generation achieved impressive results in recent years. Models using different approaches were proposed and trained in huge datasets of pairs of texts and images. However, some methods rely…

Neural and Evolutionary Computing · Computer Science 2022-07-08 Victor Costa , Nuno Lourenço , João Correia , Penousal Machado

GR-GAN: Gradual Refinement Text-to-image Generation

A good Text-to-Image model should not only generate high quality images, but also ensure the consistency between the text and the generated image. Previous models failed to simultaneously fix both sides well. This paper proposes a Gradual…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Bo Yang , Fangxiang Feng , Xiaojie Wang