Related papers: Improving Text to Image Generation using Mode-seek…

200 papers

Generative Adversarial Networks (GANs) have extended deep learning to complex generation and translation tasks across different data modalities. However, GANs are notoriously difficult to train: Mode collapse and other instabilities in the…

Neural and Evolutionary Computing · Computer Science 2021-10-29 Santiago Gonzalez, Mohak Kant, Risto Miikkulainen

Recent approaches in generative adversarial networks (GANs) can automatically synthesize realistic images from descriptive text. Despite the overall fair quality, the generated images often expose visible flaws that lack structural…

Computer Vision and Pattern Recognition · Computer Science 2017-08-31 Miriam Cha, Youngjune Gwon, H. T. Kung

Generating images from natural language is one of the primary applications of recent conditional generative models. Besides testing our ability to model conditional, highly dimensional distributions, text to image synthesis has many…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Cristian Bodnar

Text-to-image synthesis aims to generate a photo-realistic image from a given natural language description. Previous works have made significant progress with Generative Adversarial Networks (GANs). Nonetheless, it is still hard to generate…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Eunyeong Jeon, Kunhee Kim, Daijin Kim

As a challenging task, text-to-image generation aims to generate photo-realistic and semantically consistent images according to the given text descriptions. Existing methods mainly extract the text information from only one sentence to…

Computer Vision and Pattern Recognition · Computer Science 2022-09-29 Xintian Wu, Hanbin Zhao, Liangli Zheng, Shouhong Ding, Xi Li

Typical methods for text-to-image synthesis seek to design effective generative architecture to model the text-to-image mapping directly. It is fairly arduous due to the cross-modality translation. In this paper we circumvent this problem…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Jiadong Liang, Wenjie Pei, Feng Lu

Text-to-image generation intends to automatically produce a photo-realistic image, conditioned on a textual description. It can be potentially employed in the field of art creation, data augmentation, photo-editing, etc. Although many…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Zhenxing Zhang, Lambert Schomaker

We describe a new approach that improves the training of generative adversarial nets (GANs) for synthesizing diverse images from a text input. Our approach is based on the conditional version of GANs and expands on previous work leveraging…

Computer Vision and Pattern Recognition · Computer Science 2019-02-07 Miriam Cha, Youngjune L. Gwon, H. T. Kung

Text-to-Image translation has been an active area of research in the recent past. The ability of a network to learn the meaning of a sentence and generate an accurate image that depicts the sentence shows the ability of the model to think more…

Machine Learning · Computer Science 2018-08-15 Satya Krishna Gorti, Jeremy Ma

In the field of computer vision, multimodal image generation has become a research hotspot, especially the task of integrating text, image, and style. In this study, we propose a multimodal image generation method based on Generative…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Chaoyi Tan, Wenqing Zhang, Zhen Qi, Kowei Shih, Xinshi Li, Ao Xiang

The Generative Adversarial Network (GAN) has achieved great success in generating realistic (real-valued) synthetic data. However, convergence issues and difficulties dealing with discrete data hinder the applicability of GAN to text. We…

Machine Learning · Statistics 2017-11-21 Yizhe Zhang, Zhe Gan, Kai Fan, Zhi Chen, Ricardo Henao, Dinghan Shen, Lawrence Carin

Generating realistic images from human texts is one of the most challenging problems in the field of computer vision (CV). The meaning of descriptions given can be roughly reflected by existing text-to-image approaches. In this paper, our…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Mehrshad Momen-Tayefeh

In this paper, we focus on generating realistic images from text descriptions. Current methods first generate an initial image with rough shape and color, and then refine the initial image to a high-resolution one. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Minfeng Zhu, Pingbo Pan, Wei Chen, Yi Yang

We develop an approach for text-to-image generation that embraces additional retrieval images, driven by a combination of implicit visual guidance loss and generative objectives. Unlike most existing text-to-image generation methods which…

Computer Vision and Pattern Recognition · Computer Science 2022-08-19 Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, John Collomosse

Previous text-to-image synthesis algorithms typically use explicit textual instructions to generate/manipulate images accurately, but they have difficulty adapting to guidance in the form of coarsely matched texts. In this work, we attempt…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Mengyao Cui, Zhe Zhu, Shao-Ping Lu, Yulu Yang

Most conditional generation tasks expect diverse outputs given a single conditional context. However, conditional generative adversarial networks (cGANs) often focus on the prior conditional information and ignore the input noise vectors,…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Qi Mao, Hsin-Ying Lee, Hung-Yu Tseng, Siwei Ma, Ming-Hsuan Yang

Fine-grained text to image synthesis involves generating images from texts that belong to different categories. In contrast to general text to image synthesis, in fine-grained synthesis there is high similarity between images of different…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Xu Ouyang, Ying Chen, Kaiyue Zhu, Gady Agam

Generative Adversarial Networks (GANs) are currently widely used as an unsupervised image generation method. Current state-of-the-art GANs can generate photorealistic images with high resolution. However, a large amount of data is required,…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Pengwei Wang

In the context of generative models, text-to-image generation achieved impressive results in recent years. Models using different approaches were proposed and trained in huge datasets of pairs of texts and images. However, some methods rely…

Neural and Evolutionary Computing · Computer Science 2022-07-08 Victor Costa, Nuno Lourenço, João Correia, Penousal Machado

A good Text-to-Image model should not only generate high quality images, but also ensure the consistency between the text and the generated image. Previous models failed to simultaneously fix both sides well. This paper proposes a Gradual…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Bo Yang, Fangxiang Feng, Xiaojie Wang