Related papers: Memory-Driven Text-to-Image Generation

Semi-parametric Image Synthesis

We present a semi-parametric approach to photographic image synthesis from semantic layouts. The approach combines the complementary strengths of parametric and nonparametric techniques. The nonparametric component is a memory bank of image…

Computer Vision and Pattern Recognition · Computer Science 2018-05-01 Xiaojuan Qi , Qifeng Chen , Jiaya Jia , Vladlen Koltun

Semi-Parametric Neural Image Synthesis

Novel architectures have recently improved generative image synthesis leading to excellent visual quality in various tasks. Much of this success is due to the scalability of these architectures and hence caused by a dramatic increase in…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Andreas Blattmann , Robin Rombach , Kaan Oktay , Jonas Müller , Björn Ommer

Semi-Supervised Image-to-Image Translation

Image-to-image translation is a long-established and a difficult problem in computer vision. In this paper we propose an adversarial based model for image-to-image translation. The regular deep neural-network based methods perform the task…

Computer Vision and Pattern Recognition · Computer Science 2019-01-25 Manan Oza , Himanshu Vaghela , Sudhir Bagul

DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis

In this paper, we focus on generating realistic images from text descriptions. Current methods first generate an initial image with rough shape and color, and then refine the initial image to a high-resolution one. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Minfeng Zhu , Pingbo Pan , Wei Chen , Yi Yang

Semantic Draw Engineering for Text-to-Image Creation

Text-to-image generation is conducted through Generative Adversarial Networks (GANs) or transformer models. However, the current challenge lies in accurately generating images based on textual descriptions, especially in scenarios where the…

Human-Computer Interaction · Computer Science 2024-01-10 Yang Li , Huaqiang Jiang , Yangkai Wu

Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation

As a challenging task, text-to-image generation aims to generate photo-realistic and semantically consistent images according to the given text descriptions. Existing methods mainly extract the text information from only one sentence to…

Computer Vision and Pattern Recognition · Computer Science 2022-09-29 Xintian Wu , Hanbin Zhao , Liangli Zheng , Shouhong Ding , Xi Li

Image-to-Image Translation with Text Guidance

The goal of this paper is to embed controllable factors, i.e., natural language descriptions, into image-to-image translation with generative adversarial networks, which allows text descriptions to determine the visual attributes of…

Computer Vision and Pattern Recognition · Computer Science 2020-02-14 Bowen Li , Xiaojuan Qi , Philip H. S. Torr , Thomas Lukasiewicz

Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork

We develop an approach for text-to-image generation that embraces additional retrieval images, driven by a combination of implicit visual guidance loss and generative objectives. Unlike most existing text-to-image generation methods which…

Computer Vision and Pattern Recognition · Computer Science 2022-08-19 Xin Yuan , Zhe Lin , Jason Kuen , Jianming Zhang , John Collomosse

Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network

This paper presents a novel method to deal with the challenging task of generating photographic images conditioned on semantic image descriptions. Our method introduces accompanying hierarchical-nested adversarial objectives inside the…

Computer Vision and Pattern Recognition · Computer Science 2018-04-10 Zizhao Zhang , Yuanpu Xie , Lin Yang

Efficient Neural Architecture for Text-to-Image Synthesis

Text-to-image synthesis is the task of generating images from text descriptions. Image generation, by itself, is a challenging task. When we combine image generation and text, we bring complexity to a new level: we need to combine data from…

Machine Learning · Computer Science 2020-04-27 Douglas M. Souza , Jônatas Wehrmann , Duncan D. Ruiz

FA-GAN: Feature-Aware GAN for Text to Image Synthesis

Text-to-image synthesis aims to generate a photo-realistic image from a given natural language description. Previous works have made significant progress with Generative Adversarial Networks (GANs). Nonetheless, it is still hard to generate…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Eunyeong Jeon , Kunhee Kim , Daijin Kim

CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis

Typical methods for text-to-image synthesis seek to design effective generative architecture to model the text-to-image mapping directly. It is fairly arduous due to the cross-modality translation. In this paper we circumvent this problem…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Jiadong Liang , Wenjie Pei , Feng Lu

Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis

We propose a novel hierarchical approach for text-to-image synthesis by inferring semantic layout. Instead of learning a direct mapping from text to image, our algorithm decomposes the generation process into multiple steps, in which it…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Seunghoon Hong , Dingdong Yang , Jongwook Choi , Honglak Lee

Style Generation: Image Synthesis based on Coarsely Matched Texts

Previous text-to-image synthesis algorithms typically use explicit textual instructions to generate/manipulate images accurately, but they have difficulty adapting to guidance in the form of coarsely matched texts. In this work, we attempt…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Mengyao Cui , Zhe Zhu , Shao-Ping Lu , Yulu Yang

Controllable Text-to-Image Generation

In this paper, we propose a novel controllable text-to-image generative adversarial network (ControlGAN), which can effectively synthesise high-quality images and also control parts of the image generation according to natural language…

Computer Vision and Pattern Recognition · Computer Science 2019-12-20 Bowen Li , Xiaojuan Qi , Thomas Lukasiewicz , Philip H. S. Torr

Efficient Personalized Text-to-image Generation by Leveraging Textual Subspace

Personalized text-to-image generation has attracted unprecedented attention in the recent few years due to its unique capability of generating highly-personalized images via using the input concept dataset and novel textual prompt. However,…

Artificial Intelligence · Computer Science 2024-07-02 Shian Du , Xiaotian Cheng , Qi Qian , Henglu Wei , Yi Xu , Xiangyang Ji

Adversarial nets with perceptual losses for text-to-image synthesis

Recent approaches in generative adversarial networks (GANs) can automatically synthesize realistic images from descriptive text. Despite the overall fair quality, the generated images often expose visible flaws that lack structural…

Computer Vision and Pattern Recognition · Computer Science 2017-08-31 Miriam Cha , Youngjune Gwon , H. T. Kung

Semantics Disentangling for Text-to-Image Generation

Synthesizing photo-realistic images from text descriptions is a challenging problem. Previous studies have shown remarkable progresses on visual quality of the generated images. In this paper, we consider semantics from the input text…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Guojun Yin , Bin Liu , Lu Sheng , Nenghai Yu , Xiaogang Wang , Jing Shao

Adversarial Text-to-Image Synthesis: A Review

With the advent of generative adversarial networks, synthesizing images from textual descriptions has recently become an active research area. It is a flexible and intuitive way for conditional image generation with significant progress in…

Computer Vision and Pattern Recognition · Computer Science 2021-10-07 Stanislav Frolov , Tobias Hinz , Federico Raue , Jörn Hees , Andreas Dengel

Zero-Shot Text-to-Image Generation

Text-to-image generation has traditionally focused on finding better modeling assumptions for training on a fixed dataset. These assumptions might involve complex architectures, auxiliary losses, or side information such as object part…

Computer Vision and Pattern Recognition · Computer Science 2021-03-02 Aditya Ramesh , Mikhail Pavlov , Gabriel Goh , Scott Gray , Chelsea Voss , Alec Radford , Mark Chen , Ilya Sutskever