English
Related papers

Related papers: TLGAN: document Text Localization using Generative…

200 papers

This paper addresses the problem of manipulating images using natural language description. Our task aims to semantically modify visual attributes of an object in an image according to the text describing the new visual appearance. Although…

Computer Vision and Pattern Recognition · Computer Science 2018-11-29 Seonghyeon Nam , Yunji Kim , Seon Joo Kim

In this paper, we propose a novel way to interpret text information by extracting visual feature presentation from multiple high-resolution and photo-realistic synthetic images generated by Text-to-image Generative Adversarial Network (GAN)…

Computer Vision and Pattern Recognition · Computer Science 2019-08-05 Tao Hu , Chengjiang Long , Leheng Zhang , Chunxia Xiao

The problem of generating textual descriptions for the visual data has gained research attention in the recent years. In contrast to that the problem of generating visual data from textual descriptions is still very challenging, because it…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Bulla Rajesh , Nandakishore Dusa , Mohammed Javed , Shiv Ram Dubey , P. Nagabhushan

Generative Adversarial Networks (GAN) is a model for data synthesis, which creates plausible data through the competition of generator and discriminator. Although GAN application to image synthesis is extensively studied, it has inherent…

Computation and Language · Computer Science 2025-01-07 Jun-Min Lee , Tae-Bin Ha

Generative adversarial network (GAN) has been shown to be useful in various applications, such as image recognition, text processing and scientific computing, due its strong ability to learn complex data distributions. In this study, a…

Geophysics · Physics 2021-09-14 Tianhao He , Dongxiao Zhang

Recent advances in Generative Adversarial Networks (GANs) have led to the creation of realistic-looking digital images that pose a major challenge to their detection by humans or computers. GANs are used in a wide range of tasks, from…

Image and Video Processing · Electrical Eng. & Systems 2020-07-22 Michael Goebel , Lakshmanan Nataraj , Tejaswi Nanjundaswamy , Tajuddin Manhar Mohammed , Shivkumar Chandrasekaran , B. S. Manjunath

Most existing text-to-image generation methods adopt a multi-stage modular architecture which has three significant problems: 1) Training multiple networks increases the run time and affects the convergence and stability of the generative…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Zhenxing Zhang , Lambert Schomaker

We describe a new approach that improves the training of generative adversarial nets (GANs) for synthesizing diverse images from a text input. Our approach is based on the conditional version of GANs and expands on previous work leveraging…

Computer Vision and Pattern Recognition · Computer Science 2019-02-07 Miriam Cha , Youngjune L. Gwon , H. T. Kung

Recently image-to-image translation has received increasing attention, which aims to map images in one domain to another specific one. Existing methods mainly solve this task via a deep generative model, and focus on exploring the…

Computer Vision and Pattern Recognition · Computer Science 2019-01-24 Songyao Jiang , Zhiqiang Tao , Yun Fu

Deep generative models have been successfully applied to many applications. However, existing works experience limitations when generating large images (the literature usually generates small images, e.g. 32 * 32 or 128 * 128). In this…

Computer Vision and Pattern Recognition · Computer Science 2019-03-06 Zihan Ding , Xiao-Yang Liu , Miao Yin , Linghe Kong

As a new approach to train generative models, \emph{generative adversarial networks} (GANs) have achieved considerable success in image generation. This framework has also recently been applied to data with graph structures. We propose…

Machine Learning · Computer Science 2021-02-26 Shuangfei Fan , Bert Huang

Unsupervised image translation, which aims in translating two independent sets of images, is challenging in discovering the correct correspondences without paired data. Existing works build upon Generative Adversarial Network (GAN) such…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Shuang Ma , Jianlong Fu , Chang Wen Chen , Tao Mei

Although many methods have been proposed to deal with nature image super-resolution (SR) and get impressive performance, the text images SR is not good due to their ignorance of document images. In this paper, we propose a matting-based…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Yubao Liu , Kai Lin

Generative Adversarial Networks (GANs) are a promising approach for text generation that, unlike traditional language models (LM), does not suffer from the problem of ``exposure bias''. However, A major hurdle for understanding the…

Computation and Language · Computer Science 2019-03-26 Guy Tevet , Gavriel Habib , Vered Shwartz , Jonathan Berant

Synthesizing high-quality realistic images from text descriptions is a challenging task. Existing text-to-image Generative Adversarial Networks generally employ a stacked architecture as the backbone yet still remain three flaws. First, the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Ming Tao , Hao Tang , Fei Wu , Xiao-Yuan Jing , Bing-Kun Bao , Changsheng Xu

Conditional Generative Adversarial Networks (GANs) for cross-domain image-to-image translation have made much progress recently. Depending on the task complexity, thousands to millions of labeled image pairs are needed to train a…

Computer Vision and Pattern Recognition · Computer Science 2018-10-12 Zili Yi , Hao Zhang , Ping Tan , Minglun Gong

The Generative Adversarial Network (GAN) has achieved great success in generating realistic (real-valued) synthetic data. However, convergence issues and difficulties dealing with discrete data hinder the applicability of GAN to text. We…

Machine Learning · Statistics 2017-11-21 Yizhe Zhang , Zhe Gan , Kai Fan , Zhi Chen , Ricardo Henao , Dinghan Shen , Lawrence Carin

Generating realistic images from human texts is one of the most challenging problems in the field of computer vision (CV). The meaning of descriptions given can be roughly reflected by existing text-to-image approaches. In this paper, our…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Mehrshad Momen-Tayefeh

The study of ancient writings has great value for archaeology and philology. Essential forms of material are photographic characters, but manual photographic character recognition is extremely time-consuming and expertise-dependent.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Hongxiang Huang , Daihui Yang , Gang Dai , Zhen Han , Yuyi Wang , Kin-Man Lam , Fan Yang , Shuangping Huang , Yongge Liu , Mengchao He

Text-to-image synthesis has recently seen significant progress thanks to large pretrained language models, large-scale training data, and the introduction of scalable model families such as diffusion and autoregressive models. However, the…

Machine Learning · Computer Science 2023-01-24 Axel Sauer , Tero Karras , Samuli Laine , Andreas Geiger , Timo Aila
‹ Prev 1 2 3 10 Next ›