Related papers: Multimodal Image Synthesis with Conditional Implic…

CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis

A persistent challenge in conditional image synthesis has been to generate diverse output images from the same input image despite only one output image being observed per input image. GAN-based methods are prone to mode collapse, which…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Shichong Peng, Alireza Moazeni, Ke Li

Diverse Image Synthesis from Semantic Layouts via Conditional IMLE

Most existing methods for conditional image synthesis are only able to generate a single plausible image for any given input, or at best a fixed number of plausible images. In this paper, we focus on the problem of generating images from…

Computer Vision and Pattern Recognition · Computer Science 2019-08-30 Ke Li, Tianhao Zhang, Jitendra Malik

Cascading Modular Network (CAM-Net) for Multimodal Image Synthesis

Deep generative models such as GANs have driven impressive advances in conditional image synthesis in recent years. A persistent challenge has been to generate diverse versions of output images from the same input image, due to the problem…

Computer Vision and Pattern Recognition · Computer Science 2021-06-18 Shichong Peng, Alireza Moazeni, Ke Li

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the…

Computer Vision and Pattern Recognition · Computer Science 2018-08-21 Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro

Multimodal Image-to-Image Translation via Mutual Information Estimation and Maximization

Multimodal image-to-image translation (I2IT) aims to learn a conditional distribution that explores multiple possible images in the target domain given an input image in the source domain. Conditional generative adversarial networks (cGANs)…

Computer Vision and Pattern Recognition · Computer Science 2021-05-11 Zhiwen Zuo, Lei Zhao, Zhizhong Wang, Haibo Chen, Ailin Li, Qijiang Xu, Wei Xing, Dongming Lu

Harmonizing Maximum Likelihood with GANs for Multimodal Conditional Generation

Recent advances in conditional image generation tasks, such as image-to-image translation and image inpainting, are largely accounted to the success of conditional GAN models, which are often optimized by the joint use of the GAN loss with…

Machine Learning · Computer Science 2019-02-26 Soochan Lee, Junsoo Ha, Gunhee Kim

Super-Resolution via Conditional Implicit Maximum Likelihood Estimation

Single-image super-resolution (SISR) is a canonical problem with diverse applications. Leading methods like SRGAN produce images that contain various artifacts, such as high-frequency noise, hallucinated colours and shape distortions, which…

Machine Learning · Computer Science 2018-10-03 Ke Li, Shichong Peng, Jitendra Malik

Multimodal Conditional Image Synthesis with Product-of-Experts GANs

Existing conditional image synthesis frameworks generate images based on user inputs in a single modality, such as text, segmentation, sketch, or style reference. They are often unable to leverage multimodal user inputs when available,…

Computer Vision and Pattern Recognition · Computer Science 2021-12-10 Xun Huang, Arun Mallya, Ting-Chun Wang, Ming-Yu Liu

Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis

An emerging area of research aims to learn deep generative models with limited training data. Prior generative models like GANs and diffusion models require a lot of data to perform well, and their performance degrades when they are trained…

Computer Vision and Pattern Recognition · Computer Science 2024-09-27 Chirag Vashist, Shichong Peng, Ke Li

Conditional Image Synthesis With Auxiliary Classifier GANs

Synthesizing high resolution photorealistic images has been a long-standing challenge in machine learning. In this paper we introduce new methods for the improved training of generative adversarial networks (GANs) for image synthesis. We…

Machine Learning · Statistics 2017-07-24 Augustus Odena, Christopher Olah, Jonathon Shlens

Multimodal Shape Completion via IMLE

Shape completion is the problem of completing partial input shapes such as partial scans. This problem finds important applications in computer vision and robotics due to issues such as occlusion or sparsity in real-world data. However,…

Computer Vision and Pattern Recognition · Computer Science 2021-07-08 Himanshu Arora, Saurabh Mishra, Shichong Peng, Ke Li, Ali Mahdavi-Amiri

Diverse Semantic Image Synthesis via Probability Distribution Modeling

Semantic image synthesis, translating semantic layouts to photo-realistic images, is a one-to-many mapping problem. Though impressive progress has been recently made, diverse semantic synthesis that can efficiently produce semantic-level…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Bin Liu, Gang Hua, Nenghai Yu

Generating Multimodal Images with GAN: Integrating Text, Image, and Style

In the field of computer vision, multimodal image generation has become a research hotspot, especially the task of integrating text, image, and style. In this study, we propose a multimodal image generation method based on Generative…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Chaoyi Tan, Wenqing Zhang, Zhen Qi, Kowei Shih, Xinshi Li, Ao Xiang

IMAGINE: Image Synthesis by Image-Guided Model Inversion

We introduce an inversion based method, denoted as IMAge-Guided model INvErsion (IMAGINE), to generate high-quality and diverse images from only a single training sample. We leverage the knowledge of image semantics from a pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2021-04-14 Pei Wang, Yijun Li, Krishna Kumar Singh, Jingwan Lu, Nuno Vasconcelos

Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis

Most conditional generation tasks expect diverse outputs given a single conditional context. However, conditional generative adversarial networks (cGANs) often focus on the prior conditional information and ignore the input noise vectors,…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Qi Mao, Hsin-Ying Lee, Hung-Yu Tseng, Siwei Ma, Ming-Hsuan Yang

Cluster-guided Image Synthesis with Unconditional Models

Generative Adversarial Networks (GANs) are the driving force behind the state-of-the-art in image generation. Despite their ability to synthesize high-resolution photo-realistic images, generating content with on-demand conditioning of…

Computer Vision and Pattern Recognition · Computer Science 2021-12-28 Markos Georgopoulos, James Oldfield, Grigorios G Chrysos, Yannis Panagakis

Unsupervised Image Generation with Infinite Generative Adversarial Networks

Image generation has been heavily investigated in computer vision, where one core research challenge is to generate images from arbitrarily complex distributions with little supervision. Generative Adversarial Networks (GANs) as an implicit…

Computer Vision and Pattern Recognition · Computer Science 2021-08-19 Hui Ying, He Wang, Tianjia Shao, Yin Yang, Kun Zhou

Panoptic-based Image Synthesis

Conditional image synthesis for generating photorealistic images serves various applications for content editing to content generation. Previous conditional image synthesis algorithms mostly rely on semantic maps, and often fail in complex…

Computer Vision and Pattern Recognition · Computer Science 2020-04-23 Aysegul Dundar, Karan Sapra, Guilin Liu, Andrew Tao, Bryan Catanzaro

Large Scale Image Completion via Co-Modulated Generative Adversarial Networks

Numerous task-specific variants of conditional generative adversarial networks have been developed for image completion. Yet, a serious limitation remains that all existing algorithms tend to fail when handling large-scale missing regions.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-19 Shengyu Zhao, Jonathan Cui, Yilun Sheng, Yue Dong, Xiao Liang, Eric I Chang, Yan Xu

Multimodal Face Synthesis from Visual Attributes

Synthesis of face images from visual attributes is an important problem in computer vision and biometrics due to its applications in law enforcement and entertainment. Recent advances in deep generative networks have made it possible to…

Computer Vision and Pattern Recognition · Computer Science 2022-01-14 Xing Di, Vishal M. Patel