Related papers: Swapping Autoencoder for Deep Image Manipulation

Rewriting a Deep Generative Model

A deep generative model such as a GAN learns to model a rich set of semantic and physical rules about the target distribution, but up to now, it has been obscure how such rules are encoded in the network, or how a rule could be changed. In…

Computer Vision and Pattern Recognition · Computer Science 2020-07-31 David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba

A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing

Recent advances in the field of generative models and in particular generative adversarial networks (GANs) have led to substantial progress for controlled image editing, especially compared with the pre-deep learning era. Despite their…

Computer Vision and Pattern Recognition · Computer Science 2023-12-14 Gwilherm Lesné, Yann Gousseau, Saïd Ladjal, Alasdair Newson

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation

Diffusion probabilistic models (DPMs) have achieved remarkable quality in image generation that rivals GANs'. But unlike GANs, DPMs use a set of latent variables that lack semantic meaning and cannot serve as a useful representation for…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Konpat Preechakul, Nattanat Chatthee, Suttisak Wizadwongsa, Supasorn Suwajanakorn

Transforming Image Generation from Scene Graphs

Generating images from semantic visual knowledge is a challenging task that can be useful to condition the synthesis process in complex, subtle, and unambiguous ways, compared to alternatives such as class labels or text descriptions.…

Computer Vision and Pattern Recognition · Computer Science 2022-07-04 Renato Sortino, Simone Palazzo, Concetto Spampinato

Semi-Supervised Manifold Learning with Complexity Decoupled Chart Autoencoders

Autoencoding is a popular method in representation learning. Conventional autoencoders employ symmetric encoding-decoding procedures and a simple Euclidean latent space to detect hidden low-dimensional structures in an unsupervised way.…

Machine Learning · Computer Science 2024-10-07 Stefan C. Schonsheck, Scott Mahan, Timo Klock, Alexander Cloninger, Rongjie Lai

Semantic Image Inpainting with Deep Generative Models

Semantic image inpainting is a challenging task where large missing regions have to be filled based on the available visual data. Existing methods which extract information from only a single image generally produce unsatisfactory results…

Computer Vision and Pattern Recognition · Computer Science 2017-07-14 Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do

Image Decomposition and Classification through a Generative Model

We demonstrate in this paper that a generative model can be designed to perform classification tasks under challenging settings, including adversarial attacks and input distribution shifts. Specifically, we propose a conditional variational…

Computer Vision and Pattern Recognition · Computer Science 2019-02-12 Houpu Yao, Malcolm Regan, Yezhou Yang, Yi Ren

Generate, Segment and Refine: Towards Generic Manipulation Segmentation

Detecting manipulated images has become a significant emerging challenge. The advent of image sharing platforms and the easy availability of advanced photo editing software have resulted in large quantities of manipulated images being…

Computer Vision and Pattern Recognition · Computer Science 2019-11-26 Peng Zhou, Bor-Chun Chen, Xintong Han, Mahyar Najibi, Abhinav Shrivastava, Ser Nam Lim, Larry S. Davis

Interpretable Transformations with Encoder-Decoder Networks

Deep feature spaces have the capacity to encode complex transformations of their input data. However, understanding the relative feature-space relationship between two transformed encoded images is difficult. For instance, what is the…

Computer Vision and Pattern Recognition · Computer Science 2017-10-23 Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow

Autoencoder Image Interpolation by Shaping the Latent Space

Autoencoders represent an effective approach for computing the underlying factors characterizing datasets of different types. The latent representations of autoencoders have been studied in the context of enabling interpolation between data…

Machine Learning · Computer Science 2020-10-23 Alon Oring, Zohar Yakhini, Yacov Hel-Or

Improving Autoencoder Image Interpolation via Dynamic Optimal Transport

Autoencoders are important generative models that, among others, have the ability to interpolate image sequences. However, interpolated images are usually not semantically meaningful. In this paper, motivated by dynamic optimal transport, we…

Optimization and Control · Mathematics 2024-04-16 Xue Feng, Thomas Strohmer

Generating Multi-Image Synthetic Data for Text-to-Image Customization

Customization of text-to-image models enables users to insert new concepts or objects and generate them in unseen settings. Existing methods either rely on comparatively expensive test-time optimization or train encoders on single-image…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Nupur Kumari, Xi Yin, Jun-Yan Zhu, Ishan Misra, Samaneh Azadi

Transfer Learning From Synthetic To Real Images Using Variational Autoencoders For Precise Position Detection

Capturing and labeling camera images in the real world is an expensive task, whereas synthesizing labeled images in a simulation environment is easy for collecting large-scale image data. However, learning from only synthetic images may not…

Computer Vision and Pattern Recognition · Computer Science 2018-07-06 Tadanobu Inoue, Subhajit Chaudhury, Giovanni De Magistris, Sakyasingha Dasgupta

Supervised Dimensionality Reduction and Image Classification Utilizing Convolutional Autoencoders

The joint optimization of the reconstruction and classification error is a hard non-convex problem, especially when a non-linear mapping is utilized. In order to overcome this obstacle, a novel optimization strategy is proposed, in which a…

Machine Learning · Computer Science 2022-11-07 Ioannis A. Nellas, Sotiris K. Tasoulis, Vassilis P. Plagianakos, Spiros V. Georgakopoulos

Counterfactuals uncover the modular structure of deep generative models

Deep generative models can emulate the perceptual properties of complex image datasets, providing a latent representation of the data. However, manipulating such representation to perform meaningful and controllable transformations in the…

Machine Learning · Computer Science 2019-12-13 Michel Besserve, Arash Mehrjou, Rémy Sun, Bernhard Schölkopf

MADE: Masked Autoencoder for Distribution Estimation

There has been a lot of recent interest in designing neural network models to estimate a distribution from a set of examples. We introduce a simple modification for autoencoder neural networks that yields powerful generative models. Our…

Machine Learning · Computer Science 2015-06-08 Mathieu Germain, Karol Gregor, Iain Murray, Hugo Larochelle

Towards Photographic Image Manipulation with Balanced Growing of Generative Autoencoders

We present a generative autoencoder that provides fast encoding, faithful reconstructions (e.g. retaining the identity of a face), sharp generated/reconstructed samples in high resolutions, and a well-structured latent space that supports…

Machine Learning · Computer Science 2020-02-21 Ari Heljakka, Arno Solin, Juho Kannala

A geometrically aware auto-encoder for multi-texture synthesis

We propose an auto-encoder architecture for multi-texture synthesis. The approach relies on both a compact encoder accounting for second order neural statistics and a generator incorporating adaptive periodic content. Images are embedded in…

Computer Vision and Pattern Recognition · Computer Science 2023-06-30 Pierrick Chatillon, Yann Gousseau, Sidonie Lefebvre

EdiBERT, a generative model for image editing

Advances in computer vision are pushing the limits of image manipulation, with generative models sampling detailed images on various tasks. However, a specialized model is often developed and trained for each specific task, even though…

Computer Vision and Pattern Recognition · Computer Science 2022-07-22 Thibaut Issenhuth, Ugo Tanielian, Jérémie Mary, David Picard

Fader Networks: Manipulating Images by Sliding Attributes

This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our…

Computer Vision and Pattern Recognition · Computer Science 2018-01-30 Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, Marc'Aurelio Ranzato