Deep Automodulators

Ari Heljakka; Yuxin Hou; Juho Kannala; Arno Solin

Deep Automodulators

Machine Learning 2020-10-30 v4 Computer Vision and Pattern Recognition Machine Learning

Authors: Ari Heljakka , Yuxin Hou , Juho Kannala , Arno Solin

Abstract

We introduce a new category of generative autoencoders called automodulators. These networks can faithfully reproduce individual real-world input images like regular autoencoders, but also generate a fused sample from an arbitrary combination of several such images, allowing instantaneous 'style-mixing' and other new applications. An automodulator decouples the data flow of decoder operations from statistical properties thereof and uses the latent vector to modulate the former by the latter, with a principled approach for mutual disentanglement of decoder layers. Prior work has explored similar decoder architecture with GANs, but their focus has been on random sampling. A corresponding autoencoder could operate on real input images. For the first time, we show how to train such a general-purpose model with sharp outputs in high resolution, using novel training techniques, demonstrated on four image data sets. Besides style-mixing, we show state-of-the-art results in autoencoder comparison, and visual image quality nearly indistinguishable from state-of-the-art GANs. We expect the automodulator variants to become a useful building block for image applications and other data domains.

Keywords

image generation encoder-decoder architecture image representation learning

Cite

@article{arxiv.1912.10321,
  title  = {Deep Automodulators},
  author = {Ari Heljakka and Yuxin Hou and Juho Kannala and Arno Solin},
  journal= {arXiv preprint arXiv:1912.10321},
  year   = {2020}
}

Comments

To appear in Advances in Neural Information Processing Systems (NeurIPS 2020)

Deep Automodulators

Abstract

Keywords

Cite

Comments

Related papers