Related papers: Swapping Autoencoder for Deep Image Manipulation

A Variational U-Net for Conditional Appearance and Shape Generation

Deep generative models have demonstrated great performance in image synthesis. However, results deteriorate in case of spatial deformations, since they generate images of objects directly, rather than modeling the intricate interplay of…

Computer Vision and Pattern Recognition · Computer Science 2018-04-16 Patrick Esser , Ekaterina Sutter , Björn Ommer

Generative Modeling with Conditional Autoencoders: Building an Integrated Cell

We present a conditional generative model to learn variation in cell and nuclear morphology and the location of subcellular structures from microscopy images. Our model generalizes to a wide range of subcellular localization and allows for…

Machine Learning · Statistics 2017-05-02 Gregory R. Johnson , Rory M. Donovan-Maiye , Mary M. Maleckar

Variational Autoencoder for Deep Learning of Images, Labels and Captions

A novel variational autoencoder is developed to model images, as well as associated labels or captions. The Deep Generative Deconvolutional Network (DGDN) is used as a decoder of the latent image features, and a deep Convolutional Neural…

Machine Learning · Statistics 2016-09-29 Yunchen Pu , Zhe Gan , Ricardo Henao , Xin Yuan , Chunyuan Li , Andrew Stevens , Lawrence Carin

Deep Hyperspectral Unmixing using Transformer Network

Currently, this paper is under review in IEEE. Transformers have intrigued the vision research community with their state-of-the-art performance in natural language processing. With their superior performance, transformers have found their…

Computer Vision and Pattern Recognition · Computer Science 2022-10-05 Preetam Ghosh , Swalpa Kumar Roy , Bikram Koirala , Behnood Rasti , Paul Scheunders

Authoring image decompositions with generative models

We show how to extend traditional intrinsic image decompositions to incorporate further layers above albedo and shading. It is hard to obtain data to learn a multi-layer decomposition. Instead, we can learn to decompose an image into layers…

Computer Vision and Pattern Recognition · Computer Science 2016-12-06 Jason Rock , Theerasit Issaranon , Aditya Deshpande , David Forsyth

Transfer learning from synthetic to real images using variational autoencoders for robotic applications

Robotic learning in simulation environments provides a faster, more scalable, and safer training methodology than learning directly with physical robots. Also, synthesizing images in a simulation environment for collecting large-scale image…

Robotics · Computer Science 2017-09-21 Tadanobu Inoue , Subhajit Chaudhury , Giovanni De Magistris , Sakyasingha Dasgupta

Transformer-based Learned Image Compression for Joint Decoding and Denoising

This work introduces a Transformer-based image compression system. It has the flexibility to switch between the standard image reconstruction and the denoising reconstruction from a single compressed bitstream. Instead of training separate…

Image and Video Processing · Electrical Eng. & Systems 2024-02-21 Yi-Hsin Chen , Kuan-Wei Ho , Shiau-Rung Tsai , Guan-Hsun Lin , Alessandro Gnutti , Wen-Hsiao Peng , Riccardo Leonardi

Hybrid LSTM and Encoder-Decoder Architecture for Detection of Image Forgeries

With advanced image journaling tools, one can easily alter the semantic meaning of an image by exploiting certain manipulation techniques such as copy-clone, object splicing, and removal, which mislead the viewers. In contrast, the…

Computer Vision and Pattern Recognition · Computer Science 2019-06-26 Jawadul H. Bappy , Cody Simons , Lakshmanan Nataraj , B. S. Manjunath , Amit K. Roy-Chowdhury

Estimating Distributions with Low-dimensional Structures Using Mixtures of Generative Models

There has been a growing interest in statistical inference from data satisfying the so-called manifold hypothesis, assuming data points in the high-dimensional ambient space to lie in close vicinity of a submanifold of much lower dimension.…

Methodology · Statistics 2023-01-04 Rong Tang , Yun Yang

High-Resolution Complex Scene Synthesis with Transformers

The use of coarse-grained layouts for controllable synthesis of complex scene images via deep generative models has recently gained popularity. However, results of current approaches still fall short of their promise of high-resolution…

Computer Vision and Pattern Recognition · Computer Science 2021-05-14 Manuel Jahn , Robin Rombach , Björn Ommer

Perceptual Generative Autoencoders

Modern generative models are usually designed to match target distributions directly in the data space, where the intrinsic dimension of data can be much lower than the ambient dimension. We argue that this discrepancy may contribute to the…

Machine Learning · Computer Science 2020-07-02 Zijun Zhang , Ruixiang Zhang , Zongpeng Li , Yoshua Bengio , Liam Paull

Learning to Manipulate Individual Objects in an Image

We describe a method to train a generative model with latent factors that are (approximately) independent and localized. This means that perturbing the latent variables affects only local regions of the synthesized image, corresponding to…

Computer Vision and Pattern Recognition · Computer Science 2020-04-14 Yanchao Yang , Yutong Chen , Stefano Soatto

MaskGIT: Masked Generative Image Transformer

Generative transformers have experienced rapid popularity growth in the computer vision community in synthesizing high-fidelity and high-resolution images. The best generative transformer models so far, however, still treat an image naively…

Computer Vision and Pattern Recognition · Computer Science 2022-02-10 Huiwen Chang , Han Zhang , Lu Jiang , Ce Liu , William T. Freeman

Explicit Disentanglement of Appearance and Perspective in Generative Models

Disentangled representation learning finds compact, independent and easy-to-interpret factors of the data. Learning such has been shown to require an inductive bias, which we explicitly encode in a generative model of images. Specifically,…

Computer Vision and Pattern Recognition · Computer Science 2019-11-14 Nicki Skafte Detlefsen , Søren Hauberg

Examining Performance of Sketch-to-Image Translation Models with Multiclass Automatically Generated Paired Training Data

Image translation is a computer vision task that involves translating one representation of the scene into another. Various approaches have been proposed and achieved highly desirable results. Nevertheless, its accomplishment requires…

Computer Vision and Pattern Recognition · Computer Science 2018-11-02 Dichao Hu

A PCA-like Autoencoder

An autoencoder is a neural network which data projects to and from a lower dimensional latent space, where this data is easier to understand and model. The autoencoder consists of two sub-networks, the encoder and the decoder, which carry…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Saïd Ladjal , Alasdair Newson , Chi-Hieu Pham

Divide and Compose with Score Based Generative Models

While score based generative models, or diffusion models, have found success in image synthesis, they are often coupled with text data or image label to be able to manipulate and conditionally generate images. Even though manipulation of…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Sandesh Ghimire , Armand Comas , Davin Hill , Aria Masoomi , Octavia Camps , Jennifer Dy

Learning Co-segmentation by Segment Swapping for Retrieval and Discovery

The goal of this work is to efficiently identify visually similar patterns in images, e.g. identifying an artwork detail copied between an engraving and an oil painting, or recognizing parts of a night-time photograph visible in its daytime…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Xi Shen , Alexei A. Efros , Armand Joulin , Mathieu Aubry

High-dimensional Assisted Generative Model for Color Image Restoration

This work presents an unsupervised deep learning scheme that exploiting high-dimensional assisted score-based generative model for color image restoration tasks. Considering that the sample number and internal dimension in score-based…

Image and Video Processing · Electrical Eng. & Systems 2021-08-17 Kai Hong , Chunhua Wu , Cailian Yang , Minghui Zhang , Yancheng Lu , Yuhao Wang , Qiegen Liu

Homomorphism Autoencoder -- Learning Group Structured Representations from Observed Transitions

How can agents learn internal models that veridically represent interactions with the real world is a largely open question. As machine learning is moving towards representations containing not just observational but also interventional…

Machine Learning · Computer Science 2024-07-03 Hamza Keurti , Hsiao-Ru Pan , Michel Besserve , Benjamin F. Grewe , Bernhard Schölkopf