Related papers: Swapping Autoencoder for Deep Image Manipulation

Transformer-based Image Generation from Scene Graphs

Graph-structured scene descriptions can be efficiently used in generative models to control the composition of the generated image. Previous approaches are based on the combination of graph convolutional networks and adversarial methods for…

Computer Vision and Pattern Recognition · Computer Science 2023-03-09 Renato Sortino, Simone Palazzo, Concetto Spampinato

Unsupervised Learning for Intrinsic Image Decomposition from a Single Image

Intrinsic image decomposition, which is an essential task in computer vision, aims to infer the reflectance and shading of the scene. It is challenging since it needs to separate one image into two components. To tackle this, conventional…

Computer Vision and Pattern Recognition · Computer Science 2020-05-28 Yunfei Liu, Yu Li, Shaodi You, Feng Lu

Pioneer Networks: Progressively Growing Generative Autoencoder

We introduce a novel generative autoencoder network model that learns to encode and reconstruct images with high quality and resolution, and supports smooth random sampling from the latent space of the encoder. Generative adversarial…

Machine Learning · Computer Science 2018-10-10 Ari Heljakka, Arno Solin, Juho Kannala

Deep Image Composition Meets Image Forgery

Image forgery is a topic that has been studied for many years. Before the breakthrough of deep learning, forged images were detected using handcrafted features that did not require training. These traditional methods failed to perform…

Computer Vision and Pattern Recognition · Computer Science 2024-04-29 Eren Tahir, Mert Bal

Generative Image Layer Decomposition with Visual Effects

Recent advancements in large generative models, particularly diffusion-based methods, have significantly enhanced the capabilities of image editing. However, achieving precise control over image composition tasks remains a challenge.…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Jinrui Yang, Qing Liu, Yijun Li, Soo Ye Kim, Daniil Pakhomov, Mengwei Ren, Jianming Zhang, Zhe Lin, Cihang Xie, Yuyin Zhou

An Auto-Encoder Strategy for Adaptive Image Segmentation

Deep neural networks are powerful tools for biomedical image segmentation. These models are often trained with heavy supervision, relying on pairs of images and corresponding voxel-level labels. However, obtaining segmentations of…

Image and Video Processing · Electrical Eng. & Systems 2020-04-30 Evan M. Yu, Juan Eugenio Iglesias, Adrian V. Dalca, Mert R. Sabuncu

Disentangling Domain and Content

Many real-world datasets can be divided into groups according to certain salient features (e.g. grouping images by subject, grouping text by font, etc.). Often, machine learning tasks require that these features be represented separately…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-02-16 Dan Andrei Iliescu, Aliaksei Mikhailiuk, Damon Wischik, Rafal Mantiuk

Style-Guided Inference of Transformer for High-resolution Image Synthesis

Transformers are eminently suited to auto-regressive image synthesis, which recursively predicts discrete values from past values to make up a full image. In particular, combined with a vector-quantised latent representation, the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-12 Jonghwa Yim, Minjae Kim

Unsupervised Change Detection in Hyperspectral Images using Feature Fusion Deep Convolutional Autoencoders

Binary change detection in bi-temporal co-registered hyperspectral images is a challenging task due to a large number of spectral bands present in the data. Researchers, therefore, try to handle it by reducing dimensions. The proposed work…

Computer Vision and Pattern Recognition · Computer Science 2021-09-13 Debasrita Chakraborty, Ashish Ghosh

Editable Image Elements for Controllable Synthesis

Diffusion models have made significant advances in text-guided synthesis tasks. However, editing user-provided images remains challenging, as the high dimensional noise input space of diffusion models is not naturally suited for image…

Computer Vision and Pattern Recognition · Computer Science 2024-04-25 Jiteng Mu, Michaël Gharbi, Richard Zhang, Eli Shechtman, Nuno Vasconcelos, Xiaolong Wang, Taesung Park

Conditional Variational Autoencoder for Learned Image Reconstruction

Learned image reconstruction techniques using deep neural networks have recently gained popularity, and have delivered promising empirical results. However, most approaches focus on one single recovery for each observation, and thus neglect…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Chen Zhang, Riccardo Barbano, Bangti Jin

Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data

In quantised autoencoders, images are usually split into local patches, each encoded by one token. This representation is redundant in the sense that the same number of tokens is spent per region, regardless of the visual information…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Tim Elsner, Paula Usinger, Victor Czech, Gregor Kobsik, Yanjiang He, Isaak Lim, Leif Kobbelt

Code-Aligned Autoencoders for Unsupervised Change Detection in Multimodal Remote Sensing Images

Image translation with convolutional autoencoders has recently been used as an approach to multimodal change detection in bitemporal satellite images. A main challenge is the alignment of the code spaces by reducing the contribution of…

Computer Vision and Pattern Recognition · Computer Science 2020-04-16 Luigi T. Luppino, Mads A. Hansen, Michael Kampffmeyer, Filippo M. Bianchi, Gabriele Moser, Robert Jenssen, Stian N. Anfinsen

Going deeper with Image Transformers

Transformers have recently been adapted for large-scale image classification, achieving high scores and shaking up the long supremacy of convolutional neural networks. However, the optimization of image transformers has been little studied so…

Computer Vision and Pattern Recognition · Computer Science 2021-04-08 Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve, Hervé Jégou

Steered Mixture-of-Experts Autoencoder Design for Real-Time Image Modelling and Denoising

Research in recent years introduced Steered Mixture-of-Experts (SMoE) as a framework to form sparse, edge-aware models for 2D and higher-dimensional pixel data, applicable to compression, denoising, and beyond, and able to compete…

Image and Video Processing · Electrical Eng. & Systems 2023-05-08 Elvira Fleig, Erik Bochinski, Thomas Sikora

Deep Structured Generative Models

Deep generative models have shown promising results in generating realistic images, but it is still non-trivial to generate images with complicated structures. The main reason is that most of the current generative models fail to explore…

Machine Learning · Computer Science 2018-07-12 Kun Xu, Haoyu Liang, Jun Zhu, Hang Su, Bo Zhang

Using Swarm Optimization To Enhance Autoencoders Images

Autoencoders learn data representations through reconstruction. Robust training is the key factor affecting the quality of the learned representations and, consequently, the accuracy of the applications that use them. Previous works…

Neural and Evolutionary Computing · Computer Science 2018-07-11 Maisa Doaud, Michael Mayo

DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder

Autonomous vehicles increasingly rely on cameras to provide the input for perception and scene understanding, and the ability of these models to classify their environment and objects under adverse conditions and image noise is crucial.…

Computer Vision and Pattern Recognition · Computer Science 2021-11-08 Andreas Papachristodoulou, Christos Kyrkou, Theocharis Theocharides

Representation Learning for Non-Melanoma Skin Cancer using a Latent Autoencoder

Generative learning is a powerful tool for representation learning, and shows particular promise for problems in biomedical imaging. However, in this context, sampling from the distribution is secondary to finding representations of real…

Image and Video Processing · Electrical Eng. & Systems 2022-09-07 Simon Myles Thomas

Generative Image Inpainting with Contextual Attention

Recent deep learning based approaches have shown promising results for the challenging task of inpainting large missing regions in an image. These methods can generate visually plausible image structures and textures, but often create…

Computer Vision and Pattern Recognition · Computer Science 2018-03-23 Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas S. Huang