Related papers: Swapping Autoencoder for Deep Image Manipulation

CIMGEN: Controlled Image Manipulation by Finetuning Pretrained Generative Models on Limited Data

Content creation and image editing can benefit from flexible user controls. A common intermediate representation for conditional image generation is a semantic map, that has information of objects present in the image. When compared to raw…

Artificial Intelligence · Computer Science 2024-01-25 Chandrakanth Gudavalli , Erik Rosten , Lakshmanan Nataraj , Shivkumar Chandrasekaran , B. S. Manjunath

Fully Convolutional Mesh Autoencoder using Efficient Spatially Varying Kernels

Learning latent representations of registered meshes is useful for many 3D tasks. Techniques have recently shifted to neural mesh autoencoders. Although they demonstrate higher precision than traditional methods, they remain unable to…

Computer Vision and Pattern Recognition · Computer Science 2020-10-22 Yi Zhou , Chenglei Wu , Zimo Li , Chen Cao , Yuting Ye , Jason Saragih , Hao Li , Yaser Sheikh

Learning Disentangled Representations via Independent Subspaces

Image generating neural networks are mostly viewed as black boxes, where any change in the input can have a number of globally effective changes on the output. In this work, we propose a method for learning disentangled representations to…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 Maren Awiszus , Hanno Ackermann , Bodo Rosenhahn

Functional Autoencoder for Smoothing and Representation Learning

A common pipeline in functional data analysis is to first convert the discretely observed data to smooth functions, and then represent the functions by a finite-dimensional vector of coefficients summarizing the information. Existing…

Machine Learning · Computer Science 2024-01-19 Sidi Wu , Cédric Beaulac , Jiguo Cao

Deep Image Compositing

Image compositing is a task of combining regions from different images to compose a new image. A common use case is background replacement of portrait images. To obtain high quality composites, professionals typically manually perform…

Computer Vision and Pattern Recognition · Computer Science 2020-11-05 He Zhang , Jianming Zhang , Federico Perazzi , Zhe Lin , Vishal M. Patel

Learning to Manipulate Object Collections Using Grounded State Representations

We propose a method for sim-to-real robot learning which exploits simulator state information in a way that scales to many objects. We first train a pair of encoder networks to capture multi-object state information in a latent space. One…

Robotics · Computer Science 2020-08-10 Matthew Wilson , Tucker Hermans

Structured Graph Variational Autoencoders for Indoor Furniture layout Generation

We present a structured graph variational autoencoder for generating the layout of indoor 3D scenes. Given the room type (e.g., living room or library) and the room layout (e.g., room elements such as floor and walls), our architecture…

Computer Vision and Pattern Recognition · Computer Science 2022-07-27 Aditya Chattopadhyay , Xi Zhang , David Paul Wipf , Himanshu Arora , Rene Vidal

Identity Encoder for Personalized Diffusion

Many applications can benefit from personalized image generation models, including image enhancement, video conferences, just to name a few. Existing works achieved personalization by fine-tuning one model for each person. While being…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Yu-Chuan Su , Kelvin C. K. Chan , Yandong Li , Yang Zhao , Han Zhang , Boqing Gong , Huisheng Wang , Xuhui Jia

Multi-Level Representation Learning for Deep Subspace Clustering

This paper proposes a novel deep subspace clustering approach which uses convolutional autoencoders to transform input images into new representations lying on a union of linear subspaces. The first contribution of our work is to insert…

Computer Vision and Pattern Recognition · Computer Science 2020-01-24 Mohsen Kheirandishfard , Fariba Zohrizadeh , Farhad Kamangar

Single Image Reflection Removal Using Deep Encoder-Decoder Network

Image of a scene captured through a piece of transparent and reflective material, such as glass, is often spoiled by a superimposed layer of reflection image. While separating the reflection from a familiar object in an image is mentally…

Computer Vision and Pattern Recognition · Computer Science 2018-02-02 Zhixiang Chi , Xiaolin Wu , Xiao Shu , Jinjin Gu

GraphVAE: Towards Generation of Small Graphs Using Variational Autoencoders

Deep learning on graphs has become a popular research topic with many applications. However, past work has concentrated on learning graph embedding tasks, which is in contrast with advances in generative models for images and text. Is it…

Machine Learning · Computer Science 2018-02-13 Martin Simonovsky , Nikos Komodakis

Resolution Invariant Autoencoder

Deep learning has significantly advanced medical imaging analysis, yet variations in image resolution remain an overlooked challenge. Most methods address this by resampling images, leading to either information loss or computational…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Ashay Patel , Michela Antonelli , Sebastien Ourselin , M. Jorge Cardoso

Masked Transformer for image Anomaly Localization

Image anomaly detection consists in detecting images or image portions that are visually different from the majority of the samples in a dataset. The task is of practical importance for various real-life applications like biomedical image…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Axel De Nardin , Pankaj Mishra , Gian Luca Foresti , Claudio Piciarelli

Unsupervised Superpixel Generation using Edge-Sparse Embedding

Partitioning an image into superpixels based on the similarity of pixels with respect to features such as colour or spatial location can significantly reduce data complexity and improve subsequent image processing tasks. Initial algorithms…

Computer Vision and Pattern Recognition · Computer Science 2022-11-30 Jakob Geusen , Gustav Bredell , Tianfei Zhou , Ender Konukoglu

Color Space Transformation Network

Deep networks have become very popular over the past few years. The main reason for this widespread use is their excellent ability to learn and predict knowledge in a very easy and efficient way. Convolutional neural networks and…

Computer Vision and Pattern Recognition · Computer Science 2015-12-15 Alexandros Karargyris

Semantic Photo Manipulation with a Generative Image Prior

Despite the recent success of GANs in synthesizing images conditioned on inputs such as a user sketch, text, or semantic labels, manipulating the high-level attributes of an existing natural photograph with GANs is challenging for two…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 David Bau , Hendrik Strobelt , William Peebles , Jonas Wulff , Bolei Zhou , Jun-Yan Zhu , Antonio Torralba

Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform

We propose a versatile deep image compression network based on Spatial Feature Transform (SFT arXiv:1804.02815), which takes a source image and a corresponding quality map as inputs and produce a compressed image with variable rates. Our…

Image and Video Processing · Electrical Eng. & Systems 2021-08-24 Myungseo Song , Jinyoung Choi , Bohyung Han

AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Ligong Han , Sri Harsha Musunuri , Martin Renqiang Min , Ruijiang Gao , Yu Tian , Dimitris Metaxas

Hierarchical Photo-Scene Encoder for Album Storytelling

In this paper, we propose a novel model with a hierarchical photo-scene encoder and a reconstructor for the task of album storytelling. The photo-scene encoder contains two sub-encoders, namely the photo and scene encoders, which are…

Computer Vision and Pattern Recognition · Computer Science 2019-02-05 Bairui Wang , Lin Ma , Wei Zhang , Wenhao Jiang , Feng Zhang

MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image. To this end, we combine a convolutional…

Computer Vision and Pattern Recognition · Computer Science 2017-12-11 Ayush Tewari , Michael Zollhöfer , Hyeongwoo Kim , Pablo Garrido , Florian Bernard , Patrick Pérez , Christian Theobalt