Related papers: Swapping Autoencoder for Deep Image Manipulation

Wavelets to the Rescue: Improving Sample Quality of Latent Variable Deep Generative Models

Variational Autoencoders (VAE) are probabilistic deep generative models underpinned by elegant theory, stable training processes, and meaningful manifold representations. However, they produce blurry images due to a lack of explicit…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Prashnna K Gyawali, Rudra Saha, Linwei Wang, VSR Veeravasarapu, Maneesh Singh

NeVAE: A Deep Generative Model for Molecular Graphs

Deep generative models have been praised for their ability to learn smooth latent representation of images, text, and audio, which can then be used to generate new, plausible data. However, current generative models are unable to work with…

Machine Learning · Computer Science 2019-09-09 Bidisha Samanta, Abir De, Gourhari Jana, Pratim Kumar Chattaraj, Niloy Ganguly, Manuel Gomez-Rodriguez

Manipulating Embeddings of Stable Diffusion Prompts

Prompt engineering is still the primary way for users of generative text-to-image models to manipulate generated images in a targeted way. Based on treating the model as a continuous function and by passing gradients between the image space…

Computer Vision and Pattern Recognition · Computer Science 2024-06-25 Niklas Deckers, Julia Peters, Martin Potthast

Segmenter: Transformer for Semantic Segmentation

Image segmentation is often ambiguous at the level of individual image patches and requires contextual information to reach label consensus. In this paper we introduce Segmenter, a transformer model for semantic segmentation. In contrast to…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Robin Strudel, Ricardo Garcia, Ivan Laptev, Cordelia Schmid

Proactive Image Manipulation Detection

Image manipulation detection algorithms are often trained to discriminate between images manipulated with particular Generative Models (GMs) and genuine/real images, yet generalize poorly to images manipulated with GMs unseen in the…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Vishal Asnani, Xi Yin, Tal Hassner, Sijia Liu, Xiaoming Liu

Image Representation Learning Using Graph Regularized Auto-Encoders

We consider the problem of image representation for the tasks of unsupervised learning and semi-supervised learning. In those learning tasks, the raw image vectors may not provide enough representation for their intrinsic structures due to…

Machine Learning · Computer Science 2014-02-20 Yiyi Liao, Yue Wang, Yong Liu

Toward Spatially Unbiased Generative Models

Recent image generation models show remarkable generation performance. However, they mirror strong location preference in datasets, which we call spatial bias. Therefore, generators render poor samples at unseen locations and scales. We…

Machine Learning · Computer Science 2021-08-04 Jooyoung Choi, Jungbeom Lee, Yonghyun Jeong, Sungroh Yoon

Spatially Scalable Compressed Image Sensing with Hybrid Transform and Inter-layer Prediction Model

Compressive imaging is an emerging application of compressed sensing, devoted to acquisition, encoding and reconstruction of images using random projections as measurements. In this paper we propose a novel method to provide a scalable…

Information Theory · Computer Science 2013-10-07 Diego Valsesia, Enrico Magli

Supervised Symbolic Music Style Translation Using Synthetic Data

Research on style transfer and domain translation has clearly demonstrated the ability of deep learning-based algorithms to manipulate images in terms of artistic style. More recently, several attempts have been made to extend such…

Sound · Computer Science 2021-06-11 Ondřej Cífka, Umut Şimşekli, Gaël Richard

Overparameterized Neural Networks Implement Associative Memory

Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks…

Machine Learning · Computer Science 2022-05-25 Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

Representing Closed Transformation Paths in Encoded Network Latent Space

Deep generative networks have been widely used for learning mappings from a low-dimensional latent space to a high-dimensional data space. In many cases, data transformations are defined by linear paths in this latent space. However, the…

Machine Learning · Statistics 2019-12-06 Marissa Connor, Christopher Rozell

Deep Photo Cropper and Enhancer

This paper introduces a new type of image enhancement problem. Compared to traditional image enhancement methods, which mostly deal with pixel-wise modifications of a given photo, our proposed task is to crop an image which is embedded…

Computer Vision and Pattern Recognition · Computer Science 2020-08-04 Aaron Ott, Amir Mazaheri, Niels D. Lobo, Mubarak Shah

IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis

We present a novel introspective variational autoencoder (IntroVAE) model for synthesizing high-resolution photographic images. IntroVAE is capable of self-evaluating the quality of its generated samples and improving itself accordingly.…

Machine Learning · Computer Science 2018-10-30 Huaibo Huang, Zhihang Li, Ran He, Zhenan Sun, Tieniu Tan

Plausible Shading Decomposition For Layered Photo Retouching

Photographers routinely compose multiple manipulated photos of the same scene (layers) into a single image, which is better than any individual photo could be alone. Similarly, 3D artists set up rendering systems to produce layered images…

Graphics · Computer Science 2017-02-03 Carlo Innamorati, Tobias Ritschel, Tim Weyrich, Niloy J. Mitra

Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation

Multimodal machine translation (MMT) simultaneously takes the source sentence and a relevant image as input for translation. Since there is no paired image available for the input sentence in most cases, recent studies suggest utilizing…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Wenyu Guo, Qingkai Fang, Dong Yu, Yang Feng

Controllable Image Enhancement

Editing flat-looking images into stunning photographs requires skill and time. Automated image enhancement algorithms have attracted increased interest by generating high-quality images without user interaction. However, the quality…

Computer Vision and Pattern Recognition · Computer Science 2022-06-20 Heewon Kim, Kyoung Mu Lee

Data-driven plasma modelling: Surrogate collisional radiative models of fluorocarbon plasmas from deep generative autoencoders

We have developed a deep generative model that can produce accurate optical emission spectra and colour images of an ICP plasma using only the applied coil power, electrode power, pressure and gas flows as inputs -- essentially an empirical…

Plasma Physics · Physics 2023-06-27 Gregory A. Daly, Jonathan E. Fieldsend, Geoff Hassall, Gavin Tabor

A Pseudo-labelling Auto-Encoder for unsupervised image classification

In this paper, we introduce a unique variant of the denoising Auto-Encoder and combine it with the perceptual loss to classify images in an unsupervised manner. The proposed method, called Pseudo Labelling, consists of first applying a…

Computer Vision and Pattern Recognition · Computer Science 2021-06-01 Aymene Mohammed Bouayed, Karim Atif, Rachid Deriche, Abdelhakim Saim

Diverse Inpainting and Editing with GAN Inversion

Recent inversion methods have shown that real images can be inverted into StyleGAN's latent space and numerous edits can be achieved on those images thanks to the semantically rich feature representations of well-trained GAN models.…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Ahmet Burak Yildirim, Hamza Pehlivan, Bahri Batuhan Bilecen, Aysegul Dundar

Inverting the Imaging Process by Learning an Implicit Camera Model

Representing visual signals with implicit coordinate-based neural networks, as an effective replacement of the traditional discrete signal representation, has gained considerable popularity in computer vision and graphics. In contrast to…

Computer Vision and Pattern Recognition · Computer Science 2023-04-26 Xin Huang, Qi Zhang, Ying Feng, Hongdong Li, Qing Wang