English
Related papers

Related papers: Swapping Autoencoder for Deep Image Manipulation

200 papers

Variational Autoencoders (VAE) are probabilistic deep generative models underpinned by elegant theory, stable training processes, and meaningful manifold representations. However, they produce blurry images due to a lack of explicit…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Prashnna K Gyawali , Rudra Saha , Linwei Wang , VSR Veeravasarapu , Maneesh Singh

Deep generative models have been praised for their ability to learn smooth latent representation of images, text, and audio, which can then be used to generate new, plausible data. However, current generative models are unable to work with…

Machine Learning · Computer Science 2019-09-09 Bidisha Samanta , Abir De , Gourhari Jana , Pratim Kumar Chattaraj , Niloy Ganguly , Manuel Gomez-Rodriguez

Prompt engineering is still the primary way for users of generative text-to-image models to manipulate generated images in a targeted way. Based on treating the model as a continuous function and by passing gradients between the image space…

Computer Vision and Pattern Recognition · Computer Science 2024-06-25 Niklas Deckers , Julia Peters , Martin Potthast

Image segmentation is often ambiguous at the level of individual image patches and requires contextual information to reach label consensus. In this paper we introduce Segmenter, a transformer model for semantic segmentation. In contrast to…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Robin Strudel , Ricardo Garcia , Ivan Laptev , Cordelia Schmid

Image manipulation detection algorithms are often trained to discriminate between images manipulated with particular Generative Models (GMs) and genuine/real images, yet generalize poorly to images manipulated with GMs unseen in the…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Vishal Asnani , Xi Yin , Tal Hassner , Sijia Liu , Xiaoming Liu

We consider the problem of image representation for the tasks of unsupervised learning and semi-supervised learning. In those learning tasks, the raw image vectors may not provide enough representation for their intrinsic structures due to…

Machine Learning · Computer Science 2014-02-20 Yiyi Liao , Yue Wang , Yong Liu

Recent image generation models show remarkable generation performance. However, they mirror strong location preference in datasets, which we call spatial bias. Therefore, generators render poor samples at unseen locations and scales. We…

Machine Learning · Computer Science 2021-08-04 Jooyoung Choi , Jungbeom Lee , Yonghyun Jeong , Sungroh Yoon

Compressive imaging is an emerging application of compressed sensing, devoted to acquisition, encoding and reconstruction of images using random projections as measurements. In this paper we propose a novel method to provide a scalable…

Information Theory · Computer Science 2013-10-07 Diego Valsesia , Enrico Magli

Research on style transfer and domain translation has clearly demonstrated the ability of deep learning-based algorithms to manipulate images in terms of artistic style. More recently, several attempts have been made to extend such…

Sound · Computer Science 2021-06-11 Ondřej Cífka , Umut Şimşekli , Gaël Richard

Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks…

Machine Learning · Computer Science 2022-05-25 Adityanarayanan Radhakrishnan , Mikhail Belkin , Caroline Uhler

Deep generative networks have been widely used for learning mappings from a low-dimensional latent space to a high-dimensional data space. In many cases, data transformations are defined by linear paths in this latent space. However, the…

Machine Learning · Statistics 2019-12-06 Marissa Connor , Christopher Rozell

This paper introduces a new type of image enhancement problem. Compared to traditional image enhancement methods, which mostly deal with pixel-wise modifications of a given photo, our proposed task is to crop an image which is embedded…

Computer Vision and Pattern Recognition · Computer Science 2020-08-04 Aaron Ott , Amir Mazaheri , Niels D. Lobo , Mubarak Shah

We present a novel introspective variational autoencoder (IntroVAE) model for synthesizing high-resolution photographic images. IntroVAE is capable of self-evaluating the quality of its generated samples and improving itself accordingly.…

Machine Learning · Computer Science 2018-10-30 Huaibo Huang , Zhihang Li , Ran He , Zhenan Sun , Tieniu Tan

Photographers routinely compose multiple manipulated photos of the same scene (layers) into a single image, which is better than any individual photo could be alone. Similarly, 3D artists set up rendering systems to produce layered images…

Graphics · Computer Science 2017-02-03 Carlo Innamorati , Tobias Ritschel , Tim Weyrich , Niloy J. Mitra

Multimodal machine translation (MMT) simultaneously takes the source sentence and a relevant image as input for translation. Since there is no paired image available for the input sentence in most cases, recent studies suggest utilizing…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Wenyu Guo , Qingkai Fang , Dong Yu , Yang Feng

Editing flat-looking images into stunning photographs requires skill and time. Automated image enhancement algorithms have attracted increased interest by generating high-quality images without user interaction. However, the quality…

Computer Vision and Pattern Recognition · Computer Science 2022-06-20 Heewon Kim , Kyoung Mu Lee

We have developed a deep generative model that can produce accurate optical emission spectra and colour images of an ICP plasma using only the applied coil power, electrode power, pressure and gas flows as inputs -- essentially an empirical…

Plasma Physics · Physics 2023-06-27 Gregory A. Daly , Jonathan E. Fieldsend , Geoff Hassall , Gavin Tabor

In this paper, we introduce a unique variant of the denoising Auto-Encoder and combine it with the perceptual loss to classify images in an unsupervised manner. The proposed method, called Pseudo Labelling, consists of first applying a…

Computer Vision and Pattern Recognition · Computer Science 2021-06-01 Aymene Mohammed Bouayed , Karim Atif , Rachid Deriche , Abdelhakim Saim

Recent inversion methods have shown that real images can be inverted into StyleGAN's latent space and numerous edits can be achieved on those images thanks to the semantically rich feature representations of well-trained GAN models.…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Ahmet Burak Yildirim , Hamza Pehlivan , Bahri Batuhan Bilecen , Aysegul Dundar

Representing visual signals with implicit coordinate-based neural networks, as an effective replacement of the traditional discrete signal representation, has gained considerable popularity in computer vision and graphics. In contrast to…

Computer Vision and Pattern Recognition · Computer Science 2023-04-26 Xin Huang , Qi Zhang , Ying Feng , Hongdong Li , Qing Wang