English
Related papers

Related papers: Transform Invariant Auto-encoder

200 papers

This paper proposes a theoretical framework on the mechanism of autoencoders. To the encoder part, under the main use of dimensionality reduction, we investigate its two fundamental properties: bijective maps and data disentangling. The…

Machine Learning · Computer Science 2022-12-13 Changcun Huang

Establishing correspondences between 3D shapes is a fundamental task in 3D Computer Vision, typically addressed by matching local descriptors. Recently, a few attempts at applying the deep learning paradigm to the task have shown promising…

Computer Vision and Pattern Recognition · Computer Science 2019-09-17 Riccardo Spezialetti , Samuele Salti , Luigi Di Stefano

In this work we investigate how to achieve equivariance to input transformations in deep networks, purely from data, without being given a model of those transformations. Convolutional Neural Networks (CNNs), for example, are equivariant to…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Jianbo Jiao , João F. Henriques

The joint optimization of the reconstruction and classification error is a hard non convex problem, especially when a non linear mapping is utilized. In order to overcome this obstacle, a novel optimization strategy is proposed, in which a…

Machine Learning · Computer Science 2022-11-07 Ioannis A. Nellas , Sotiris K. Tasoulis , Vassilis P. Plagianakos , Spiros V. Georgakopoulos

In this work we introduce Deforming Autoencoders, a generative model for images that disentangles shape from appearance in an unsupervised manner. As in the deformable template paradigm, shape is represented as a deformation between a…

Computer Vision and Pattern Recognition · Computer Science 2018-06-19 Zhixin Shu , Mihir Sahasrabudhe , Alp Guler , Dimitris Samaras , Nikos Paragios , Iasonas Kokkinos

Weakly-supervised anomaly detection aims at learning an anomaly detector from a limited amount of labeled data and abundant unlabeled data. Recent works build deep neural networks for anomaly detection by discriminatively mapping the normal…

Machine Learning · Computer Science 2021-08-29 Yingjie Zhou , Xucheng Song , Yanru Zhang , Fanxing Liu , Ce Zhu , Lingqiao Liu

Deep generative models have become increasingly effective at producing realistic images from randomly sampled seeds, but using such models for controllable manipulation of existing images remains challenging. We propose the Swapping…

Computer Vision and Pattern Recognition · Computer Science 2020-12-15 Taesung Park , Jun-Yan Zhu , Oliver Wang , Jingwan Lu , Eli Shechtman , Alexei A. Efros , Richard Zhang

Autoencoders are certainly among the most studied and used Deep Learning models: the idea behind them is to train a model in order to reconstruct the same input data. The peculiarity of these models is to compress the information through a…

Machine Learning · Computer Science 2023-09-06 Gabriele Martino , Davide Moroni , Massimo Martinelli

Autoencoders are techniques for data representation learning based on artificial neural networks. Differently to other feature learning methods which may be focused on finding specific transformations of the feature space, they can be…

Machine Learning · Computer Science 2020-05-12 David Charte , Francisco Charte , María J. del Jesus , Francisco Herrera

Deep learning has significantly advanced medical imaging analysis, yet variations in image resolution remain an overlooked challenge. Most methods address this by resampling images, leading to either information loss or computational…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Ashay Patel , Michela Antonelli , Sebastien Ourselin , M. Jorge Cardoso

We propose a novel machine learning strategy for studying neuroanatomical shape variation. Our model works with volumetric binary segmentation images, and requires no pre-processing such as the extraction of surface points or a mesh. The…

Computer Vision and Pattern Recognition · Computer Science 2024-10-30 Evan M. Yu , Mert R. Sabuncu

Image-to-image translation is a subset of computer vision and pattern recognition problems where our goal is to learn a mapping between input images of domain $\mathbf{X}_1$ and output images of domain $\mathbf{X}_2$. Current methods use…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Safalya Pal

Neural rendering techniques promise efficient photo-realistic image synthesis while at the same time providing rich control over scene parameters by learning the physical image formation process. While several supervised methods have been…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Hassan Abu Alhaija , Siva Karthik Mustikovela , Justus Thies , Varun Jampani , Matthias Nießner , Andreas Geiger , Carsten Rother

An autoencoder is a specific type of a neural network, which is mainly designed to encode the input into a compressed and meaningful representation, and then decode it back such that the reconstructed input is similar as possible to the…

Machine Learning · Computer Science 2021-04-06 Dor Bank , Noam Koenigstein , Raja Giryes

This work presents the first convolutional neural network that learns an image-to-graph translation task without needing external supervision. Obtaining graph representations of image content, where objects are represented as nodes and…

Computer Vision and Pattern Recognition · Computer Science 2020-12-14 Chenyang Lu , Gijs Dubbelman

In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement. We start by employing a…

Computer Vision and Pattern Recognition · Computer Science 2022-08-19 Mohamed Ali Souibgui , Sanket Biswas , Andres Mafla , Ali Furkan Biten , Alicia Fornés , Yousri Kessentini , Josep Lladós , Lluis Gomez , Dimosthenis Karatzas

Common domain shift problem formulations consider the integration of multiple source domains, or the target domain during training. Regarding the generalization of machine learning models between different car interiors, we formulate the…

Computer Vision and Pattern Recognition · Computer Science 2021-05-10 Steve Dias Da Cruz , Bertram Taetz , Oliver Wasenmüller , Thomas Stifter , Didier Stricker

An autoencoder is a neural network which data projects to and from a lower dimensional latent space, where this data is easier to understand and model. The autoencoder consists of two sub-networks, the encoder and the decoder, which carry…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Saïd Ladjal , Alasdair Newson , Chi-Hieu Pham

This study proposes an automated data mining framework based on autoencoders and experimentally verifies its effectiveness in feature extraction and data dimensionality reduction. Through the encoding-decoding structure, the autoencoder can…

Machine Learning · Computer Science 2024-12-04 Yaxin Liang , Xinshi Li , Xin Huang , Ziqi Zhang , Yue Yao

We construct a new kind of encoder, leveraging the expressive power of diffusion models. In a traditional variational autoencoder, the encoder and decoder jointly negotiate a latent representation of the input. This is made possible by the…

Machine Learning · Computer Science 2026-05-14 Akhil Premkumar , Sarah Lucioni
‹ Prev 1 2 3 10 Next ›