Related papers: Multiresolution Convolutional Autoencoders

MONCAE: Multi-Objective Neuroevolution of Convolutional Autoencoders

In this paper, we present a novel neuroevolutionary method to identify the architecture and hyperparameters of convolutional autoencoders. Remarkably, we used a hypervolume indicator in the context of neural architecture search for…

Neural and Evolutionary Computing · Computer Science 2021-06-23 Daniel Dimanov , Emili Balaguer-Ballester , Colin Singleton , Shahin Rostami

Multi-level Convolutional Autoencoder Networks for Parametric Prediction of Spatio-temporal Dynamics

A data-driven framework is proposed towards the end of predictive modeling of complex spatio-temporal dynamics, leveraging nested non-linear manifolds. Three levels of neural networks are used, with the goal of predicting the future state…

Computational Physics · Physics 2020-09-14 Jiayang Xu , Karthik Duraisamy

Multimodal Masked Autoencoders Learn Transferable Representations

Building scalable models to learn from diverse, multimodal data remains an open challenge. For vision-language data, the dominant approaches are based on contrastive learning objectives that train a separate encoder for each modality. While…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Xinyang Geng , Hao Liu , Lisa Lee , Dale Schuurmans , Sergey Levine , Pieter Abbeel

HierCVAE: Hierarchical Attention-Driven Conditional Variational Autoencoders for Multi-Scale Temporal Modeling

Temporal modeling in complex systems requires capturing dependencies across multiple time scales while managing inherent uncertainties. We propose HierCVAE, a novel architecture that integrates hierarchical attention mechanisms with…

Machine Learning · Computer Science 2025-08-27 Yao Wu

Multiresolution Equivariant Graph Variational Autoencoder

In this paper, we propose Multiresolution Equivariant Graph Variational Autoencoders (MGVAE), the first hierarchical generative model to learn and generate graphs in a multiresolution and equivariant manner. At each resolution level, MGVAE…

Machine Learning · Computer Science 2022-06-30 Truong Son Hy , Risi Kondor

Reconstruction-Driven Multimodal Representation Learning for Automated Media Understanding

Broadcast and media organizations increasingly rely on artificial intelligence to automate the labor-intensive processes of content indexing, tagging, and metadata generation. However, existing AI systems typically operate on a single…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Yassir Benhammou , Suman Kalyan , Sujay Kumar

Unsupervised Motion Representation Learning with Capsule Autoencoders

We propose the Motion Capsule Autoencoder (MCAE), which addresses a key challenge in the unsupervised learning of motion representations: transformation invariance. MCAE models motion in a two-level hierarchy. In the lower level, a…

Computer Vision and Pattern Recognition · Computer Science 2021-10-04 Ziwei Xu , Xudong Shen , Yongkang Wong , Mohan S Kankanhalli

Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training

Masked Autoencoders (MAE) have shown great potentials in self-supervised pre-training for language and 2D image transformers. However, it still remains an open question on how to exploit masked autoencoding for learning 3D representations…

Computer Vision and Pattern Recognition · Computer Science 2022-10-17 Renrui Zhang , Ziyu Guo , Rongyao Fang , Bin Zhao , Dong Wang , Yu Qiao , Hongsheng Li , Peng Gao

A Structural Representation Learning for Multi-relational Networks

Most of the existing multi-relational network embedding methods, e.g., TransE, are formulated to preserve pair-wise connectivity structures in the networks. With the observations that significant triangular connectivity structures and…

Social and Information Networks · Computer Science 2018-06-11 Xin Li , Huiting Hong , Lin Liu , William K. Cheung

MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders

Medical images are acquired at high resolutions with large fields of view in order to capture fine-grained features necessary for clinical decision-making. Consequently, training deep learning models on medical images can incur large…

Image and Video Processing · Electrical Eng. & Systems 2025-06-04 Maya Varma , Ashwin Kumar , Rogier van der Sluijs , Sophie Ostmeier , Louis Blankemeier , Pierre Chambon , Christian Bluethgen , Jip Prince , Curtis Langlotz , Akshay Chaudhari

Masked Autoencoders Are Scalable Vision Learners

This paper shows that masked autoencoders (MAE) are scalable self-supervised learners for computer vision. Our MAE approach is simple: we mask random patches of the input image and reconstruct the missing pixels. It is based on two core…

Computer Vision and Pattern Recognition · Computer Science 2021-12-21 Kaiming He , Xinlei Chen , Saining Xie , Yanghao Li , Piotr Dollár , Ross Girshick

Multi-Scale Geometric Autoencoder

Autoencoders have emerged as powerful models for visualization and dimensionality reduction based on the fundamental assumption that high-dimensional data is generated from a low-dimensional manifold. A critical challenge in autoencoder…

Machine Learning · Computer Science 2025-09-30 Qipeng Zhan , Zhuoping Zhou , Zexuan Wang , Li Shen

MV2MAE: Multi-View Video Masked Autoencoders

Videos captured from multiple viewpoints can help in perceiving the 3D structure of the world and benefit computer vision tasks such as action recognition, tracking, etc. In this paper, we present a method for self-supervised learning from…

Computer Vision and Pattern Recognition · Computer Science 2024-01-30 Ketul Shah , Robert Crandall , Jie Xu , Peng Zhou , Marian George , Mayank Bansal , Rama Chellappa

Recurrent autoencoder with sequence-aware encoding

Recurrent Neural Networks (RNN) received a vast amount of attention last decade. Recently, the architectures of Recurrent AutoEncoders (RAE) found many applications in practice. RAE can extract the semantically valuable information, called…

Machine Learning · Computer Science 2021-06-14 Robert Susik

An Efficient and Explanatory Image and Text Clustering System with Multimodal Autoencoder Architecture

We demonstrate the efficiencies and explanatory abilities of extensions to the common tools of Autoencoders and LLM interpreters, in the novel context of comparing different cultural approaches to the same international news event. We…

Multimedia · Computer Science 2024-08-16 Tiancheng Shi , Yuanchen Wei , John R. Kender

Extreme Image Coding via Multiscale Autoencoders With Generative Adversarial Optimization

We propose a MultiScale AutoEncoder(MSAE) based extreme image compression framework to offer visually pleasing reconstruction at a very low bitrate. Our method leverages the "priors" at different resolution scale to improve the compression…

Image and Video Processing · Electrical Eng. & Systems 2020-01-06 Chao Huang , Haojie Liu , Tong Chen , Qiu Shen , Zhan Ma

A Hybrid Convolutional Variational Autoencoder for Text Generation

In this paper we explore the effect of architectural choices on learning a Variational Autoencoder (VAE) for text generation. In contrast to the previously introduced VAE model for text where both the encoder and decoder are RNNs, we…

Computation and Language · Computer Science 2017-02-09 Stanislau Semeniuta , Aliaksei Severyn , Erhardt Barth

Introducing a microstructure-embedded autoencoder approach for reconstructing high-resolution solution field data from a reduced parametric space

In this study, we develop a novel multi-fidelity deep learning approach that transforms low-fidelity solution maps into high-fidelity ones by incorporating parametric space information into a standard autoencoder architecture. This method's…

Computational Engineering, Finance, and Science · Computer Science 2024-05-08 Rasoul Najafi Koopas , Shahed Rezaei , Natalie Rauter , Richard Ostwald , Rolf Lammering

Deep Convolutional AutoEncoder-based Lossy Image Compression

Image compression has been investigated as a fundamental research topic for many decades. Recently, deep learning has achieved great success in many computer vision tasks, and is gradually being used in image compression. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2018-04-26 Zhengxue Cheng , Heming Sun , Masaru Takeuchi , Jiro Katto

Efficient Convolutional Auto-Encoding via Random Convexification and Frequency-Domain Minimization

The omnipresence of deep learning architectures such as deep convolutional neural networks (CNN)s is fueled by the synergistic combination of ever-increasing labeled datasets and specialized hardware. Despite the indisputable success, the…

Machine Learning · Statistics 2016-11-29 Meshia Cédric Oveneke , Mitchel Aliosha-Perez , Yong Zhao , Dongmei Jiang , Hichem Sahli