Related papers: Split Hierarchical Variational Compression

HyVIC: A Metric-Driven Spatio-Spectral Hyperspectral Image Compression Architecture Based on Variational Autoencoders

The rapid growth of hyperspectral data archives in remote sensing (RS) necessitates effective compression methods for storage and transmission. Recent advances in learning-based hyperspectral image (HSI) compression have significantly…

Computer Vision and Pattern Recognition · Computer Science 2026-03-30 Martin Hermann Paul Fuchs , Behnood Rasti , Begüm Demir

Enhancing variational generation through self-decomposition

In this article we introduce the notion of Split Variational Autoencoder (SVAE), whose output $\hat{x}$ is obtained as a weighted sum $\sigma \odot \hat{x_1} + (1-\sigma) \odot \hat{x_2}$ of two generated images $\hat{x_1},\hat{x_2}$, and…

Computer Vision and Pattern Recognition · Computer Science 2022-07-15 Andrea Asperti , Laura Bugo , Daniele Filippini

SC-VAE: Sparse Coding-based Variational Autoencoder with Learned ISTA

Learning rich data representations from unlabeled data is a key challenge towards applying deep learning algorithms in downstream tasks. Several variants of variational autoencoders (VAEs) have been proposed to learn compact data…

Computer Vision and Pattern Recognition · Computer Science 2024-01-11 Pan Xiao , Peijie Qiu , Sungmin Ha , Abdalla Bani , Shuang Zhou , Aristeidis Sotiras

Deep Hierarchical Video Compression

Recently, probabilistic predictive coding that directly models the conditional distribution of latent features across successive frames for temporal redundancy removal has yielded promising results. Existing methods using a single-scale…

Image and Video Processing · Electrical Eng. & Systems 2023-12-13 Ming Lu , Zhihao Duan , Fengqing Zhu , Zhan Ma

High-Efficiency Neural Video Compression via Hierarchical Predictive Learning

The enhanced Deep Hierarchical Video Compression-DHVC 2.0-has been introduced. This single-model neural video codec operates across a broad range of bitrates, delivering not only superior compression performance to representative methods…

Image and Video Processing · Electrical Eng. & Systems 2024-10-04 Ming Lu , Zhihao Duan , Wuyang Cong , Dandan Ding , Fengqing Zhu , Zhan Ma

Diverse super-resolution with pretrained deep hiererarchical VAEs

We investigate the problem of producing diverse solutions to an image super-resolution problem. From a probabilistic perspective, this can be done by sampling from the posterior distribution of an inverse problem, which requires the…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Jean Prost , Antoine Houdard , Andrés Almansa , Nicolas Papadakis

Convolutional variational autoencoders for secure lossy image compression in remote sensing

The volume of remote sensing data is experiencing rapid growth, primarily due to the plethora of space and air platforms equipped with an array of sensors. Due to limited hardware and battery constraints the data is transmitted back to…

Image and Video Processing · Electrical Eng. & Systems 2024-04-18 Alessandro Giuliano , S. Andrew Gadsden , Waleed Hilal , John Yawney

Learned Image Transmission with Hierarchical Variational Autoencoder

In this paper, we introduce an innovative hierarchical joint source-channel coding (HJSCC) framework for image transmission, utilizing a hierarchical variational autoencoder (VAE). Our approach leverages a combination of bottom-up and…

Image and Video Processing · Electrical Eng. & Systems 2025-03-18 Guangyi Zhang , Hanlei Li , Yunlong Cai , Qiyu Hu , Guanding Yu , Runmin Zhang

NVAE: A Deep Hierarchical Variational Autoencoder

Normalizing flows, autoregressive models, variational autoencoders (VAEs), and deep energy-based models are among competing likelihood-based frameworks for deep generative learning. Among them, VAEs have the advantage of fast and tractable…

Machine Learning · Statistics 2021-01-11 Arash Vahdat , Jan Kautz

Multispectral Image Compression Based on HEVC Using Pel-Recursive Inter-Band Prediction

Recent developments in optical sensors enable a wide range of applications for multispectral imaging, e.g., in surveillance, optical sorting, and life-science instrumentation. Increasing spatial and spectral resolution allows creating…

Image and Video Processing · Electrical Eng. & Systems 2023-03-10 Anna Meyer , Nils Genser , André Kaup

Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes

Deep learning has gained significant attention in medical image segmentation. However, the limited availability of annotated training data presents a challenge to achieving accurate results. In efforts to overcome this challenge, data…

Image and Video Processing · Electrical Eng. & Systems 2024-08-16 Aghiles Kebaili , Jérôme Lapuyade-Lahorgue , Pierre Vera , Su Ruan

Self-Supervised Variational Auto-Encoders

Density estimation, compression and data generation are crucial tasks in artificial intelligence. Variational Auto-Encoders (VAEs) constitute a single framework to achieve these goals. Here, we present a novel class of generative models,…

Machine Learning · Statistics 2021-07-07 Ioannis Gatopoulos , Jakub M. Tomczak

Lossy Image Compression with Quantized Hierarchical VAEs

Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory. Motivated by this, we consider the problem of lossy image compression from the perspective of generative…

Image and Video Processing · Electrical Eng. & Systems 2023-03-28 Zhihao Duan , Ming Lu , Zhan Ma , Fengqing Zhu

PH-VAE: A Polynomial Hierarchical Variational Autoencoder Towards Disentangled Representation Learning

The variational autoencoder (VAE) is a simple and efficient generative artificial intelligence method for modeling complex probability distributions of various types of data, such as images and texts. However, it suffers some main…

Machine Learning · Computer Science 2025-02-14 Xi Chen , Shaofan Li

Hi-VAE: Efficient Video Autoencoding with Global and Detailed Motion

Recent breakthroughs in video autoencoders (Video AEs) have advanced video generation, but existing methods fail to efficiently model spatio-temporal redundancies in dynamics, resulting in suboptimal compression factors. This shortfall…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Huaize Liu , Wenzhang Sun , Qiyuan Zhang , Donglin Di , Biao Gong , Hao Li , Chen Wei , Changqing Zou

Improving Inference for Neural Image Compression

We consider the problem of lossy image compression with deep latent variable models. State-of-the-art methods build on hierarchical variational autoencoders (VAEs) and learn inference networks to predict a compressible latent representation…

Image and Video Processing · Electrical Eng. & Systems 2021-01-11 Yibo Yang , Robert Bamler , Stephan Mandt

FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion

Sensor fusion can significantly improve the performance of many computer vision tasks. However, traditional fusion approaches are either not data-driven and cannot exploit prior knowledge nor find regularities in a given dataset or they are…

Computer Vision and Pattern Recognition · Computer Science 2022-09-26 Fabian Duffhauss , Ngo Anh Vien , Hanna Ziesche , Gerhard Neumann

End-to-end autoencoding architecture for the simultaneous generation of medical images and corresponding segmentation masks

Despite the increasing use of deep learning in medical image segmentation, acquiring sufficient training data remains a challenge in the medical field. In response, data augmentation techniques have been proposed; however, the generation of…

Image and Video Processing · Electrical Eng. & Systems 2024-06-11 Aghiles Kebaili , Jérôme Lapuyade-Lahorgue , Pierre Vera , Su Ruan

Improved Video VAE for Latent Video Diffusion Model

Variational Autoencoder (VAE) aims to compress pixel data into low-dimensional latent space, playing an important role in OpenAI's Sora and other latent video diffusion generation models. While most of existing video VAEs inflate a…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Pingyu Wu , Kai Zhu , Yu Liu , Liming Zhao , Wei Zhai , Yang Cao , Zheng-Jun Zha

Quantum Down Sampling Filter for Variational Auto-encoder

Variational autoencoders (VAEs) are fundamental for generative modeling and image reconstruction, yet their performance often struggles to maintain high fidelity in reconstructions. This study introduces a hybrid model, quantum variational…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Farina Riaz , Fakhar Zaman , Hajime Suzuki , Sharif Abuadbba , David Nguyen