Related papers: Variable Rate Deep Image Compression with Modulate…

Variable Rate Deep Image Compression With a Conditional Autoencoder

In this paper, we propose a novel variable-rate learned image compression framework with a conditional autoencoder. Previous learning-based image compression methods mostly require training separate networks for different compression rates…

Image and Video Processing · Electrical Eng. & Systems 2019-09-12 Yoojin Choi , Mostafa El-Khamy , Jungwon Lee

An Improved Upper Bound on the Rate-Distortion Function of Images

Recent work has shown that Variational Autoencoders (VAEs) can be used to upper-bound the information rate-distortion (R-D) function of images, i.e., the fundamental limit of lossy image compression. In this paper, we report an improved…

Image and Video Processing · Electrical Eng. & Systems 2023-09-07 Zhihao Duan , Jack Ma , Jiangpeng He , Fengqing Zhu

Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve

Variational autoencoders (VAEs) are powerful tools for learning latent representations of data used in a wide range of applications. In practice, VAEs usually require multiple training rounds to choose the amount of information the latent…

Machine Learning · Computer Science 2023-08-21 Juhan Bae , Michael R. Zhang , Michael Ruan , Eric Wang , So Hasegawa , Jimmy Ba , Roger Grosse

Slimmable Compressive Autoencoders for Practical Neural Image Compression

Neural image compression leverages deep neural networks to outperform traditional image codecs in rate-distortion performance. However, the resulting models are also heavy, computationally demanding and generally optimized for a single…

Image and Video Processing · Electrical Eng. & Systems 2022-05-03 Fei Yang , Luis Herranz , Yongmei Cheng , Mikhail G. Mozerov

Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression

Autoencoder-based structures have dominated recent learned image compression methods. However, the inherent information loss associated with autoencoders limits their rate-distortion performance at high bit rates and restricts their…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Hanyue Tu , Siqi Wu , Li Li , Wengang Zhou , Houqiang Li

MTC-VAE: Multi-Level Temporal Compression with Content Awareness

Latent Video Diffusion Models (LVDMs) rely on Variational Autoencoders (VAEs) to compress videos into compact latent representations. For continuous Variational Autoencoders (VAEs), achieving higher compression rates is desirable; yet, the…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Yubo Dong , Linchao Zhu

Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network

Neural Video Compression (NVC) has achieved remarkable performance in recent years. However, precise rate control remains a challenge due to the inherent limitations of learning-based codecs. To solve this issue, we propose a dynamic video…

Computer Vision and Pattern Recognition · Computer Science 2025-08-29 Chenhao Zhang , Wei Gao

Lossy Image Compression with Quantized Hierarchical VAEs

Recent research has shown a strong theoretical connection between variational autoencoders (VAEs) and the rate-distortion theory. Motivated by this, we consider the problem of lossy image compression from the perspective of generative…

Image and Video Processing · Electrical Eng. & Systems 2023-03-28 Zhihao Duan , Ming Lu , Zhan Ma , Fengqing Zhu

Flexible Variable-Rate Image Feature Compression for Edge-Cloud Systems

Feature compression is a promising direction for coding for machines. Existing methods have made substantial progress, but they require designing and training separate neural network models to meet different specifications of compression…

Image and Video Processing · Electrical Eng. & Systems 2024-04-02 Md Adnan Faisal Hossain , Zhihao Duan , Yuning Huang , Fengqing Zhu

High-Fidelity Variable-Rate Image Compression via Invertible Activation Transformation

Learning-based methods have effectively promoted the community of image compression. Meanwhile, variational autoencoder (VAE) based variable-rate approaches have recently gained much attention to avoid the usage of a set of different…

Image and Video Processing · Electrical Eng. & Systems 2022-09-13 Shilv Cai , Zhijun Zhang , Liqun Chen , Luxin Yan , Sheng Zhong , Xu Zou

Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets

Achieving successful variable bitrate compression with computationally simple algorithms from a single end-to-end learned image or video compression model remains a challenge. Many approaches have been proposed, including conditional…

Image and Video Processing · Electrical Eng. & Systems 2024-03-01 Fatih Kamisli , Fabien Racape , Hyomin Choi

Latent-Compressed Variational Autoencoder for Video Diffusion Models

Video variational autoencoders (VAEs) used in latent diffusion models typically require a sufficiently large number of latent channels to ensure high-quality video reconstruction. However, recent studies have revealed that an excessive…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Jiarui Guan , Wenshuai Zhao , Zhengtao Zou , Juho Kannala , Arno Solin

Feedback Recurrent Autoencoder for Video Compression

Recent advances in deep generative modeling have enabled efficient modeling of high dimensional data distributions and opened up a new horizon for solving data compression problems. Specifically, autoencoder based learned image or video…

Machine Learning · Computer Science 2020-04-10 Adam Golinski , Reza Pourreza , Yang Yang , Guillaume Sautiere , Taco S Cohen

Transformer-based Variable-rate Image Compression with Region-of-interest Control

This paper proposes a transformer-based learned image compression system. It is capable of achieving variable-rate compression with a single model while supporting the region-of-interest (ROI) functionality. Inspired by prompt tuning, we…

Image and Video Processing · Electrical Eng. & Systems 2023-08-02 Chia-Hao Kao , Ying-Chieh Weng , Yi-Hsin Chen , Wei-Chen Chiu , Wen-Hsiao Peng

Large Motion Video Autoencoding with Cross-modal Video VAE

Learning a robust video Variational Autoencoder (VAE) is essential for reducing video redundancy and facilitating efficient video generation. Directly applying image VAEs to individual frames in isolation can result in temporal…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Yazhou Xing , Yang Fei , Yingqing He , Jingye Chen , Jiaxin Xie , Xiaowei Chi , Qifeng Chen

Variational Mutual Information Maximization Framework for VAE Latent Codes with Continuous and Discrete Priors

Learning interpretable and disentangled representations of data is a key topic in machine learning research. Variational Autoencoder (VAE) is a scalable method for learning directed latent variable models of complex data. It employs a clear…

Machine Learning · Computer Science 2020-06-04 Andriy Serdega , Dae-Shik Kim

Adaptive Compression of the Latent Space in Variational Autoencoders

Variational Autoencoders (VAEs) are powerful generative models that have been widely used in various fields, including image and text generation. However, one of the known challenges in using VAEs is the model's sensitivity to its…

Machine Learning · Computer Science 2024-12-31 Gabriela Sejnova , Michal Vavrecka , Karla Stepanova

Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation

With the development of deep learning techniques, the combination of deep learning with image compression has drawn lots of attention. Recently, learned image compression methods had exceeded their classical counterparts in terms of…

Image and Video Processing · Electrical Eng. & Systems 2022-08-03 Ze Cui , Jing Wang , Shangyin Gao , Bo Bai , Tiansheng Guo , Yihui Feng

Deep Compressive Autoencoder for Action Potential Compression in Large-Scale Neural Recording

Understanding the coordinated activity underlying brain computations requires large-scale, simultaneous recordings from distributed neuronal structures at a cellular-level resolution. One major hurdle to design high-bandwidth,…

Neural and Evolutionary Computing · Computer Science 2018-09-18 Tong Wu , Wenfeng Zhao , Edward Keefer , Zhi Yang

Modular Autoencoders for Ensemble Feature Extraction

We introduce the concept of a Modular Autoencoder (MAE), capable of learning a set of diverse but complementary representations from unlabelled data, that can later be used for supervised tasks. The learning of the representations is…

Machine Learning · Computer Science 2015-11-24 Henry W J Reeve , Gavin Brown