Related papers: Feedback Recurrent Autoencoder for Video Compressi…

200 papers

In this paper we present a deep generative model for lossy video compression. We employ a model that consists of a 3D autoencoder with a discrete latent space and an autoregressive prior used for entropy coding. Both autoencoder and prior…

Image and Video Processing · Electrical Eng. & Systems 2020-05-11 Amirhossein Habibian, Ties van Rozendaal, Jakub M. Tomczak, Taco S. Cohen

The past few years have witnessed increasing interest in applying deep learning to video compression. However, the existing approaches compress a video frame with only a small number of reference frames, which limits their ability to fully…

Image and Video Processing · Electrical Eng. & Systems 2021-03-18 Ren Yang, Fabian Mentzer, Luc Van Gool, Radu Timofte

Autoencoder-based structures have dominated recent learned image compression methods. However, the inherent information loss associated with autoencoders limits their rate-distortion performance at high bit rates and restricts their…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Hanyue Tu, Siqi Wu, Li Li, Wengang Zhou, Houqiang Li

In recent years, neural network-based image compression techniques have been able to outperform traditional codecs and have opened the gates for the development of learning-based video codecs. However, to take advantage of the high temporal…

Image and Video Processing · Electrical Eng. & Systems 2020-08-25 Aishwarya Jadhav

We propose a new approach to the problem of optimizing autoencoders for lossy image compression. New media formats, changing hardware technology, as well as diverse requirements and content types create a need for compression algorithms…

Machine Learning · Statistics 2017-03-02 Lucas Theis, Wenzhe Shi, Andrew Cunningham, Ferenc Huszár

Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional…

Image and Video Processing · Electrical Eng. & Systems 2019-04-09 Guo Lu, Wanli Ouyang, Dong Xu, Xiaoyun Zhang, Chunlei Cai, Zhiyong Gao

While recent machine learning research has revealed connections between deep generative models such as VAEs and rate-distortion losses used in learned compression, most of this work has focused on images. In a similar spirit, we view…

Image and Video Processing · Electrical Eng. & Systems 2024-10-28 Ruihan Yang, Yibo Yang, Joseph Marino, Stephan Mandt

The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Jun Han, Salvator Lombardo, Christopher Schroers, Stephan Mandt

A large fraction of Internet traffic is now driven by requests from mobile devices with relatively small screens and often stringent bandwidth requirements. Due to these factors, it has become the norm for modern graphics-heavy websites to…

Computer Vision and Pattern Recognition · Computer Science 2016-03-03 George Toderici, Sean M. O'Malley, Sung Jin Hwang, Damien Vincent, David Minnen, Shumeet Baluja, Michele Covell, Rahul Sukthankar

In this paper, we propose a novel variable-rate learned image compression framework with a conditional autoencoder. Previous learning-based image compression methods mostly require training separate networks for different compression rates…

Image and Video Processing · Electrical Eng. & Systems 2019-09-12 Yoojin Choi, Mostafa El-Khamy, Jungwon Lee

In this work we propose a novel deep learning approach for ultra-low bitrate video compression for video conferencing applications. To address the shortcomings of current video compression paradigms when the available bandwidth is extremely…

Computer Vision and Pattern Recognition · Computer Science 2020-12-02 Goluck Konuko, Giuseppe Valenzise, Stéphane Lathuilière

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang, Haotian Wu, Richeng Jin, Deniz Gunduz, Krystian Mikolajczyk

We present Recurrent Video Masked-Autoencoders (RVM): a novel approach to video representation learning that leverages recurrent computation to model the temporal structure of video data. RVM couples an asymmetric masking objective with a…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Daniel Zoran, Nikhil Parthasarathy, Yi Yang, Drew A Hudson, Joao Carreira, Andrew Zisserman

With exponential growth in the use of digital image data, the need for efficient transmission methods has become imperative. Traditional image compression techniques often sacrifice image fidelity for reduced file sizes, challenging…

Image and Video Processing · Electrical Eng. & Systems 2024-10-15 Aryan Kashyap Naveen, Sunil Thunga, Anuhya Murki, Mahati A Kalale, Shriya Anil

Video variational autoencoders (VAEs) used in latent diffusion models typically require a sufficiently large number of latent channels to ensure high-quality video reconstruction. However, recent studies have revealed that an excessive…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Jiarui Guan, Wenshuai Zhao, Zhengtao Zou, Juho Kannala, Arno Solin

One of the core components of conventional (i.e., non-learned) video codecs consists of predicting a frame from a previously-decoded frame, by leveraging temporal correlations. In this paper, we propose an end-to-end learned system for…

Image and Video Processing · Electrical Eng. & Systems 2020-04-22 Nannan Zou, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, Esa Rahtu

Recent advancements in information technology and the widespread use of the Internet have led to easier access to data worldwide. As a result, transmitting data through noisy channels is inevitable. Reducing the size of data and protecting…

We present a new algorithm for video coding, learned end-to-end for the low-latency mode. In this setting, our approach outperforms all existing video codecs across nearly the entire bitrate range. To our knowledge, this is the first…

Image and Video Processing · Electrical Eng. & Systems 2018-11-20 Oren Rippel, Sanjay Nair, Carissa Lew, Steve Branson, Alexander G. Anderson, Lubomir Bourdev

Video compression is a fundamental topic in visual intelligence, bridging visual signal sensing/capturing and high-level visual analytics. The broad success of artificial intelligence (AI) technology has enriched the horizon of video…

Image and Video Processing · Electrical Eng. & Systems 2025-05-01 Chuanmin Jia, Feng Ye, Siwei Ma, Wen Gao, Huifang Sun, Leonardo Chiariglione

In recent years, the field of learned video compression has witnessed rapid advancement, exemplified by the latest neural video codec, DCVC-DC, which has outperformed the upcoming next-generation codec ECM in terms of compression ratio.…

Image and Video Processing · Electrical Eng. & Systems 2024-07-24 Zidian Qiu, Zongyao He, Zhi Jin