Related papers: Cross-Platform Neural Video Coding: A Case Study

Towards Real-Time Neural Video Codec for Cross-Platform Application Using Calibration Information

The state-of-the-art neural video codecs have outperformed the most sophisticated traditional codecs in terms of RD performance in certain cases. However, utilizing them for practical applications is still challenging for two major reasons.…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Kuan Tian, Yonghang Guan, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang

Improved Encoding for Overfitted Video Codecs

Overfitted neural video codecs offer a decoding complexity orders of magnitude smaller than their autoencoder counterparts. Yet, this low complexity comes at the cost of limited compression efficiency, in part due to their difficulty…

Image and Video Processing · Electrical Eng. & Systems 2025-03-27 Thomas Leguay, Théo Ladune, Pierrick Philippe, Olivier Deforges

Deep Learning-Based Real-Time Quality Control of Standard Video Compression for Live Streaming

Ensuring high-quality video content for wireless users has become increasingly vital. Nevertheless, maintaining a consistent level of video quality faces challenges due to the fluctuating encoded bitrate, primarily caused by dynamic video…

Image and Video Processing · Electrical Eng. & Systems 2023-11-23 Matin Mortaheb, Mohammad A. Amir Khojastepour, Srimat T. Chakradhar, Sennur Ulukus

Exploiting Latent Properties to Optimize Neural Codecs

End-to-end image and video codecs are becoming increasingly competitive, compared to traditional compression techniques that have been developed through decades of manual engineering efforts. These trainable codecs have many advantages over…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Muhammet Balcilar, Bharath Bhushan Damodaran, Karam Naser, Franck Galpin, Pierre Hellier

Standard compliant video coding using low complexity, switchable neural wrappers

The proliferation of high-resolution videos puts great storage and bandwidth pressure on cloud video services, driving the development of next-generation video codecs. Despite great progress made in neural video coding, existing approaches…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, Yao Wang

Neural Video Compression with Feature Modulation

The emerging conditional coding-based neural video codec (NVC) shows superiority over the commonly used residual coding-based codec, and the latest NVC already claims to outperform the best traditional codec. However, there still exist critical…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Jiahao Li, Bin Li, Yan Lu

Parallel Context Modeling for Sliding Window Attention in Neural Video Coding

Most neural video codecs rely on temporal conditioning, which makes them susceptible to error propagation over long sequences. While Transformer-based architectures like the VCT offer a drift-free alternative, they suffer from high…

Image and Video Processing · Electrical Eng. & Systems 2026-05-21 Alexander Kopte, André Kaup

Learned Scalable Video Coding For Humans and Machines

Video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital TV, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep…

Image and Video Processing · Electrical Eng. & Systems 2024-11-19 Hadi Hadizadeh, Ivan V. Bajić

Differentiable bit-rate estimation for neural-based video codec enhancement

Neural networks (NN) can improve standard video compression by pre- and post-processing the encoded video. For optimal NN training, the standard codec needs to be replaced with a codec proxy that can provide derivatives of estimated…

Image and Video Processing · Electrical Eng. & Systems 2023-01-25 Amir Said, Manish Kumar Singh, Reza Pourreza

Deep Video Codec Control for Vision Models

Standardized lossy video coding is at the core of almost all real-world video processing pipelines. Rate control is used to enable standard codecs to adapt to different network bandwidth conditions or storage constraints. However, standard…

Image and Video Processing · Electrical Eng. & Systems 2024-10-08 Christoph Reich, Biplob Debnath, Deep Patel, Tim Prangemeier, Daniel Cremers, Srimat Chakradhar

A Preprocessing Framework for Video Machine Vision under Compression

There has been a growing trend in compressing and transmitting videos from terminals for machine vision tasks. Nevertheless, most video coding optimization methods focus on minimizing distortion according to human perceptual metrics,…

Multimedia · Computer Science 2025-12-18 Fei Zhao, Mengxi Guo, Shijie Zhao, Junlin Li, Li Zhang, Xiaodong Xie

Improving The Reconstruction Quality by Overfitted Decoder Bias in Neural Image Compression

End-to-end trainable models have reached the performance of traditional handcrafted compression techniques on videos and images. Since the parameters of these models are learned over large training sets, they are not optimal for any given…

Image and Video Processing · Electrical Eng. & Systems 2022-10-12 Oussama Jourairi, Muhammet Balcilar, Anne Lambert, François Schnitzler

Effortless Cross-Platform Video Codec: A Codebook-Based Method

Under certain circumstances, advanced neural video codecs can surpass the most complex traditional codecs in their rate-distortion (RD) performance. One of the main reasons for the high performance of existing neural video codecs is the use…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Kuan Tian, Yonghang Guan, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang

Sandwiched Video Compression: Efficiently Extending the Reach of Standard Codecs with Neural Wrappers

We propose sandwiched video compression -- a video compression system that wraps neural networks around a standard video codec. The sandwich framework consists of a neural pre- and post-processor with a standard video codec between them.…

Image and Video Processing · Electrical Eng. & Systems 2023-07-07 Berivan Isik, Onur G. Guleryuz, Danhang Tang, Jonathan Taylor, Philip A. Chou

Device Interoperability for Learned Image Compression with Weights and Activations Quantization

Learning-based image compression has improved to a level where it can outperform traditional image codecs such as HEVC and VVC in terms of coding performance. In addition to good compression performance, device interoperability is essential…

Image and Video Processing · Electrical Eng. & Systems 2022-12-05 Esin Koyuncu, Timofey Solovyev, Elena Alshina, André Kaup

ELF-VC: Efficient Learned Flexible-Rate Video Coding

While learned video codecs have demonstrated great promise, they have yet to achieve sufficient efficiency for practical deployment. In this work, we propose several novel ideas for learned video compression which allow for improved…

Image and Video Processing · Electrical Eng. & Systems 2021-10-06 Oren Rippel, Alexander G. Anderson, Kedar Tatwawadi, Sanjay Nair, Craig Lytle, Lubomir Bourdev

A Coding Framework and Benchmark towards Low-Bitrate Video Understanding

Video compression is indispensable to most video analysis systems. Despite saving transportation bandwidth, it also deteriorates downstream video understanding tasks, especially at low-bitrate settings. To systematically investigate this…

Image and Video Processing · Electrical Eng. & Systems 2024-09-24 Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao

Real-Time Neural Video Compression with Unified Intra and Inter Coding

Neural video compression (NVC) technologies have advanced rapidly in recent years, yielding state-of-the-art schemes such as DCVC-RT that offer superior compression efficiency to H.266/VVC and real-time encoding/decoding capabilities.…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Hui Xiang, Yifan Bian, Li Li, Jingran Wu, Xianguo Zhang, Dong Liu

Delving Deeper into the Decoder for Video Captioning

Video captioning is an advanced multi-modal task which aims to describe a video clip using a natural language sentence. The encoder-decoder framework is the most popular paradigm for this task in recent years. However, there exist some…

Computer Vision and Pattern Recognition · Computer Science 2021-02-15 Haoran Chen, Jianmin Li, Xiaolin Hu

Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test Set

We introduce a video compression algorithm based on instance-adaptive learning. On each video sequence to be transmitted, we finetune a pretrained compression model. The optimal parameters are transmitted to the receiver along with the…

Image and Video Processing · Electrical Eng. & Systems 2023-06-26 Ties van Rozendaal, Johann Brehmer, Yunfan Zhang, Reza Pourreza, Auke Wiggers, Taco S. Cohen