English

Variable Rate Learned Wavelet Video Coding using Temporal Layer Adaptivity

Image and Video Processing 2025-06-11 v3

Abstract

Learned wavelet video coders provide an explainable framework by performing discrete wavelet transforms in temporal, horizontal, and vertical dimensions. With a temporal transform based on motion-compensated temporal filtering (MCTF), spatial and temporal scalability is obtained. In this paper, we introduce variable rate support and a mechanism for quality adaption to different temporal layers for a higher coding efficiency. Moreover, we propose a multi-stage training strategy that allows training with multiple temporal layers. Our experiments demonstrate Bj{\o}ntegaard Delta bitrate savings of at least -32% compared to a learned MCTF model without these extensions. Training and inference code is available at: https://github.com/FAU-LMS/Learned-pMCTF.

Keywords

Cite

@article{arxiv.2410.15873,
  title  = {Variable Rate Learned Wavelet Video Coding using Temporal Layer Adaptivity},
  author = {Anna Meyer and André Kaup},
  journal= {arXiv preprint arXiv:2410.15873},
  year   = {2025}
}

Comments

6 pages, 5 figures, ICIP2025