English

Optical Flow and Mode Selection for Learning-based Video Coding

Image and Video Processing 2020-08-07 v1 Computer Vision and Pattern Recognition Neural and Evolutionary Computing

Abstract

This paper introduces a new method for inter-frame coding based on two complementary autoencoders: MOFNet and CodecNet. MOFNet aims at computing and conveying the Optical Flow and a pixel-wise coding Mode selection. The optical flow is used to perform a prediction of the frame to code. The coding mode selection enables competition between direct copy of the prediction or transmission through CodecNet. The proposed coding scheme is assessed under the Challenge on Learned Image Compression 2020 (CLIC20) P-frame coding conditions, where it is shown to perform on par with the state-of-the-art video codec ITU/MPEG HEVC. Moreover, the possibility of copying the prediction enables to learn the optical flow in an end-to-end fashion i.e. without relying on pre-training and/or a dedicated loss term.

Keywords

Cite

@article{arxiv.2008.02580,
  title  = {Optical Flow and Mode Selection for Learning-based Video Coding},
  author = {Théo Ladune and Pierrick Philippe and Wassim Hamidouche and Lu Zhang and Olivier Déforges},
  journal= {arXiv preprint arXiv:2008.02580},
  year   = {2020}
}

Comments

MMSP 2020, IEEE 22nd International Workshop on Multimedia Signal Processing, Sep 2020, Tampere, Finland

R2 v1 2026-06-23T17:40:45.221Z