Fast Encoding and Decoding for Implicit Video Representation

Hao Chen; Saining Xie; Ser-Nam Lim; Abhinav Shrivastava

Fast Encoding and Decoding for Implicit Video Representation

Computer Vision and Pattern Recognition 2024-10-16 v2

Authors: Hao Chen , Saining Xie , Ser-Nam Lim , Abhinav Shrivastava

Abstract

Despite the abundant availability and content richness for video data, its high-dimensionality poses challenges for video research. Recent advancements have explored the implicit representation for videos using neural networks, demonstrating strong performance in applications such as video compression and enhancement. However, the prolonged encoding time remains a persistent challenge for video Implicit Neural Representations (INRs). In this paper, we focus on improving the speed of video encoding and decoding within implicit representations. We introduce two key components: NeRV-Enc, a transformer-based hyper-network for fast encoding; and NeRV-Dec, a parallel decoder for efficient video loading. NeRV-Enc achieves an impressive speed-up of $\mathbf{10^4\times}$ by eliminating gradient-based optimization. Meanwhile, NeRV-Dec simplifies video decoding, outperforming conventional codecs with a loading speed $\mathbf{11\times}$ faster, and surpassing RAM loading with pre-decoded videos ( $\mathbf{2.5\times}$ faster while being $\mathbf{65\times}$ smaller in size).

Keywords

encoder-decoder architecture image compression video retrieval

Cite

@article{arxiv.2409.19429,
  title  = {Fast Encoding and Decoding for Implicit Video Representation},
  author = {Hao Chen and Saining Xie and Ser-Nam Lim and Abhinav Shrivastava},
  journal= {arXiv preprint arXiv:2409.19429},
  year   = {2024}
}

Comments

ECCV 2024. Project page at https://haochen-rye.github.io/FastNeRV/, code will be at https://github.com/haochen-rye/FastNeRV

Fast Encoding and Decoding for Implicit Video Representation

Abstract

Keywords

Cite

Comments

Related papers