English

Neural Multi-scale Image Compression

Machine Learning 2018-05-17 v1 Computer Vision and Pattern Recognition Machine Learning

Abstract

This study presents a new lossy image compression method that utilizes the multi-scale features of natural images. Our model consists of two networks: multi-scale lossy autoencoder and parallel multi-scale lossless coder. The multi-scale lossy autoencoder extracts the multi-scale image features to quantized variables and the parallel multi-scale lossless coder enables rapid and accurate lossless coding of the quantized variables via encoding/decoding the variables in parallel. Our proposed model achieves comparable performance to the state-of-the-art model on Kodak and RAISE-1k dataset images, and it encodes a PNG image of size 768×512768 \times 512 in 70 ms with a single GPU and a single CPU process and decodes it into a high-fidelity image in approximately 200 ms.

Keywords

Cite

@article{arxiv.1805.06386,
  title  = {Neural Multi-scale Image Compression},
  author = {Ken Nakanishi and Shin-ichi Maeda and Takeru Miyato and Daisuke Okanohara},
  journal= {arXiv preprint arXiv:1805.06386},
  year   = {2018}
}

Comments

15 pages, 15 figures