MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression
Abstract
The latent representation in learned image compression encompasses channel-wise, local spatial, and global spatial correlations, which are essential for the entropy model to capture for conditional entropy minimization. Efficiently capturing these contexts within a single entropy model, especially in high-resolution image coding, presents a challenge due to the computational complexity of existing global context modules. To address this challenge, we propose the Linear Complexity Multi-Reference Entropy Model (MEM). Specifically, the latent representation is partitioned into multiple slices. For channel-wise contexts, previously compressed slices serve as the context for compressing a particular slice. For local contexts, we introduce a shifted-window-based checkerboard attention module. This module ensures linear complexity without sacrificing performance. For global contexts, we propose a linear complexity attention mechanism. It captures global correlations by decomposing the softmax operation, enabling the implicit computation of attention maps from previously decoded slices. Using MEM as the entropy model, we develop the image compression method MLIC. Extensive experimental results demonstrate that MLIC achieves state-of-the-art performance, reducing BD-rate by on the Kodak dataset compared to VTM-17.0 in Peak Signal-to-Noise Ratio (PSNR). Furthermore, MLIC exhibits linear computational complexity and memory consumption with resolution, making it highly suitable for high-resolution image coding. Code and pre-trained models are available at https://github.com/JiangWeibeta/MLIC. Training dataset is available at https://huggingface.co/datasets/Whiteboat/MLIC-Train-100K.
Keywords
Cite
@article{arxiv.2307.15421,
title = {MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression},
author = {Wei Jiang and Jiayu Yang and Yongqi Zhai and Feng Gao and Ronggang Wang},
journal= {arXiv preprint arXiv:2307.15421},
year = {2025}
}
Comments
Accepted to ICML 2023 Neural Compression Workshop and ACM Transactions on Multimedia Computing, Communications, and Applications 2025