MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression

Wei Jiang; Jiayu Yang; Yongqi Zhai; Feng Gao; Ronggang Wang

doi:10.1145/3719011

MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression

Image and Video Processing 2025-10-29 v11 Computer Vision and Pattern Recognition

Authors: Wei Jiang , Jiayu Yang , Yongqi Zhai , Feng Gao , Ronggang Wang

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

The latent representation in learned image compression encompasses channel-wise, local spatial, and global spatial correlations, which are essential for the entropy model to capture for conditional entropy minimization. Efficiently capturing these contexts within a single entropy model, especially in high-resolution image coding, presents a challenge due to the computational complexity of existing global context modules. To address this challenge, we propose the Linear Complexity Multi-Reference Entropy Model (MEM $^{++}$ ). Specifically, the latent representation is partitioned into multiple slices. For channel-wise contexts, previously compressed slices serve as the context for compressing a particular slice. For local contexts, we introduce a shifted-window-based checkerboard attention module. This module ensures linear complexity without sacrificing performance. For global contexts, we propose a linear complexity attention mechanism. It captures global correlations by decomposing the softmax operation, enabling the implicit computation of attention maps from previously decoded slices. Using MEM $^{++}$ as the entropy model, we develop the image compression method MLIC $^{++}$ . Extensive experimental results demonstrate that MLIC $^{++}$ achieves state-of-the-art performance, reducing BD-rate by $13.39\%$ on the Kodak dataset compared to VTM-17.0 in Peak Signal-to-Noise Ratio (PSNR). Furthermore, MLIC $^{++}$ exhibits linear computational complexity and memory consumption with resolution, making it highly suitable for high-resolution image coding. Code and pre-trained models are available at https://github.com/JiangWeibeta/MLIC. Training dataset is available at https://huggingface.co/datasets/Whiteboat/MLIC-Train-100K.

Keywords

learned image compression medical and hyperspectral imaging deep learning for detection and classification

Cite

@article{arxiv.2307.15421,
  title  = {MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression},
  author = {Wei Jiang and Jiayu Yang and Yongqi Zhai and Feng Gao and Ronggang Wang},
  journal= {arXiv preprint arXiv:2307.15421},
  year   = {2025}
}

Comments

Accepted to ICML 2023 Neural Compression Workshop and ACM Transactions on Multimedia Computing, Communications, and Applications 2025

MLIC++: Linear Complexity Multi-Reference Entropy Modeling for Learned Image Compression

Abstract

Keywords

Cite

Comments

Related papers