Hybrid Model-based / Data-driven Graph Transform for Image Coding
Abstract
Transform coding to sparsify signal representations remains crucial in an image compression pipeline. While the Karhunen-Lo\`{e}ve transform (KLT) computed from an empirical covariance matrix is theoretically optimal for a stationary process, in practice, collecting sufficient statistics from a non-stationary image to reliably estimate can be difficult. In this paper, to encode an intra-prediction residual block, we pursue a hybrid model-based / data-driven approach: the first eigenvectors of a transform matrix are derived from a statistical model, e.g., the asymmetric discrete sine transform (ADST), for stability, while the remaining are computed from for performance. The transform computation is posed as a graph learning problem, where we seek a graph Laplacian matrix minimizing a graphical lasso objective inside a convex cone sharing the first eigenvectors in a Hilbert space of real symmetric matrices. We efficiently solve the problem via augmented Lagrangian relaxation and proximal gradient (PG). Using WebP as a baseline image codec, experimental results show that our hybrid graph transform achieved better energy compaction than default discrete cosine transform (DCT) and better stability than KLT.
Cite
@article{arxiv.2203.01186,
title = {Hybrid Model-based / Data-driven Graph Transform for Image Coding},
author = {Saghar Bagheri and Tam Thuc Do and Gene Cheung and Antonio Ortega},
journal= {arXiv preprint arXiv:2203.01186},
year = {2022}
}