English

Hybrid Model-based / Data-driven Graph Transform for Image Coding

Image and Video Processing 2022-03-03 v1 Machine Learning Signal Processing

Abstract

Transform coding to sparsify signal representations remains crucial in an image compression pipeline. While the Karhunen-Lo\`{e}ve transform (KLT) computed from an empirical covariance matrix Cˉ\bar{C} is theoretically optimal for a stationary process, in practice, collecting sufficient statistics from a non-stationary image to reliably estimate Cˉ\bar{C} can be difficult. In this paper, to encode an intra-prediction residual block, we pursue a hybrid model-based / data-driven approach: the first KK eigenvectors of a transform matrix are derived from a statistical model, e.g., the asymmetric discrete sine transform (ADST), for stability, while the remaining NKN-K are computed from Cˉ\bar{C} for performance. The transform computation is posed as a graph learning problem, where we seek a graph Laplacian matrix minimizing a graphical lasso objective inside a convex cone sharing the first KK eigenvectors in a Hilbert space of real symmetric matrices. We efficiently solve the problem via augmented Lagrangian relaxation and proximal gradient (PG). Using WebP as a baseline image codec, experimental results show that our hybrid graph transform achieved better energy compaction than default discrete cosine transform (DCT) and better stability than KLT.

Keywords

Cite

@article{arxiv.2203.01186,
  title  = {Hybrid Model-based / Data-driven Graph Transform for Image Coding},
  author = {Saghar Bagheri and Tam Thuc Do and Gene Cheung and Antonio Ortega},
  journal= {arXiv preprint arXiv:2203.01186},
  year   = {2022}
}