Asymmetric Encoding-Decoding Schemes for Lossless Data Compression

Information Theory 2026-01-26 v1 math.IT

Abstract

This paper proposes a new lossless data compression coding scheme, the asymmetric encoding-decoding scheme (AEDS), which can be regarded as a generalization of tANS (the tabled variant of asymmetric numeral systems). In the AEDS, a data sequence $\mathbf{s}=s_1s_2\cdots s_n$ is encoded in backward order $s_t$, $t=n,\cdots,2,1$, while $\mathbf{s}$ is decoded in forward order $s_t$, $t=1,2,\cdots,n$, in the same way as in tANS. However, the code class of the AEDS is much broader than that of tANS. We show for i.i.d. sources that an AEDS with 2 states (resp. 5 states) can attain a shorter average code length than the Huffman code if a child of the root in the Huffman code tree has a probability weight larger than 0.61803 (resp. 0.56984). Furthermore, we derive several upper bounds on the average code length of the AEDS, which also hold for tANS, and we show that the average code length of the optimal AEDS and tANS with $N$ states converges to the source entropy with speed $O(1/N)$ as $N$ increases.
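
The root-child condition stated above can be checked numerically for a given source. The following is a minimal sketch, not the authors' construction: it builds an ordinary Huffman code for an i.i.d. source, reports its average code length and the source entropy, and tests whether the heavier child of the root exceeds the two thresholds quoted in the abstract. The function names `huffman_stats` and `check_abstract_condition` are hypothetical, and the sketch does not construct an AEDS; it only evaluates the stated sufficient condition.

```python
import heapq
import math

# Thresholds quoted in the abstract for the probability weight of a root child.
THRESHOLD_2_STATES = 0.61803
THRESHOLD_5_STATES = 0.56984


def huffman_stats(probs):
    """Return (average code length, entropy, (w1, w2)) for a Huffman code.

    (w1, w2) are the probability weights of the two children of the root.
    Hypothetical helper for illustration; requires at least two symbols.
    """
    if len(probs) < 2:
        raise ValueError("need at least two symbols")
    # Heap entries: (weight, unique tie-breaker, symbol indices in the subtree).
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    next_id = len(probs)
    root_children = None
    while len(heap) > 1:
        w1, _, s1 = heapq.heappop(heap)
        w2, _, s2 = heapq.heappop(heap)
        # Every symbol under the merged node gets one more code bit.
        for i in s1 + s2:
            lengths[i] += 1
        if not heap:
            # The last merge joins the two children of the root.
            root_children = (w1, w2)
        heapq.heappush(heap, (w1 + w2, next_id, s1 + s2))
        next_id += 1
    avg = sum(p * l for p, l in zip(probs, lengths))
    entropy = -sum(p * math.log2(p) for p in probs if p > 0)
    return avg, entropy, root_children


def check_abstract_condition(probs):
    """Evaluate the sufficient condition from the abstract (assumed reading)."""
    avg, entropy, (w1, w2) = huffman_stats(probs)
    heavier = max(w1, w2)
    return {
        "huffman_avg_length": avg,
        "entropy": entropy,
        "heavier_root_child": heavier,
        "two_state_condition": heavier > THRESHOLD_2_STATES,
        "five_state_condition": heavier > THRESHOLD_5_STATES,
    }
```

For example, `check_abstract_condition([0.7, 0.2, 0.1])` gives a Huffman average code length of about 1.3 bits with a heavier root child of weight 0.7, so both conditions hold; for the uniform source `[0.25, 0.25, 0.25, 0.25]` the root children each weigh 0.5 and neither condition holds.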

Cite

@article{arxiv.2601.10991,
  title  = {Asymmetric Encoding-Decoding Schemes for Lossless Data Compression},
  author = {Hirosuke Yamamoto and Ken-ichi Iwata},
  journal= {arXiv preprint arXiv:2601.10991},
  year   = {2026}
}

Comments

24 pages, 19 figures; submitted to the IEEE Transactions on Information Theory