English

Low-Complexity Vector Source Coding for Discrete Long Sequences with Unknown Distributions

Information Theory 2024-10-28 v1 math.IT

Abstract

In this paper, we propose a source coding scheme that represents data from unknown distributions through frequency and support information. Existing encoding schemes often compress data by sacrificing computational efficiency or by assuming the data follows a known distribution. We take advantage of the structure that arises within the spatial representation and utilize it to encode run-lengths within this representation using Golomb coding. Through theoretical analysis, we show that our scheme yields an overall bit rate that nears entropy without a computationally complex encoding algorithm and verify these results through numerical experiments.

Keywords

Cite

@article{arxiv.2309.05633,
  title  = {Low-Complexity Vector Source Coding for Discrete Long Sequences with Unknown Distributions},
  author = {Leah Woldemariam and Hang Liu and Anna Scaglione},
  journal= {arXiv preprint arXiv:2309.05633},
  year   = {2024}
}

Comments

This work has been submitted to the IEEE for possible publication