Efficient Integer Retrieving from Unordered Compressed Sequences

Information Theory 2023-02-14 v1 math.IT

Abstract

Variable-length Reverse Multi-Delimiter (RMD) codes are known to represent sequences of unbounded, unordered integers. When applied to data compression, they combine a good compression ratio with fast decoding. In this paper, we investigate another property of RMD codes: direct access to codewords in the encoded bitstream. We present a method that extracts and decodes a codeword from an RMD bitstream in almost constant time with a tiny space overhead, and we report experiments on its application to natural language text compression.
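To make the direct-access problem concrete, the sketch below shows a generic baseline, not the method of the paper: integers are stored with a variable-length code, and the bit offset of every `step`-th codeword is sampled so that the i-th integer can be reached without decoding from the start. Elias gamma is used only as a stand-in for RMD codes, whose construction is not given in the abstract, and all names (`gamma_encode`, `gamma_decode_at`, `SampledStream`, `step`) are hypothetical.

```python
from typing import List, Tuple


def gamma_encode(n: int) -> str:
    """Elias gamma code of n >= 1 as a bit string (stand-in code, not RMD)."""
    if n < 1:
        raise ValueError("Elias gamma encodes positive integers only")
    binary = bin(n)[2:]
    # Prefix of (length - 1) zeros, then the binary representation itself.
    return "0" * (len(binary) - 1) + binary


def gamma_decode_at(bits: str, pos: int) -> Tuple[int, int]:
    """Decode one codeword starting at bit offset pos; return (value, next_pos)."""
    zeros = 0
    while bits[pos + zeros] == "0":
        zeros += 1
    end = pos + 2 * zeros + 1
    return int(bits[pos + zeros:end], 2), end


class SampledStream:
    """Variable-length bitstream with sampled offsets (baseline assumption)."""

    def __init__(self, values: List[int], step: int = 4) -> None:
        self.step = step
        self.count = len(values)
        self.samples: List[int] = []  # bit offset of every step-th codeword
        parts: List[str] = []
        offset = 0
        for i, v in enumerate(values):
            if i % step == 0:
                self.samples.append(offset)
            code = gamma_encode(v)
            parts.append(code)
            offset += len(code)
        self.bits = "".join(parts)

    def access(self, i: int) -> int:
        """Return the i-th integer, decoding at most `step` codewords."""
        if not 0 <= i < self.count:
            raise IndexError(i)
        pos = self.samples[i // self.step]
        # Skip the codewords between the sampled position and the target.
        for _ in range(i % self.step):
            _, pos = gamma_decode_at(self.bits, pos)
        value, _ = gamma_decode_at(self.bits, pos)
        return value


values = [5, 1, 17, 3, 9, 2, 1, 40, 6]
stream = SampledStream(values, step=4)
recovered = [stream.access(i) for i in range(len(values))]
```

In this baseline, a smaller `step` means fewer codewords decoded per access but more stored offsets, which is the time and space trade-off that a direct-access scheme aims to improve on.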

Cite

@article{arxiv.2302.05869,
  title  = {Efficient Integer Retrieving from Unordered Compressed Sequences},
  author = {Igor O. Zavadskyi},
  journal= {arXiv preprint arXiv:2302.05869},
  year   = {2023}
}

Comments

18 pages, 5 figures, 2 algorithms, 1 table