English

A DNA Sequence Compression Algorithm Based on LUT and LZ77

Information Theory 2007-07-16 v6 math.IT

Abstract

This article introduces a new DNA sequence compression algorithm which is based on LUT and LZ77 algorithm. Combined a LUT-based pre-coding routine and LZ77 compression routine,this algorithm can approach a compression ratio of 1.9bits \slash base and even lower.The biggest advantage of this algorithm is fast execution, small memory occupation and easy implementation.

Cite

@article{arxiv.cs/0504100,
  title  = {A DNA Sequence Compression Algorithm Based on LUT and LZ77},
  author = {Sheng Bao and Shi Chen and Zhiqiang Jing and Ran Ren},
  journal= {arXiv preprint arXiv:cs/0504100},
  year   = {2007}
}

Comments

13 pages,3 tables,with source code of our algorithm