English

Codes Correcting a Single Long Duplication Error

Information Theory 2023-04-26 v1 math.IT

Abstract

We consider the problem of constructing a code capable of correcting a single long tandem duplication error of variable length. As the main contribution of this paper, we present a qq-ary efficiently encodable code of length n+1n+1 and redundancy 11 that can correct a single duplication of length at least K=4logqn+1K=4\cdot\lceil \log_q n\rceil +1. The complexity of encoding is O(n2logn)O(\frac{n^2}{\log n}) and the complexity of decoding is O(n)O(n). We also present a qq-ary non-efficient code of length n+1n+1 correcting single long duplication of length at least K=logqn+ϕ(n)K = \lceil \log_q n\rceil +\phi(n), where ϕ(n)\phi(n)\rightarrow{\infty} as nn\rightarrow{\infty}. This code has redundancy less than 11 for sufficiently large nn. Moreover, we show that in the class of codes correcting a single long duplication with redundancy 11, the value KK in our constructions is order-optimal.

Keywords

Cite

@article{arxiv.2304.12399,
  title  = {Codes Correcting a Single Long Duplication Error},
  author = {Daniil Goshkoder and Nikita Polyanskii and Ilya Vorobyev},
  journal= {arXiv preprint arXiv:2304.12399},
  year   = {2023}
}