English

On the Palindromic/Reverse-Complement Duplication Correcting Codes

Information Theory 2026-02-03 v1 math.IT

Abstract

Motivated by applications in in-vivo DNA storage, we study codes for correcting duplications. A reverse-complement duplication of length kk is the insertion of the reversed and complemented copy of a substring of length kk adjacent to its original position, while a palindromic duplication only inserts the reversed copy without complementation. We first construct an explicit code with a single redundant symbol capable of correcting an arbitrary number of reverse-complement duplications (respectively, palindromic duplications), provided that all duplications have length k3logqnk \ge 3\lceil \log_q n \rceil and are disjoint. Next, we derive a Gilbert-Varshamov bound for codes that can correct a reverse-complement duplication (respectively, palindromic duplication) of arbitrary length, showing that the optimal redundancy is upper bounded by 2logqn+logqlogqn+O(1)2\log_q n + \log_q\log_q n + O(1). Finally, for q4q \ge 4, we present two explicit constructions of codes that can correct tt length-one reverse-complement duplications. The first construction achieves a redundancy of 2tlogqn+O(logqlogqn)2t\log_q n + O(\log_q\log_q n) with encoding complexity O(n)O(n) and decoding complexity O(n(log2n)4)O\big(n(\log_2 n)^4\big). The second construction achieves an improved redundancy of (2t1)logqn+O(logqlogqn)(2t-1)\log_q n + O(\log_q\log_q n), but with encoding and decoding complexities of O(npoly(log2n))O\big(n \cdot \mathrm{poly}(\log_2 n)\big).

Keywords

Cite

@article{arxiv.2602.01151,
  title  = {On the Palindromic/Reverse-Complement Duplication Correcting Codes},
  author = {Yubo Sun and Gennian Ge},
  journal= {arXiv preprint arXiv:2602.01151},
  year   = {2026}
}
R2 v1 2026-07-01T09:30:05.557Z