English

Efficient Low-Redundancy Codes for Correcting Multiple Deletions

Information Theory 2019-05-21 v2 Discrete Mathematics Data Structures and Algorithms math.IT

Abstract

We consider the problem of constructing binary codes to recover from kk-bit deletions with efficient encoding/decoding, for a fixed kk. The single deletion case is well understood, with the Varshamov-Tenengolts-Levenshtein code from 1965 giving an asymptotically optimal construction with 2n/n\approx 2^n/n codewords of length nn, i.e., at most logn\log n bits of redundancy. However, even for the case of two deletions, there was no known explicit construction with redundancy less than nΩ(1)n^{\Omega(1)}. For any fixed kk, we construct a binary code with cklognc_k \log n redundancy that can be decoded from kk deletions in Ok(nlog4n)O_k(n \log^4 n) time. The coefficient ckc_k can be taken to be O(k2logk)O(k^2 \log k), which is only quadratically worse than the optimal, non-constructive bound of O(k)O(k). We also indicate how to modify this code to allow for a combination of up to kk insertions and deletions.

Keywords

Cite

@article{arxiv.1507.06175,
  title  = {Efficient Low-Redundancy Codes for Correcting Multiple Deletions},
  author = {Joshua Brakensiek and Venkatesan Guruswami and Samuel Zbarsky},
  journal= {arXiv preprint arXiv:1507.06175},
  year   = {2019}
}

Comments

The published version of this paper claimed in an appendix a rate limitation of linear deletion codes. This claim is false and has been retracted in this version

R2 v1 2026-06-22T10:16:26.968Z