English

Coding for Sequence Reconstruction for Single Edits

Information Theory 2020-06-16 v2 math.IT

Abstract

The sequence reconstruction problem, introduced by Levenshtein in 2001, considers a communication scenario where the sender transmits a codeword from some codebook and the receiver obtains multiple noisy reads of the codeword. The common setup assumes the codebook to be the entire space and the problem is to determine the minimum number of distinct reads that is required to reconstruct the transmitted codeword. Motivated by modern storage devices, we study a variant of the problem where the number of noisy reads NN is fixed. Specifically, we design reconstruction codes that reconstruct a codeword from NN distinct noisy reads. We focus on channels that introduce single edit error (i.e. a single substitution, insertion, or deletion) and their variants, and design reconstruction codes for all values of NN. In particular, for the case of a single edit, we show that as the number of noisy reads increases, the number of redundant bits required can be gracefully reduced from logn+O(1)\log n+O(1) to loglogn+O(1)\log \log n+O(1), and then to O(1)O(1), where nn denotes the length of a codeword. We also show that the redundancy of certain reconstruction codes is within one bit of optimality.

Keywords

Cite

@article{arxiv.2001.01376,
  title  = {Coding for Sequence Reconstruction for Single Edits},
  author = {Kui Cai and Han Mao Kiah and Tuan Thanh Nguyen and Eitan Yaakobi},
  journal= {arXiv preprint arXiv:2001.01376},
  year   = {2020}
}
R2 v1 2026-06-23T13:03:28.474Z