English

Deterministic Document Exchange Protocols, and Almost Optimal Binary Codes for Edit Errors

Data Structures and Algorithms 2018-07-18 v4

Abstract

We study two basic problems regarding edit error, i.e. document exchange and error correcting codes for edit errors (insdel codes). For message length nn and edit error upper bound kk, it is known that in both problems the optimal sketch size or the optimal number of redundant bits is Θ(klognk)\Theta(k \log \frac{n}{k}). However, known constructions are far from achieving these bounds. We significantly improve previous results on both problems. For document exchange, we give an efficient deterministic protocol with sketch size O(klog2nk)O(k\log^2 \frac{n}{k}). This significantly improves the previous best known deterministic protocol, which has sketch size O(k2+klog2n)O(k^2 + k \log^2 n) (Belazzougui15). For binary insdel codes, we obtain the following results: 1. An explicit binary insdel code which encodes an nn-bit message xx against kk errors with redundancy O(klog2nk)O(k \log^2 \frac{n}{k}). In particular this implies an explicit family of binary insdel codes that can correct ε\varepsilon fraction of insertions and deletions with rate 1O(εlog2(1ε))=1O~(ε)1-O(\varepsilon \log^2 (\frac{1}{\varepsilon}))=1-\widetilde{O}(\varepsilon). 2. An explicit binary insdel code which encodes an nn-bit message xx against kk errors with redundancy O(klogn)O(k \log n). This is the first explicit construction of binary insdel codes that has optimal redundancy for a wide range of error parameters kk, and this brings our understanding of binary insdel codes much closer to that of standard binary error correcting codes. In obtaining our results we introduce the notion of \emph{ε\varepsilon-self matching hash functions} and \emph{ε\varepsilon-synchronization hash functions}. We believe our techniques can have further applications in the literature.

Keywords

Cite

@article{arxiv.1804.05776,
  title  = {Deterministic Document Exchange Protocols, and Almost Optimal Binary Codes for Edit Errors},
  author = {Kuan Cheng and Zhengzhong Jin and Xin Li and Ke Wu},
  journal= {arXiv preprint arXiv:1804.05776},
  year   = {2018}
}

Comments

34 pages, 2 figures