English

Substring Density Estimation from Traces

Information Theory 2022-10-21 v1 Data Structures and Algorithms math.IT Probability Statistics Theory Statistics Theory

Abstract

In the trace reconstruction problem, one seeks to reconstruct a binary string ss from a collection of traces, each of which is obtained by passing ss through a deletion channel. It is known that exp(O~(n1/5))\exp(\tilde O(n^{1/5})) traces suffice to reconstruct any length-nn string with high probability. We consider a variant of the trace reconstruction problem where the goal is to recover a "density map" that indicates the locations of each length-kk substring throughout ss. We show that ϵ2poly(n)\epsilon^{-2}\cdot \text{poly}(n) traces suffice to recover the density map with error at most ϵ\epsilon. As a result, when restricted to a set of source strings whose minimum "density map distance" is at least 1/poly(n)1/\text{poly}(n), the trace reconstruction problem can be solved with polynomially many traces.

Keywords

Cite

@article{arxiv.2210.10917,
  title  = {Substring Density Estimation from Traces},
  author = {Kayvon Mazooji and Ilan Shomorony},
  journal= {arXiv preprint arXiv:2210.10917},
  year   = {2022}
}

Comments

22 pages, 3 figures