A Simple Sublinear Algorithm for Gap Edit Distance

Data Structures and Algorithms 2020-07-29 v1

Abstract

We study the problem of estimating the edit distance between two $n$-character strings. While exact computation in the worst case is believed to require near-quadratic time, previous work showed that in certain regimes it is possible to solve the following {\em gap edit distance} problem in sublinear time: distinguish between inputs of distance $\le k$ and $>k^2$. Our main result is a very simple algorithm for this benchmark that runs in time $\tilde O(n/\sqrt{k})$, and in particular settles the open problem of obtaining truly sublinear time for the entire range of relevant $k$. Building on the same framework, we also obtain a $k$-vs-$k^2$ algorithm for the one-sided preprocessing model with $\tilde O(n)$ preprocessing time and $\tilde O(n/k)$ query time (improving over a recent $\tilde O(n/k+k^2)$-query time algorithm for the same problem [GRS'20]).
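To make the $k$-vs-$k^2$ promise concrete, the following is a minimal sketch of the decision problem only. It is not the paper's sublinear algorithm: it uses the textbook quadratic-time dynamic program as a reference oracle. The function names `edit_distance` and `gap_edit_distance` and the labels `"close"` and `"far"` are hypothetical, chosen for illustration.

```python
from typing import Optional


def edit_distance(a: str, b: str) -> int:
    """Exact edit distance via the classic O(len(a) * len(b)) dynamic program."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # deleting the first i characters of a
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # match or substitute
            ))
        prev = curr
    return prev[-1]


def gap_edit_distance(a: str, b: str, k: int) -> Optional[str]:
    """Reference answer for the k-vs-k^2 gap problem.

    Returns "close" if the distance is at most k, "far" if it exceeds k^2,
    and None inside the gap, where either answer would be acceptable.
    """
    d = edit_distance(a, b)
    if d <= k:
        return "close"
    if d > k * k:
        return "far"
    return None
```

A gap algorithm is only required to be correct on inputs where this reference returns `"close"` or `"far"`; that slack is what makes sublinear running times possible in the first place.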

Cite

@article{arxiv.2007.14368,
  title  = {A Simple Sublinear Algorithm for Gap Edit Distance},
  author = {Joshua Brakensiek and Moses Charikar and Aviad Rubinstein},
  journal= {arXiv preprint arXiv:2007.14368},
  year   = {2020}
}

Comments

26 pages