English

Improved Sublinear-Time Edit Distance for Preprocessed Strings

Data Structures and Algorithms 2022-10-13 v1

Abstract

We study the problem of approximating the edit distance of two strings in sublinear time, in a setting where one or both string(s) are preprocessed, as initiated by Goldenberg, Rubinstein, Saha (STOC '20). Specifically, in the (k,K)(k, K)-gap edit distance problem, the goal is to distinguish whether the edit distance of two strings is at most kk or at least KK. We obtain the following results: * After preprocessing one string in time n1+o(1)n^{1+o(1)}, we can solve (k,kno(1))(k, k \cdot n^{o(1)})-gap edit distance in time (n/k+k)no(1)(n/k + k) \cdot n^{o(1)}. * After preprocessing both strings separately in time n1+o(1)n^{1+o(1)}, we can solve (k,kno(1))(k, k \cdot n^{o(1)})-gap edit distance in time kno(1)k \cdot n^{o(1)}. Both results improve upon some previously best known result, with respect to either the gap or the query time or the preprocessing time. Our algorithms build on the framework by Andoni, Krauthgamer and Onak (FOCS '10) and the recent sublinear-time algorithm by Bringmann, Cassis, Fischer and Nakos (STOC '22). We replace many complicated parts in their algorithm by faster and simpler solutions which exploit the preprocessing.

Keywords

Cite

@article{arxiv.2204.14137,
  title  = {Improved Sublinear-Time Edit Distance for Preprocessed Strings},
  author = {Karl Bringmann and Alejandro Cassis and Nick Fischer and Vasileios Nakos},
  journal= {arXiv preprint arXiv:2204.14137},
  year   = {2022}
}

Comments

Appears at ICALP '22