English

Faster Sublinear-Time Edit Distance

Data Structures and Algorithms 2023-12-05 v1

Abstract

We study the fundamental problem of approximating the edit distance of two strings. After an extensive line of research led to the development of a constant-factor approximation algorithm in almost-linear time, recent years have witnessed a notable shift in focus towards sublinear-time algorithms. Here, the task is typically formalized as the (k,K)(k, K)-gap edit distance problem: Distinguish whether the edit distance of two strings is at most kk or more than KK. Surprisingly, it is still possible to compute meaningful approximations in this challenging regime. Nevertheless, in almost all previous work, truly sublinear running time of O(n1ε)O(n^{1-\varepsilon}) (for a constant ε>0\varepsilon > 0) comes at the price of at least polynomial gap KknΩ(ε)K \ge k \cdot n^{\Omega(\varepsilon)}. Only recently, [Bringmann, Cassis, Fischer, and Nakos; STOC'22] broke through this barrier and solved the sub-polynomial (k,k1+o(1))(k, k^{1+o(1)})-gap edit distance problem in time O(n/k+k4+o(1))O(n/k + k^{4+o(1)}), which is truly sublinear if nΩ(1)kn14Ω(1)n^{\Omega(1)} \le k \le n^{\frac14-\Omega(1)}.The n/kn/k term is inevitable (already for Hamming distance), but it remains an important task to optimize the poly(k)\mathrm{poly}(k) term and, in general, solve the (k,k1+o(1))(k, k^{1+o(1)})-gap edit distance problem in sublinear-time for larger values of kk. In this work, we design an improved algorithm for the (k,k1+o(1))(k, k^{1+o(1)})-gap edit distance problem in sublinear time O(n/k+k2+o(1))O(n/k + k^{2+o(1)}), yielding a significant quadratic speed-up over the previous O(n/k+k4+o(1))O(n/k + k^{4+o(1)})-time algorithm. Notably, our algorithm is unconditionally almost-optimal (up to subpolynomial factors) in the regime where kn13k \leq n^{\frac13} and improves upon the state of the art for kn12o(1)k \leq n^{\frac12-o(1)}.

Keywords

Cite

@article{arxiv.2312.01759,
  title  = {Faster Sublinear-Time Edit Distance},
  author = {Karl Bringmann and Alejandro Cassis and Nick Fischer and Tomasz Kociumaka},
  journal= {arXiv preprint arXiv:2312.01759},
  year   = {2023}
}

Comments

To appear in SODA'24. Shortened abstract for arXiv