English

Space Efficient Deterministic Approximation of String Measures

Data Structures and Algorithms 2020-07-28 v4

Abstract

We study approximation algorithms for the following three string measures that are widely used in practice: edit distance (ED), longest common subsequence (LCS), and longest increasing sequence (LIS). All three problems can be solved exactly by standard algorithms that run in polynomial time with roughly Θ(n)\Theta(n) space, where nn is the input length, and our goal is to design deterministic approximation algorithms that run in polynomial time with significantly smaller space. Towards this, we design several algorithms that achieve 1+ϵ1+\epsilon or 1ϵ1-\epsilon approximation for all three problems, where ϵ>0\epsilon>0 can be any constant and even slightly sub constant. Our algorithms are flexible and can be adjusted to achieve the following two regimes of parameters: 1) space nδn^\delta for any constant δ>0\delta>0 with running time essentially the same as or slightly more than the standard algorithms; and 2) space polylog(n)\mathsf{polylog}(n) with (a larger) polynomial running time, which puts the approximation versions of the three problems in Steve's class (\textbf{SC}). Our algorithms significantly improve previous results in terms of space complexity, where all known results need to use space at least Ω(n)\Omega(\sqrt{n}). Some of our algorithms can also be adapted to work in the asymmetric streaming model [SS13], and output the corresponding sequence. Furthermore, our results can be used to improve a recent result by Farhadi et. al. [FHRS20] about approximating ED in the asymmetric streaming model, reducing the running time from being exponential in [FHRS20] to a polynomial. Our algorithms are based on the idea of using recursion as in Savitch's theorem [Sav70], and a careful adaption of previous techniques to make the recursion work. Along the way we also give a new logspace reduction from longest common subsequence to longest increasing sequence, which may be of independent interest.

Keywords

Cite

@article{arxiv.2002.08498,
  title  = {Space Efficient Deterministic Approximation of String Measures},
  author = {Kuan Cheng and Zhengzhong Jin and Xin Li and Yu Zheng},
  journal= {arXiv preprint arXiv:2002.08498},
  year   = {2020}
}