Efficiently Approximating Edit Distance Between Pseudorandom Strings
Abstract
We present an algorithm for approximating the edit distance between two strings and in time parameterized by the degree to which one of the strings satisfies a natural pseudorandomness property. The pseudorandomness model is asymmetric in that no requirements are placed on the second string , which may be constructed by an adversary with full knowledge of . We say that is \emph{-pseudorandom} if all pairs and of disjoint -letter substrings of satisfy . Given parameters and , our algorithm computes the edit distance between a -pseudorandom string and an arbitrary string within a factor of in time , with high probability. Our algorithm is robust in the sense that it can handle a small portion of being adversarial (i.e., not satisfying the pseudorandomness property). In this case, the algorithm incurs an additive approximation error proportional to the fraction of which behaves maliciously. The asymmetry of our pseudorandomness model has particular appeal for the case where is a \emph{source string}, meaning that will be computed for many strings . Suppose that one wishes to achieve an -approximation for each computation, and that is the smallest block-size for which the string is -pseudorandom. We show that without knowing beforehand, may be preprocessed in time , so that all future computations of the form may be -approximated in time . Furthermore, for the special case where only a single computation will be performed, we show how to achieve an -approximation in time .
Cite
@article{arxiv.1811.04300,
title = {Efficiently Approximating Edit Distance Between Pseudorandom Strings},
author = {William Kuszmaul},
journal= {arXiv preprint arXiv:1811.04300},
year = {2018}
}
Comments
SODA 2019