English

Faster subsequence recognition in compressed strings

Data Structures and Algorithms 2011-11-10 v4 Computational Complexity Discrete Mathematics

Abstract

Computation on compressed strings is one of the key approaches to processing massive data sets. We consider local subsequence recognition problems on strings compressed by straight-line programs (SLP), which is closely related to Lempel--Ziv compression. For an SLP-compressed text of length mˉ\bar m, and an uncompressed pattern of length nn, C{\'e}gielski et al. gave an algorithm for local subsequence recognition running in time O(mˉn2logn)O(\bar mn^2 \log n). We improve the running time to O(mˉn1.5)O(\bar mn^{1.5}). Our algorithm can also be used to compute the longest common subsequence between a compressed text and an uncompressed pattern in time O(mˉn1.5)O(\bar mn^{1.5}); the same problem with a compressed pattern is known to be NP-hard.

Keywords

Cite

@article{arxiv.0707.3407,
  title  = {Faster subsequence recognition in compressed strings},
  author = {Alexander Tiskin},
  journal= {arXiv preprint arXiv:0707.3407},
  year   = {2011}
}
R2 v1 2026-06-21T09:00:55.136Z