English

Parameterized Pattern Matching -- Succinctly

Data Structures and Algorithms 2016-04-07 v2

Abstract

We consider the ParameterizedParameterized PatternPattern MatchingMatching problem, where a pattern PP matches some location in a text T\mathsf{T} iff there is a one-to-one correspondence between the alphabet symbols of the pattern to those of the text. More specifically, assume that the text T\mathsf{T} contains nn characters from a static alphabet Σs\Sigma_s and a parameterized alphabet Σp\Sigma_p, where ΣsΣp=\Sigma_s \cap \Sigma_p = \varnothing and ΣsΣp=σ|\Sigma_s \cup \Sigma_p|=\sigma. A pattern PP matches a substring SS of T\mathsf{T} iff the static characters match exactly, and there exists a one-to-one function that renames the parameterized characters in SS to that in PP. Previous indexing solution [Baker, STOC 1993], known as ParameterizedParameterized SuffixSuffix TreeTree, requires Θ(nlogn)\Theta(n\log n) bits of space, and can find all occocc occurrences of PP in O(Plogσ+occ)\mathcal{O}(|P|\log \sigma+ occ) time. In this paper, we present the first succinct index that occupies nlogσ+O(n)n \log \sigma + \mathcal{O}(n) bits and answers queries in O((P+occlogn)logσloglogσ)\mathcal{O}((|P|+ occ\cdot \log n) \log\sigma\log \log \sigma) time. We also present a compact index that occupies O(nlogσ)\mathcal{O}(n\log\sigma) bits and answers queries in O(Plogσ+occlogn)\mathcal{O}(|P|\log \sigma+ occ\cdot \log n) time. Furthermore, the techniques are extended to obtain the first succinct representation of the index of Shibuya for StructuralStructural MatchingMatching [SWAT, 2000], and of Idury and Sch\"{a}ffer for ParameterizedParameterized DictionaryDictionary MatchingMatching [CPM, 1994].

Keywords

Cite

@article{arxiv.1603.07457,
  title  = {Parameterized Pattern Matching -- Succinctly},
  author = {Arnab Ganguly and Rahul Shah and Sharma V. Thankachan},
  journal= {arXiv preprint arXiv:1603.07457},
  year   = {2016}
}
R2 v1 2026-06-22T13:17:42.167Z