English

Computing Covers under Substring Consistent Equivalence Relations

Data Structures and Algorithms 2020-07-31 v2

Abstract

Covers are a kind of quasiperiodicity in strings. A string CC is a cover of another string TT if any position of TT is inside some occurrence of CC in TT. The shortest and longest cover arrays of TT have the lengths of the shortest and longest covers of each prefix of TT, respectively. The literature has proposed linear-time algorithms computing longest and shortest cover arrays taking border arrays as input. An equivalence relation \approx over strings is called a substring consistent equivalence relation (SCER) iff XYX \approx Y implies (1) X=Y|X| = |Y| and (2) X[i:j]Y[i:j]X[i:j] \approx Y[i:j] for all 1ijX1 \le i \le j \le |X|. In this paper, we generalize the notion of covers for SCERs and prove that existing algorithms to compute the shortest cover array and the longest cover array of a string TT under the identity relation will work for any SCERs taking the accordingly generalized border arrays.

Keywords

Cite

@article{arxiv.2002.06764,
  title  = {Computing Covers under Substring Consistent Equivalence Relations},
  author = {Natsumi Kikuchi and Diptarama Hendrian and Ryo Yoshinaka and Ayumi Shinohara},
  journal= {arXiv preprint arXiv:2002.06764},
  year   = {2020}
}

Comments

16 pages