Maximal Closed Substrings

Data Structures and Algorithms 2024-05-15 v2 Formal Languages and Automata Theory

Abstract

A string is closed if it has length $1$ or has a nonempty border without internal occurrences. In this paper we introduce the definition of a \emph{maximal closed substring} (MCS), which is an occurrence of a closed substring that cannot be extended to the left nor to the right into a longer closed substring. MCSs with exponent at least $2$ are commonly called \emph{runs}; those with exponent smaller than $2$, instead, are particular cases of \emph{maximal gapped repeats}. We provide an algorithm that, given a string of length $n$, locates all MCSs the string contains in $\mathcal{O}(n\log n)$ time.
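
The two definitions above can be illustrated with a small brute-force sketch in Python. It is not the $\mathcal{O}(n\log n)$ algorithm of the paper, only a direct check of the definitions. The function names `is_closed` and `maximal_closed_substrings` are hypothetical, and the sketch assumes that "cannot be extended" means that adding a single character on either side does not yield a closed string; the paper's formal definition should be consulted if that reading matters.

```python
def is_closed(w: str) -> bool:
    """Return True if w is closed: it has length 1, or some nonempty
    border of w occurs in w only as a prefix and as a suffix."""
    n = len(w)
    if n == 1:
        return True
    for k in range(1, n):  # k = length of a candidate proper border
        border = w[:k]
        # The first occurrence starting at index >= 1 must be the suffix
        # occurrence itself; anything earlier is an internal occurrence.
        if w.endswith(border) and w.find(border, 1) == n - k:
            return True
    return False


def maximal_closed_substrings(s: str) -> list[tuple[int, int]]:
    """List half-open index pairs (i, j) such that s[i:j] is closed and
    neither s[i-1:j] nor s[i:j+1] is closed.

    Assumption: maximality is tested against one-character extensions.
    """
    n = len(s)
    result = []
    for i in range(n):
        for j in range(i + 1, n + 1):
            if not is_closed(s[i:j]):
                continue
            extends_left = i > 0 and is_closed(s[i - 1:j])
            extends_right = j < n and is_closed(s[i:j + 1])
            if not (extends_left or extends_right):
                result.append((i, j))
    return result


if __name__ == "__main__":
    print(is_closed("aba"))                  # border "a" has no internal occurrence
    print(maximal_closed_substrings("aba"))
```

Checking every substring this way takes time polynomial in $n$ but far more than $\mathcal{O}(n\log n)$, so the sketch is only suitable for experimenting with short strings.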

Cite

@article{arxiv.2209.00271,
  title  = {Maximal Closed Substrings},
  author = {Golnaz Badkobeh and Alessandro De Luca and Gabriele Fici and Simon Puglisi},
  journal= {arXiv preprint arXiv:2209.00271},
  year   = {2024}
}