Algorithms for Jumbled Pattern Matching in Strings

Data Structures and Algorithms 2015-03-18 v1

Abstract

The Parikh vector p(s) of a string s is the vector of multiplicities of the characters of s. A Parikh vector q occurs in s if s has a substring t with p(t) = q. We present two novel algorithms for searching for a query q in a text s. The first solves the decision problem over a binary text in constant time, using a linear-size index of the text. The second, for a general finite alphabet, finds all occurrences of a given Parikh vector q and has sub-linear expected time complexity; we present two variants, both of which use a linear-size index of the text.
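To make the definitions concrete, here is a minimal Python sketch. It is not the authors' algorithms: `jumbled_occurrences` is a plain linear-time sliding window, and `build_binary_index` fills its table by a quadratic scan purely for illustration. All function names are hypothetical. The binary query relies on the fact that sliding a fixed-length window by one position changes its number of 1s by at most one, so every count between the minimum and the maximum for that length is attained by some window.

```python
from collections import Counter


def parikh(s):
    """Parikh vector of s, as a mapping character -> multiplicity."""
    return Counter(s)


def jumbled_occurrences(text, q):
    """Start positions i such that text[i:i+m] has Parikh vector q.

    Naive sliding window over the text (illustrative baseline only).
    """
    q = Counter({c: k for c, k in q.items() if k > 0})
    m = sum(q.values())  # every occurrence of q has length m
    if m == 0 or m > len(text):
        return []
    window = Counter(text[:m])
    hits = []
    for i in range(len(text) - m + 1):
        if i > 0:
            # Slide the window one step to the right.
            window[text[i - 1]] -= 1
            window[text[i + m - 1]] += 1
        # The window has length m, so matching q's characters is enough.
        if all(window[c] == k for c, k in q.items()):
            hits.append(i)
    return hits


def build_binary_index(text):
    """For each length m, the min and max number of '1's in a length-m window.

    The table has linear size; this construction is quadratic (assumption:
    chosen for clarity, not efficiency).
    """
    n = len(text)
    prefix = [0]
    for ch in text:
        prefix.append(prefix[-1] + (ch == "1"))
    lo = [0] * (n + 1)
    hi = [0] * (n + 1)
    for m in range(1, n + 1):
        counts = [prefix[i + m] - prefix[i] for i in range(n - m + 1)]
        lo[m], hi[m] = min(counts), max(counts)
    return lo, hi


def binary_occurs(index, zeros, ones):
    """Constant-time decision query: does (zeros, ones) occur in the text?"""
    lo, hi = index
    m = zeros + ones
    return 0 < m < len(lo) and lo[m] <= ones <= hi[m]
```

For example, with the text `abcab` and the query q = (a: 1, b: 1), the sliding window reports start positions 0 and 3. For the binary text `0110`, the index answers that (1 zero, 2 ones) occurs while (2 zeros, 1 one) does not.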

Cite

@article{arxiv.1102.1746,
  title  = {Algorithms for Jumbled Pattern Matching in Strings},
  author = {Péter Burcsi and Ferdinando Cicalese and Gabriele Fici and Zsuzsanna Lipták},
  journal= {arXiv preprint arXiv:1102.1746},
  year   = {2015}
}

Comments

18 pages, 9 figures; article accepted for publication in the International Journal of Foundations of Computer Science
