English

Order-Preserving Pattern Matching Indeterminate Strings

Data Structures and Algorithms 2019-05-08 v1

Abstract

Given an indeterminate string pattern pp and an indeterminate string text tt, the problem of order-preserving pattern matching with character uncertainties (μ\muOPPM) is to find all substrings of tt that satisfy one of the possible orderings defined by pp. When the text and pattern are determinate strings, we are in the presence of the well-studied exact order-preserving pattern matching (OPPM) problem with diverse applications on time series analysis. Despite its relevance, the exact OPPM problem suffers from two major drawbacks: 1) the inability to deal with indetermination in the text, thus preventing the analysis of noisy time series; and 2) the inability to deal with indetermination in the pattern, thus imposing the strict satisfaction of the orders among all pattern positions. This paper provides the first polynomial algorithm to answer the μ\muOPPM problem when indetermination is observed on the pattern or text. Given two strings with length mm and O(r)O(r) uncertain characters per string position, we show that the μ\muOPPM problem can be solved in O(mrlgr)O(mr\lg r) time when one string is indeterminate and rN+r\in\mathbb{N}^+. Mappings into satisfiability problems are provided when indetermination is observed on both the pattern and the text, and results concerning the general problem complexity are presented as well, with μ\muOPPM problem proved to be NP-hard in general.

Keywords

Cite

@article{arxiv.1905.02589,
  title  = {Order-Preserving Pattern Matching Indeterminate Strings},
  author = {Diogo Costa and Luís M. S. Russo and Rui Henriques and Hideo Bannai and Alexandre P. Francisco},
  journal= {arXiv preprint arXiv:1905.02589},
  year   = {2019}
}