中文

Selective Memoization for Efficient Backtracking Regular Expression Matching

形式语言与自动机理论 2026-06-25 v1

摘要

Backtracking regular expression matchers are widely used due to their expressive power but may exhibit exponential worst-case matching time. Memoization provides a principled method for eliminating redundant computation and ensuring linear matching time, but full memoization is memory-intensive and impractical. We introduce the Minimum Feedback Node (MFN) memoization scheme, a selective memoization strategy based on computing a minimum feedback vertex set of an automaton. We establish relationships with existing memoization schemes and analyze their behaviour under both Thompson and Glushkov automaton constructions.

引用

@article{arxiv.2606.26678,
  title  = {Selective Memoization for Efficient Backtracking Regular Expression Matching},
  author = {Martin Berglund and Brink van der Merwe and Iain le Roux},
  journal= {arXiv preprint arXiv:2606.26678},
  year   = {2026}
}

备注

In Proceedings NCMA 2026, arXiv:2606.25881