Regular expression length via arithmetic formula complexity
Abstract
We prove lower bounds on the length of regular expressions for finite languages by methods from arithmetic circuit complexity. First, we show a reduction: the length of a regular expression for a language is bounded from below by the minimum size of a monotone arithmetic formula computing a polynomial that has as its set of exponent vectors, viewing words as vectors. This result yields lower bounds for the binomial language of all words with exactly ones and zeros and for the language of all Dyck words of length . We also determine the blow-up of language operations (intersection and shuffle) of regular expressions for finite languages. Second, we adapt a lower bound method for multilinear arithmetic formulas by so-called log-product polynomials to regular expressions. With this method we show almost tight lower bounds for the language of all binary numbers with bits that are divisible by a given odd integer , for the language of all words of length over a letter alphabet with an even number of occurrences of each letter and for the language of all permutations of .
Cite
@article{arxiv.2012.15617,
title = {Regular expression length via arithmetic formula complexity},
author = {Ehud Cseresnyes and Hannes Seiwert},
journal= {arXiv preprint arXiv:2012.15617},
year = {2021}
}