Words Avoiding Tangrams
Abstract
A \emph{tangram} is a word in which every letter occurs an even number of times. Such word can be cut into parts that can be arranged into two identical words. The minimum number of cuts needed is called the \emph{cut number} of a tangram. For example, the word is a tangram with cut number one, while the word is a tangram with cut number two. Clearly, tangrams with cut number one coincide with the well known family of words, known as \emph{squares}, having the form for some nonempty word . A word \emph{avoids} a word if it is not possible to write , for any words and (possibly empty). The famous 1906 theorem of Thue asserts that there exist arbitrarily long words avoiding squares over alphabet with just \emph{three} letters. Given a fixed number , how many letters are needed to avoid tangrams with the cut number at most ? Let denote the minimum size of an alphabet needed for that purpose. By Thue's result we have , which easily implies . Curiously, these are currently the only known exact values of this function. In our main result we prove that . The proof uses \emph{entropy compression} argument and \emph{Zimin words}. By using a different method we prove that for all , which gives more exact estimates for small values of . The proof makes use of \emph{Dejean words} and a curious property of \emph{Gauss words}, which is perhaps of independent interest.
Cite
@article{arxiv.2407.03819,
title = {Words Avoiding Tangrams},
author = {Michał Dębski and Jarosław Grytczuk and Bartłomiej Pawlik and Jakub Przybyło and Małgorzata Śleszyńska-Nowak},
journal= {arXiv preprint arXiv:2407.03819},
year = {2024}
}
Comments
15 pages, 2 figures