English

Words Avoiding Tangrams

Combinatorics 2024-07-08 v1

Abstract

A \emph{tangram} is a word in which every letter occurs an even number of times. Such word can be cut into parts that can be arranged into two identical words. The minimum number of cuts needed is called the \emph{cut number} of a tangram. For example, the word 01020102\mathtt{\color{red}{0102}\color{blue}{0102}} is a tangram with cut number one, while the word 0101023023\mathtt{\color{red}{01}\color{blue}{01023}\color{red}{023}} is a tangram with cut number two. Clearly, tangrams with cut number one coincide with the well known family of words, known as \emph{squares}, having the form UUUU for some nonempty word UU. A word WW \emph{avoids} a word TT if it is not possible to write W=ATBW=ATB, for any words AA and BB (possibly empty). The famous 1906 theorem of Thue asserts that there exist arbitrarily long words avoiding squares over alphabet with just \emph{three} letters. Given a fixed number k1k\geqslant 1, how many letters are needed to avoid tangrams with the cut number at most kk? Let t(k)t(k) denote the minimum size of an alphabet needed for that purpose. By Thue's result we have t(1)=3t(1)=3, which easily implies t(2)=3t(2)=3. Curiously, these are currently the only known exact values of this function. In our main result we prove that t(k)=Θ(log2k)t(k)=\Theta(\log_2k). The proof uses \emph{entropy compression} argument and \emph{Zimin words}. By using a different method we prove that t(k)k+1t(k)\leqslant k+1 for all k4k\geqslant 4, which gives more exact estimates for small values of kk. The proof makes use of \emph{Dejean words} and a curious property of \emph{Gauss words}, which is perhaps of independent interest.

Keywords

Cite

@article{arxiv.2407.03819,
  title  = {Words Avoiding Tangrams},
  author = {Michał Dębski and Jarosław Grytczuk and Bartłomiej Pawlik and Jakub Przybyło and Małgorzata Śleszyńska-Nowak},
  journal= {arXiv preprint arXiv:2407.03819},
  year   = {2024}
}

Comments

15 pages, 2 figures