The maximum agreement subtree problem

Combinatorics 2013-02-21 v5

Abstract

In this paper we investigate an extremal problem on binary phylogenetic trees. Given two such trees $T_1$ and $T_2$, both with leaf-set $\{1,2,\ldots,n\}$, we are interested in the size of the largest subset $S \subseteq \{1,2,\ldots,n\}$ of leaves in a common subtree of $T_1$ and $T_2$. We show that any two binary phylogenetic trees have a common subtree on $\Omega(\sqrt{\log n})$ leaves, thus improving on the previously known bound of $\Omega(\log\log n)$ due to M. Steel and L. Szekely. To achieve this improved bound, we first consider two special cases of the problem: when one of the trees is balanced or a caterpillar, we show that the largest common subtree has $\Omega(\log n)$ leaves. We then handle the general case by proving and applying a Ramsey-type result: that every binary tree contains either a large balanced subtree or a large caterpillar. We also show that there are constants $c, \alpha > 0$ such that, when both trees are balanced, they have a common subtree on $c n^\alpha$ leaves. We conjecture that it is possible to take $\alpha = 1/2$ in the unrooted case, and both $c = 1$ and $\alpha = 1/2$ in the rooted case.
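To make the quantity being bounded concrete, the following is a minimal brute-force sketch for the rooted case only. It is not taken from the paper: the nested-tuple tree encoding and the names `leaves`, `restrict`, and `mast_size` are assumptions made for illustration. It restricts both trees to each candidate leaf subset, suppresses the resulting degree-2 vertices, and returns the size of the largest subset on which the two restrictions coincide. The search is exponential in $n$, so it is only meant for tiny examples.

```python
from itertools import combinations

# Assumed encoding (not from the paper): a rooted binary tree is either an
# int (a leaf label) or a pair (left, right) of subtrees.

def leaves(tree):
    """Return the set of leaf labels of a tree."""
    if isinstance(tree, int):
        return {tree}
    left, right = tree
    return leaves(left) | leaves(right)

def restrict(tree, keep):
    """Restrict a tree to the leaves in `keep`, suppressing degree-2 vertices.

    The result is a canonical form: an int for a leaf, a frozenset of the
    two child forms for an internal vertex, or None if no leaf survives.
    """
    if isinstance(tree, int):
        return tree if tree in keep else None
    left, right = (restrict(child, keep) for child in tree)
    if left is None:
        return right
    if right is None:
        return left
    return frozenset((left, right))  # child order is irrelevant

def mast_size(t1, t2):
    """Size of a largest leaf subset on which t1 and t2 agree (brute force)."""
    labels = sorted(leaves(t1))
    assert set(labels) == leaves(t2), "trees must share the same leaf set"
    for k in range(len(labels), 0, -1):
        for subset in combinations(labels, k):
            keep = set(subset)
            if restrict(t1, keep) == restrict(t2, keep):
                return k
    return 0

caterpillar = (((1, 2), 3), 4)
reversed_caterpillar = (((4, 3), 2), 1)
balanced = ((1, 2), (3, 4))
print(mast_size(caterpillar, balanced))              # agree on {1, 2, 3}
print(mast_size(caterpillar, reversed_caterpillar))  # agree only on pairs
```

Here the caterpillar and the balanced tree share the subtree on leaves $\{1,2,3\}$, while the two oppositely ordered caterpillars agree on no more than two leaves.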

Cite

@article{arxiv.1201.5168,
  title  = {The maximum agreement subtree problem},
  author = {Daniel M. Martin and Bhalchandra D. Thatte},
  journal= {arXiv preprint arXiv:1201.5168},
  year   = {2013}
}

Comments

22 pages, 4 figures
