English

An Algorithm for Consensus Trees

Data Structures and Algorithms 2020-03-09 v2

Abstract

We consider the tree consensus problem, an important problem in bioinformatics. Given a rooted tree tt and another tree TT, one would like to incorporate compatible information from TT to tt. This problem is a subproblem in the tree refinement problem called the RF-Optimal Tree Refinement Problem defined by in Christensen, Molloy, Vachaspati and Warnow [WABI'19] who employ the greedy algorithm by Gawrychowski, Landau, Sung, and Weimann [ICALP'18] that runs in time O(n1.5logn)O(n^{1.5}\log n). We give a faster algorithm for this problem that runs in time O(nlogn)O(n\log n). Our key ingredient is a bipartition compatibility criteria based on amortized-time leaf counters. While this is an improvement, the fastest solution is an algorithm by Jansson, Shen, and Sung [JACM'16] which runs in time O(n)O(n).

Keywords

Cite

@article{arxiv.2003.00488,
  title  = {An Algorithm for Consensus Trees},
  author = {Pongsaphol Pongsawakul},
  journal= {arXiv preprint arXiv:2003.00488},
  year   = {2020}
}

Comments

erroneous claim by JF removed, reference to a faster algorithm by Jansson, Shen, and Sung [JACM'16] added, also reference to Sung [WALCOM'19] added (thanks to Pawel Gawrychowski and Oren Weiman for comments and references)