English

Making Tree Ensembles Interpretable: A Bayesian Model Selection Approach

Machine Learning 2017-03-01 v3

Abstract

Tree ensembles, such as random forests and boosted trees, are renowned for their high prediction performance. However, their interpretability is critically limited due to the enormous complexity. In this study, we present a method to make a complex tree ensemble interpretable by simplifying the model. Specifically, we formalize the simplification of tree ensembles as a model selection problem. Given a complex tree ensemble, we aim at obtaining the simplest representation that is essentially equivalent to the original one. To this end, we derive a Bayesian model selection algorithm that optimizes the simplified model while maintaining the prediction performance. Our numerical experiments on several datasets showed that complicated tree ensembles were reasonably approximated as interpretable.

Keywords

Cite

@article{arxiv.1606.09066,
  title  = {Making Tree Ensembles Interpretable: A Bayesian Model Selection Approach},
  author = {Satoshi Hara and Kohei Hayashi},
  journal= {arXiv preprint arXiv:1606.09066},
  year   = {2017}
}

Comments

21 pages

R2 v1 2026-06-22T14:38:17.778Z