English

Visualizations for Bayesian Additive Regression Trees

Computation 2022-09-13 v2

Abstract

Tree-based regression and classification has become a standard tool in modern data science. Bayesian Additive Regression Trees (BART) has in particular gained wide popularity due its flexibility in dealing with interactions and non-linear effects. BART is a Bayesian tree-based machine learning method that can be applied to both regression and classification problems and yields competitive or superior results when compared to other predictive models. As a Bayesian model, BART allows the practitioner to explore the uncertainty around predictions through the posterior distribution. In this paper, we present new visualization techniques for exploring BART models. We construct conventional plots to analyze a model's performance and stability as well as create new tree-based plots to analyze variable importance, interaction, and tree structure. We employ Value Suppressing Uncertainty Palettes (VSUP) to construct heatmaps that display variable importance and interactions jointly using color scale to represent posterior uncertainty. Our new visualizations are designed to work with the most popular BART R packages available, namely BART, dbarts, and bartMachine. Our approach is implemented in the R package bartMan (BART Model ANalysis).

Keywords

Cite

@article{arxiv.2208.08966,
  title  = {Visualizations for Bayesian Additive Regression Trees},
  author = {Alan Inglis and Andrew Parnell and Catherine Hurley},
  journal= {arXiv preprint arXiv:2208.08966},
  year   = {2022}
}

Comments

25 pages, 15 figures