English

Benchmarking Automatic Machine Learning Frameworks

Machine Learning 2018-08-21 v1 Artificial Intelligence Machine Learning

Abstract

AutoML serves as the bridge between varying levels of expertise when designing machine learning systems and expedites the data science process. A wide range of techniques is taken to address this, however there does not exist an objective comparison of these techniques. We present a benchmark of current open source AutoML solutions using open source datasets. We test auto-sklearn, TPOT, auto_ml, and H2O's AutoML solution against a compiled set of regression and classification datasets sourced from OpenML and find that auto-sklearn performs the best across classification datasets and TPOT performs the best across regression datasets.

Keywords

Cite

@article{arxiv.1808.06492,
  title  = {Benchmarking Automatic Machine Learning Frameworks},
  author = {Adithya Balaji and Alexander Allen},
  journal= {arXiv preprint arXiv:1808.06492},
  year   = {2018}
}

Comments

9 pages, 8 figures, 5 tables