English

Machine Learning using Stata/Python

Computation 2021-03-05 v1 Machine Learning Mathematical Software

Abstract

We present two related Stata modules, r_ml_stata and c_ml_stata, for fitting popular Machine Learning (ML) methods both in regression and classification settings. Using the recent Stata/Python integration platform (sfi) of Stata 16, these commands provide hyper-parameters' optimal tuning via K-fold cross-validation using greed search. More specifically, they make use of the Python Scikit-learn API to carry out both cross-validation and outcome/label prediction.

Cite

@article{arxiv.2103.03122,
  title  = {Machine Learning using Stata/Python},
  author = {Giovanni Cerulli},
  journal= {arXiv preprint arXiv:2103.03122},
  year   = {2021}
}

Comments

Keywords: Machine Learning, Stata, Python, Optimal tuning

R2 v1 2026-06-23T23:45:33.428Z