English

Feature Selection Tutorial with Python Examples

Machine Learning 2021-06-14 v1

Abstract

In Machine Learning, feature selection entails selecting a subset of the available features in a dataset to use for model development. There are many motivations for feature selection, it may result in better models, it may provide insight into the data and it may deliver economies in data gathering or data processing. For these reasons feature selection has received a lot of attention in data analytics research. In this paper we provide an overview of the main methods and present practical examples with Python implementations. While the main focus is on supervised feature selection techniques, we also cover some feature transformation methods.

Keywords

Cite

@article{arxiv.2106.06437,
  title  = {Feature Selection Tutorial with Python Examples},
  author = {Padraig Cunningham and Bahavathy Kathirgamanathan and Sarah Jane Delany},
  journal= {arXiv preprint arXiv:2106.06437},
  year   = {2021}
}

Comments

20 pages, 19 figures