English

A Skew-Sensitive Evaluation Framework for Imbalanced Data Classification

Machine Learning 2023-11-20 v2 Artificial Intelligence

Abstract

Class distribution skews in imbalanced datasets may lead to models with prediction bias towards majority classes, making fair assessment of classifiers a challenging task. Metrics such as Balanced Accuracy are commonly used to evaluate a classifier's prediction performance under such scenarios. However, these metrics fall short when classes vary in importance. In this paper, we propose a simple and general-purpose evaluation framework for imbalanced data classification that is sensitive to arbitrary skews in class cardinalities and importances. Experiments with several state-of-the-art classifiers tested on real-world datasets from three different domains show the effectiveness of our framework - not only in evaluating and ranking classifiers, but also training them.

Keywords

Cite

@article{arxiv.2010.05995,
  title  = {A Skew-Sensitive Evaluation Framework for Imbalanced Data Classification},
  author = {Min Du and Nesime Tatbul and Brian Rivers and Akhilesh Kumar Gupta and Lucas Hu and Wei Wang and Ryan Marcus and Shengtian Zhou and Insup Lee and Justin Gottschlich},
  journal= {arXiv preprint arXiv:2010.05995},
  year   = {2023}
}

Comments

17 pages, Data-centric Machine Learning Research (DMLR) Workshop at ICML 2023