Comparing Classifiers: A Case Study Using PyCM
Machine Learning
2026-02-17 v1 Artificial Intelligence
Abstract
Selecting an optimal classification model requires a robust and comprehensive understanding of the performance of the model. This paper provides a tutorial on the PyCM library, demonstrating its utility in conducting deep-dive evaluations of multi-class classifiers. By examining two different case scenarios, we illustrate how the choice of evaluation metrics can fundamentally shift the interpretation of a model's efficacy. Our findings emphasize that a multi-dimensional evaluation framework is essential for uncovering small but important differences in model performance. However, standard metrics may miss these subtle performance trade-offs.
Cite
@article{arxiv.2602.13482,
title = {Comparing Classifiers: A Case Study Using PyCM},
author = {Sadra Sabouri and Alireza Zolanvari and Sepand Haghighi},
journal= {arXiv preprint arXiv:2602.13482},
year = {2026}
}
Comments
13 pages, 3 figures, 2 tables