Related papers: Robust performance metrics for imbalanced classifi…

A study on cost behaviors of binary classification measures in class-imbalanced problems

This work investigates the cost behaviors of binary classification measures against a background of class-imbalanced problems. Twelve performance measures are studied, such as F measure, G-means in terms of accuracy rates, and of recall and…

Machine Learning · Computer Science 2014-03-28 Bao-Gang Hu, Wei-Ming Dong

The MCC-F1 curve: a performance evaluation technique for binary classification

Many fields use the ROC curve and the PR curve as standard evaluations of binary classification methods. Analysis of ROC and PR, however, often gives misleading and inflated performance evaluations, especially with an imbalanced ground…

Machine Learning · Statistics 2020-06-23 Chang Cao, Davide Chicco, Michael M. Hoffman

Asymptotic Properties of Matthews Correlation Coefficient

Evaluating classifications is crucial in statistics and machine learning, as it influences decision-making across various fields, such as patient prognosis and therapy in critical conditions. The Matthews correlation coefficient (MCC) is…

Methodology · Statistics 2024-06-18 Yuki Itaya, Jun Tamura, Kenichi Hayashi, Kouji Yamamoto

Statistical Inference of the Matthews Correlation Coefficient for Multiclass Classification

Classification problems are essential statistical tasks that form the foundation of decision-making across various fields, including patient prognosis and treatment strategies for critical conditions. Consequently, evaluating the…

Methodology · Statistics 2025-03-11 Jun Tamura, Yuki Itaya, Kenichi Hayashi, Kouji Yamamoto

Beyond Rebalancing: Benchmarking Binary Classifiers Under Class Imbalance Without Rebalancing Techniques

Class imbalance poses a significant challenge to supervised classification, particularly in critical domains like medical diagnostics and anomaly detection where minority class instances are rare. While numerous studies have explored…

Machine Learning · Computer Science 2025-09-10 Ali Nawaz, Amir Ahmad, Shehroz S. Khan

Assessing Software Defection Prediction Performance: Why Using the Matthews Correlation Coefficient Matters

Context: There is considerable diversity in the range and design of computational experiments to assess classifiers for software defect prediction. This is particularly so, regarding the choice of classifier performance metrics.…

Software Engineering · Computer Science 2020-03-04 Jingxiu Yao, Martin Shepperd

Weighted MCC: A Robust Measure of Multiclass Classifier Performance for Observations with Individual Weights

Several performance measures are used to evaluate binary and multiclass classification tasks. But individual observations may often have distinct weights, and none of these measures are sensitive to such varying weights. We propose a new…

Machine Learning · Statistics 2025-12-25 Rommel Cortez, Bala Krishnamoorthy

Good Classification Measures and How to Find Them

Several performance measures can be used for evaluating classification results: accuracy, F-measure, and many others. Can we say that some of them are better than others, or, ideally, choose one measure that is best in all situations? To…

Machine Learning · Computer Science 2022-01-25 Martijn Gösgens, Anton Zhiyanov, Alexey Tikhonov, Liudmila Prokhorenkova

Balancing the Scales: A Comprehensive Study on Tackling Class Imbalance in Binary Classification

Class imbalance in binary classification tasks remains a significant challenge in machine learning, often resulting in poor performance on minority classes. This study comprehensively evaluates three widely-used strategies for handling…

Machine Learning · Computer Science 2024-10-01 Mohamed Abdelhamid, Abhyuday Desai

Analysis and Comparison of Classification Metrics

A variety of different performance metrics are commonly used in the machine learning literature for the evaluation of classification systems. Some of the most common ones for measuring quality of hard decisions are standard and balanced…

Machine Learning · Computer Science 2023-09-22 Luciana Ferrer

Online Classification with Complex Metrics

We present a framework and analysis of consistent binary classification for complex and non-decomposable performance metrics such as the F-measure and the Jaccard measure. The proposed framework is general, as it applies to both batch and…

Machine Learning · Statistics 2018-02-13 Bowei Yan, Oluwasanmi Koyejo, Kai Zhong, Pradeep Ravikumar

Appropriateness of Performance Indices for Imbalanced Data Classification: An Analysis

Indices quantifying the performance of classifiers under class imbalance often suffer from distortions depending on the constitution of the test set or the class-specific classification accuracy, creating difficulties in assessing the…

Machine Learning · Computer Science 2020-08-28 Sankha Subhra Mullick, Shounak Datta, Sourish Gunesh Dhekane, Swagatam Das

Optimal Binary Classification Beyond Accuracy

The vast majority of statistical theory on binary classification characterizes performance in terms of accuracy. However, accuracy is known in many cases to poorly reflect the practical consequences of classification error, most famously in…

Statistics Theory · Mathematics 2022-09-27 Shashank Singh, Justin Khim

Classification Performance Metric for Imbalance Data Based on Recall and Selectivity Normalized in Class Labels

In the classification of a class imbalance dataset, the performance measure used for the model selection and comparison to competing methods is a major issue. In order to overcome this problem several performance measures are defined and…

Machine Learning · Computer Science 2020-06-25 Robert Burduk

Theory of Optimizing Pseudolinear Performance Measures: Application to F-measure

Non-linear performance measures are widely used for the evaluation of learning algorithms. For example, $F$-measure is a commonly used performance measure for classification problems in machine learning and information retrieval community.…

Machine Learning · Computer Science 2018-01-03 Shameem A Puthiya Parambath, Nicolas Usunier, Yves Grandvalet

A Minimax Probability Machine for Non-Decomposable Performance Measures

Imbalanced classification tasks are widespread in many real-world applications. For such classification tasks, in comparison with the accuracy rate, it is usually much more appropriate to use non-decomposable performance measures such as…

Machine Learning · Computer Science 2021-03-16 Junru Luo, Hong Qiao, Bo Zhang

Towards Competitive Classifiers for Unbalanced Classification Problems: A Study on the Performance Scores

Although a great methodological effort has been invested in proposing competitive solutions to the class-imbalance problem, little effort has been made in pursuing a theoretical understanding of this matter. In order to shed some light on…

Machine Learning · Statistics 2016-09-04 Jonathan Ortigosa-Hernández, Iñaki Inza, Jose A. Lozano

Measuring Class-Imbalance Sensitivity of Deterministic Performance Evaluation Metrics

The class-imbalance issue is intrinsic to many real-world machine learning tasks, particularly to the rare-event classification problems. Although the impact and treatment of imbalanced data is widely known, the magnitude of a metric's…

Machine Learning · Computer Science 2022-06-22 Azim Ahmadzadeh, Rafal A. Angryk

Multi-fairness under class-imbalance

Recent studies showed that datasets used in fairness-aware machine learning for multiple protected attributes (referred to as multi-discrimination hereafter) are often imbalanced. The class-imbalance problem is more severe for the often…

Machine Learning · Computer Science 2022-06-22 Arjun Roy, Vasileios Iosifidis, Eirini Ntoutsi

Binary Classification with Karmic, Threshold-Quasi-Concave Metrics

Complex performance measures, beyond the popular measure of accuracy, are increasingly being used in the context of binary classification. These complex performance measures are typically not even decomposable, that is, the loss evaluated…

Machine Learning · Statistics 2018-06-05 Bowei Yan, Oluwasanmi Koyejo, Kai Zhong, Pradeep Ravikumar