Related papers: Comparing Classifiers: A Case Study Using PyCM

Metrics for Multi-Class Classification: an Overview

Classification tasks in machine learning involving more than two classes are known by the name of "multi-class classification". Performance indicators are very useful when the aim is to evaluate and compare different classification models…

Machine Learning · Statistics 2020-08-14 Margherita Grandini, Enrico Bagli, Giorgio Visani

Selecting a classification performance measure: matching the measure to the problem

The problem of identifying to which of a given set of classes objects belong is ubiquitous, occurring in many research domains and application areas, including medical diagnosis, financial decision making, online commerce, and national…

Machine Learning · Computer Science 2024-09-20 David J. Hand, Peter Christen, Sumayya Ziyad

Evaluation of Performance Measures for Classifiers Comparison

The selection of the best classification algorithm for a given dataset is a very widespread problem, occurring each time one has to choose a classifier to solve a real-world problem. It is also a complex task with many important…

Machine Learning · Computer Science 2012-08-16 Vincent Labatut, Hocine Cherifi

On the Value of ML Models

We argue that, when establishing and benchmarking Machine Learning (ML) models, the research community should favour evaluation metrics that better capture the value delivered by their model in practical applications. For a specific class…

Machine Learning · Computer Science 2021-12-14 Fabio Casati, Pierre-André Noël, Jie Yang

Fairer and more accurate, but for whom?

Complex statistical machine learning models are increasingly being used or considered for use in high-stakes decision-making pipelines in domains such as financial services, health care, criminal justice and human services. These models are…

Applications · Statistics 2017-07-04 Alexandra Chouldechova, Max G'Sell

A Comprehensive Assessment Benchmark for Rigorously Evaluating Deep Learning Image Classifiers

Reliable and robust evaluation methods are a necessary first step towards developing machine learning models that are themselves robust and reliable. Unfortunately, current evaluation protocols typically used to assess classifiers fail to…

Machine Learning · Computer Science 2025-05-26 Michael W. Spratling

A Large-Scale Empirical Study on Improving the Fairness of Image Classification Models

Fairness has been a critical issue that affects the adoption of deep learning models in real practice. To improve model fairness, many existing methods have been proposed and evaluated to be effective in their own contexts. However, there…

Machine Learning · Computer Science 2024-03-26 Junjie Yang, Jiajun Jiang, Zeyu Sun, Junjie Chen

Interactive Classification Metrics: A graphical application to build robust intuition for classification model evaluation

Machine learning continues to grow in popularity in academia, in industry, and is increasingly used in other fields. However, most of the common metrics used to evaluate even simple binary classification models have shortcomings that are…

Machine Learning · Computer Science 2024-12-25 David H. Brown, Davide Chicco

Probabilistic Multi-Dimensional Classification

Multi-dimensional classification (MDC) can be employed in a range of applications where one needs to predict multiple class variables for each given instance. Many existing MDC methods suffer from at least one of inaccuracy, scalability,…

Machine Learning · Computer Science 2023-11-28 Vu-Linh Nguyen, Yang Yang, Cassio de Campos

Classifier comparison using precision

New proposed models are often compared to state-of-the-art using statistical significance testing. Literature is scarce for classifier comparison using metrics other than accuracy. We present a survey of statistical methods that can be used…

Machine Learning · Computer Science 2016-11-17 Lovedeep Gondara

Consistent Classification with Generalized Metrics

We propose a framework for constructing and analyzing multiclass and multioutput classification metrics, i.e., involving multiple, possibly correlated multiclass labels. Our analysis reveals novel insights on the geometry of feasible…

Machine Learning · Statistics 2019-08-27 Xiaoyan Wang, Ran Li, Bowei Yan, Oluwasanmi Koyejo

Assessing the Performance of Diagnostic Classification Models in Small Sample Contexts with Different Estimation Methods

Fueled by the call for formative assessments, diagnostic classification models (DCMs) have recently gained popularity in psychometrics. Despite their potential for providing diagnostic information that aids in classroom instruction and…

Computation · Statistics 2022-08-26 Motonori Oka, Kensuke Okada

Evaluating software defect prediction performance: an updated benchmarking study

Accurately predicting faulty software units helps practitioners target faulty units and prioritize their efforts to maintain software quality. Prior studies use machine-learning models to detect faulty software code. We revisit past studies…

Software Engineering · Computer Science 2019-01-08 Libo Li, Stefan Lessmann, Bart Baesens

An Empirical Investigation into Benchmarking Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification

Deep learning models have proven to be highly successful. Yet, their over-parameterization gives rise to model multiplicity, a phenomenon in which multiple models achieve similar performance but exhibit distinct underlying behaviours. This…

Machine Learning · Computer Science 2023-11-28 Prakhar Ganesh

Accuracy Measures for the Comparison of Classifiers

The selection of the best classification algorithm for a given dataset is a very widespread problem. It is also a complex one, in the sense that it requires making several important methodological choices. Among them, in this work we focus on…

Machine Learning · Computer Science 2012-07-18 Vincent Labatut, Hocine Cherifi

Data Classification With Multiprocessing

Classification is one of the most important tasks in Machine Learning (ML) and with recent advancements in artificial intelligence (AI) it is important to find efficient ways to implement it. Generally, the choice of classification…

Machine Learning · Computer Science 2023-12-27 Anuja Dixit, Shreya Byreddy, Guanqun Song, Ting Zhu

The Multiplex Classification Framework: optimizing multi-label classifiers through problem transformation, ontology engineering, and model ensembling

Classification is a fundamental task in machine learning. While conventional methods (such as binary, multiclass, and multi-label classification) are effective for simpler problems, they may not adequately address the complexities of some…

Machine Learning · Computer Science 2024-12-20 Mauro Nievas Offidani, Facundo Roffet, Claudio Augusto Delrieux, Maria Carolina Gonzalez Galtier, Marcos Zarate

Classifier uncertainty: evidence, potential impact, and probabilistic treatment

Classifiers are often tested on relatively small data sets, which should lead to uncertain performance metrics. Nevertheless, these metrics are usually taken at face value. We present an approach to quantify the uncertainty of…

Machine Learning · Statistics 2021-03-05 Niklas Tötsch, Daniel Hoffmann

Beyond Average Performance -- exploring regions of deviating performance for black box classification models

Machine learning models are becoming increasingly popular in different types of settings. This is mainly caused by their ability to achieve a level of predictive performance that is hard to match by human experts in this new era of big…

Machine Learning · Computer Science 2021-09-20 Luis Torgo, Paulo Azevedo, Ines Areosa

OCCAM: Towards Cost-Efficient and Accuracy-Aware Classification Inference

Classification tasks play a fundamental role in various applications, spanning domains such as healthcare, natural language processing and computer vision. With the growing popularity and capacity of machine learning models, people can…

Computer Vision and Pattern Recognition · Computer Science 2025-02-26 Dujian Ding, Bicheng Xu, Laks V. S. Lakshmanan