Related papers: Threshold Choice Methods: the Missing Link

Analysis and Comparison of Classification Metrics

A variety of different performance metrics are commonly used in the machine learning literature for the evaluation of classification systems. Some of the most common ones for measuring quality of hard decisions are standard and balanced…

Machine Learning · Computer Science 2023-09-22 Luciana Ferrer

Aligning Evaluation with Clinical Priorities: Calibration, Label Shift, and Error Costs

Machine learning-based decision support systems are increasingly deployed in clinical settings, where probabilistic scoring functions are used to inform and prioritize patient management decisions. However, widely used scoring rules, such…

Machine Learning · Computer Science 2025-07-01 Gerardo A. Flores , Alyssa H. Smith , Julia A. Fukuyama , Ashia C. Wilson

Technical Note: Towards ROC Curves in Cost Space

ROC curves and cost curves are two popular ways of visualising classifier performance, finding appropriate thresholds according to the operating condition, and deriving useful aggregated measures such as the area under the ROC curve (AUC)…

Artificial Intelligence · Computer Science 2011-08-01 José Hernández-Orallo , Peter Flach , Cèsar Ferri

Calibrating Black Box Classification Models through the Thresholding Method

In high-dimensional classification settings, we wish to seek a balance between high power and ensuring control over a desired loss function. In many settings, the points most likely to be misclassified are those who lie near the decision…

Machine Learning · Statistics 2017-06-06 Arun Srinivasan

Inconsistency of evaluation metrics in link prediction

Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links and temporal links based on known topology. Along with the increasing number of link prediction algorithms, a…

Social and Information Networks · Computer Science 2024-02-27 Yilin Bi , Xinshan Jiao , Yan-Li Lee , Tao Zhou

Evaluating classification performance across operating contexts: A comparison of decision curve analysis and cost curves

Classification models typically predict a score and use a decision threshold to produce a classification. Appropriate model evaluation should carefully consider the context in which a model will be used, including the relative value of…

Machine Learning · Computer Science 2025-09-30 Louise AC Millard , Peter A Flach

Classification Performance Metric Elicitation and its Applications

Given a learning problem with real-world tradeoffs, which cost function should the model be trained to optimize? This is the metric selection problem in machine learning. Despite its practical interest, there is limited formal guidance on…

Machine Learning · Statistics 2022-08-22 Gaurush Hiranandani

Performance Metrics (Error Measures) in Machine Learning Regression, Forecasting and Prognostics: Properties and Typology

Performance metrics (error measures) are vital components of the evaluation frameworks in various fields. The intention of this study was to overview of a variety of performance metrics and approaches to their classification. The main goal…

Methodology · Statistics 2019-01-29 Alexei Botchkarev

On resampling methods for model assessment in penalized and unpenalized logistic regression

Penalized logistic regression methods are frequently used to investigate the relationship between a binary outcome and a set of explanatory variables. The model performance can be assessed by measures such as the concordance statistic…

Methodology · Statistics 2021-01-20 Angelika Geroldinger , Lara Lusa , Mariana Nold , Georg Heinze

Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel , Arindam Banerjee , Yug Oswal , Geraldine Bessie Amali D , Venkataraman Muthiah-Nakarajan

Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization

Creating impact in real-world settings requires artificial intelligence techniques to span the full pipeline from data, to predictive models, to decisions. These components are typically approached separately: a machine learning model is…

Machine Learning · Computer Science 2018-11-22 Bryan Wilder , Bistra Dilkina , Milind Tambe

Dealing with Class Imbalance using Thresholding

We propose thresholding as an approach to deal with class imbalance. We define the concept of thresholding as a process of determining a decision boundary in the presence of a tunable parameter. The threshold is the maximum value of this…

Machine Learning · Computer Science 2016-07-12 Charmgil Hong , Rumi Ghosh , Soundar Srinivasan

Adaptive label thresholding methods for online multi-label classification

Existing online multi-label classification works cannot well handle the online label thresholding problem and lack the regret analysis for their online algorithms. This paper proposes a novel framework of adaptive label thresholding…

Machine Learning · Computer Science 2022-11-15 Tingting Zhai , Hongcheng Tang , Hao Wang

Extended sample size calculations for evaluation of prediction models using a threshold for classification

When evaluating the performance of a model for individualised risk prediction, the sample size needs to be large enough to precisely estimate the performance measures of interest. Current sample size guidance is based on precisely…

Methodology · Statistics 2024-07-01 Rebecca Whittle , Joie Ensor , Lucinda Archer , Gary S. Collins , Paula Dhiman , Alastair Denniston , Joseph Alderman , Amardeep Legha , Maarten van Smeden , Karel G. Moons , Jean-Baptiste Cazier , Richard D. Riley , Kym I. E. Snell

Predicting Relative Thresholds for Object Oriented Metrics

Object-oriented software metrics provide a numerical characterization of software quality. They have also been used in the assessment and identification of technical debt. However, metrics generally need to be used with thresholds as…

Software Engineering · Computer Science 2021-04-05 Sultan Alhusain

Discriminating abilities of threshold-free evaluation metrics in link prediction

Link prediction is a paradigmatic and challenging problem in network science, which attempts to uncover missing links or predict future links, based on known topology. A fundamental but still unsolved issue is how to choose proper metrics…

Data Analysis, Statistics and Probability · Physics 2023-03-22 Tao Zhou

Evaluating Posterior Probabilities: Decision Theory, Proper Scoring Rules, and Calibration

Most machine learning classifiers are designed to output posterior probabilities for the classes given the input sample. These probabilities may be used to make the categorical decision on the class of the sample; provided as input to a…

Machine Learning · Statistics 2024-08-07 Luciana Ferrer , Daniel Ramos

A Consequentialist Critique of Binary Classification Evaluation: Theory, Practice, and Tools

Machine learning-supported decisions, such as ordering diagnostic tests or determining preventive custody, often require converting probabilistic forecasts into binary classifications. We adopt a consequentialist perspective from decision…

Machine Learning · Computer Science 2026-03-11 Gerardo Flores , Abigail Schiff , Alyssa H. Smith , Julia A Fukuyama , Ashia C. Wilson

Evaluation of Trace Alignment Quality and its Application in Medical Process Mining

Trace alignment algorithms have been used in process mining for discovering the consensus treatment procedures and process deviations. Different alignment algorithms, however, may produce very different results. No widely-adopted method…

Other Computer Science · Computer Science 2017-09-21 Moliang Zhou , Sen Yang , Shuyu Lv , Xinyu Li , Shuhong Chen , Ivan Marsic , Richard Farneth , Randall Burd

Proper-Composite Loss Functions in Arbitrary Dimensions

The study of a machine learning problem is in many ways is difficult to separate from the study of the loss function being used. One avenue of inquiry has been to look at these loss functions in terms of their properties as scoring rules…

Machine Learning · Computer Science 2022-09-02 Zac Cranko , Robert C. Williamson , Richard Nock