Related papers: Using theoretical ROC curves for analysing machine…

Bending the Curve: Improving the ROC Curve Through Error Redistribution

Classification performance is often not uniform over the data. Some areas in the input space are easier to classify than others. Features that hold information about the "difficulty" of the data may be non-discriminative and are therefore…

Machine Learning · Computer Science 2016-05-24 Oran Richman , Shie Mannor

A Review of the Receiver Operating Characteristic Curve and a Proof About the Area Beneath It

The Receiver Operating Characteristic (ROC) curve of a binary classifier has often been utilized to measure the performance of the classifier. The area beneath this curve is used in particular because of its quoted probabilistic…

Machine Learning · Computer Science 2026-05-05 Steven Redolfi

Assessing binary classifiers using only positive and unlabeled data

Assessing the performance of a learned model is a crucial part of machine learning. However, in some domains only positive and unlabeled examples are available, which prohibits the use of most standard evaluation metrics. We propose an…

Machine Learning · Statistics 2015-12-31 Marc Claesen , Jesse Davis , Frank De Smet , Bart De Moor

Decision Making with Machine Learning and ROC Curves

The Receiver Operating Characteristic (ROC) curve is a representation of the statistical information discovered in binary classification problems and is a key concept in machine learning and data science. This paper studies the statistical…

Methodology · Statistics 2019-05-09 Kai Feng , Han Hong , Ke Tang , Jingyuan Wang

FROC: Building Fair ROC from a Trained Classifier

This paper considers the problem of fair probabilistic binary classification with binary protected groups. The classifier assigns scores, and a practitioner predicts labels using a certain cut-off threshold based on the desired trade-off…

Machine Learning · Computer Science 2024-12-20 Avyukta Manjunatha Vummintala , Shantanu Das , Sujit Gujar

The VOROS: Lifting ROC curves to 3D

While the area under the ROC curve is perhaps the most common measure that is used to rank the relative performance of different binary classifiers, longstanding field folklore has noted that it can be a measure that ill-captures the…

Machine Learning · Computer Science 2024-12-19 Christopher Ratigan , Lenore Cowen

A better Beta for the H measure of classification performance

The area under the ROC curve is widely used as a measure of performance of classification rules. However, it has recently been shown that the measure is fundamentally incoherent, in the sense that it treats the relative severities of…

Methodology · Statistics 2013-08-02 David J. Hand , Christoforos Anagnostopoulos

A Probabilistic Theory of Supervised Similarity Learning for Pointwise ROC Curve Optimization

The performance of many machine learning techniques depends on the choice of an appropriate similarity or distance measure on the input space. Similarity learning (or metric learning) aims at building such a measure from training data so…

Machine Learning · Statistics 2019-01-25 Robin Vogel , Aurélien Bellet , Stéphan Clémençon

ROC Analysis for Paired Comparison Data

Paired comparison models are used for analyzing data that involves pairwise comparisons among a set of objects. When the outcomes of the pairwise comparisons have no ties, the paired comparison models can be generalized as a class of binary…

Methodology · Statistics 2022-11-29 Ran Huo , Mark E. Glickman

PAC Learnability in the Presence of Performativity

Following the wide-spread adoption of machine learning models in real-world applications, the phenomenon of performativity, i.e. model-dependent shifts in the test distribution, becomes increasingly prevalent. Unfortunately, since models…

Machine Learning · Statistics 2026-01-21 Ivan Kirev , Lyuben Baltadzhiev , Nikola Konstantinov

On Tree-based Methods for Similarity Learning

In many situations, the choice of an adequate similarity measure or metric on the feature space dramatically determines the performance of machine learning methods. Building automatically such measures is the specific purpose of…

Machine Learning · Statistics 2020-02-24 Stéphan Clémençon , Robin Vogel

On the Geometry of Receiver Operating Characteristic and Precision-Recall Curves

We study the geometry of Receiver Operating Characteristic (ROC) and Precision-Recall (PR) curves in binary classification problems. The key finding is that many of the most commonly used binary classification metrics are merely functions…

Machine Learning · Computer Science 2026-04-15 Reza Sameni

Partial VOROS: A Cost-aware Performance Metric for Binary Classifiers with Precision and Capacity Constraints

The ROC curve is widely used to assess binary classifiers. Yet for some applications, such as alert systems for monitoring hospitalized patients, conventional ROC analysis cannot meet two key deployment needs: enforcing a constraint on…

Machine Learning · Computer Science 2026-04-03 Christopher Ratigan , Kyle Heuton , Carissa Wang , Lenore Cowen , Michael C. Hughes

Proper likelihood ratio based ROC curves for general binary classification problems

Everybody writes that ROC curves, a very common tool in binary classification problems, should be optimal, and in particular concave, non-decreasing and above the 45-degree line. Everybody uses ROC curves, theoretical and especially…

Methodology · Statistics 2019-08-01 Lidia Sacchetto , Mauro Gasparini

Calibration of Machine Learning Classifiers for Probability of Default Modelling

Binary classification is highly used in credit scoring in the estimation of probability of default. The validation of such predictive models is based both on rank ability, and also on calibration (i.e. how accurately the probabilities…

Econometrics · Economics 2017-10-25 Pedro G. Fonseca , Hugo D. Lopes

Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction

For large, real-world inductive learning problems, the number of training examples often must be limited due to the costs associated with procuring, preparing, and storing the training examples and/or the computational costs associated with…

Artificial Intelligence · Computer Science 2011-06-24 F. Provost , G. M. Weiss

Binarised Regression with Instance-Varying Costs: Evaluation using Impact Curves

Many evaluation methods exist, each for a particular prediction task, and there are a number of prediction tasks commonly performed including classification and regression. In binarised regression, binary decisions are generated from a…

Machine Learning · Computer Science 2020-08-18 Matthew Dirks , David Poole

The MCC-F1 curve: a performance evaluation technique for binary classification

Many fields use the ROC curve and the PR curve as standard evaluations of binary classification methods. Analysis of ROC and PR, however, often gives misleading and inflated performance evaluations, especially with an imbalanced ground…

Machine Learning · Statistics 2020-06-23 Chang Cao , Davide Chicco , Michael M. Hoffman

Evaluating Probabilistic Classifiers: The Triptych

Probability forecasts for binary outcomes, often referred to as probabilistic classifiers or confidence scores, are ubiquitous in science and society, and methods for evaluating and comparing them are in great demand. We propose and study a…

Methodology · Statistics 2023-01-27 Timo Dimitriadis , Tilmann Gneiting , Alexander I. Jordan , Peter Vogel

How to Control the Error Rates of Binary Classifiers

The traditional binary classification framework constructs classifiers which may have good accuracy, but whose false positive and false negative error rates are not under users' control. In many cases, one of the errors is more severe and…

Machine Learning · Statistics 2020-10-22 Miloš Simić