Related papers: Cautious Calibration in Binary Classification

Binary Classifier Calibration: Bayesian Non-Parametric Approach

A set of probabilistic predictions is well calibrated if the events that are predicted to occur with probability p do in fact occur about p fraction of the time. Well calibrated predictions are particularly important when machine learning…

Machine Learning · Statistics 2014-01-14 Mahdi Pakdaman Naeini, Gregory F. Cooper, Milos Hauskrecht

Classifier Calibration: A survey on how to assess and improve predicted class probabilities

This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its…

Machine Learning · Computer Science 2023-06-16 Telmo Silva Filho, Hao Song, Miquel Perello-Nieto, Raul Santos-Rodriguez, Meelis Kull, Peter Flach

Two Sides of Miscalibration: Identifying Over and Under-Confidence Prediction for Network Calibration

Proper confidence calibration of deep neural networks is essential for reliable predictions in safety-critical tasks. Miscalibration can lead to model over-confidence and/or under-confidence; i.e., the model's confidence in its prediction…

Machine Learning · Computer Science 2023-08-08 Shuang Ao, Stefan Rueger, Advaith Siddharthan

Measuring Calibration in Deep Learning

Overconfidence and underconfidence in machine learning classifiers is measured by calibration: the degree to which the probabilities predicted for each class match the accuracy of the classifier on that prediction. How one measures…

Machine Learning · Computer Science 2020-08-11 Jeremy Nixon, Mike Dusenberry, Ghassen Jerfel, Timothy Nguyen, Jeremiah Liu, Linchuan Zhang, Dustin Tran

Boldness-Recalibration for Binary Event Predictions

Probability predictions are essential to inform decision making across many fields. Ideally, probability predictions are (i) well calibrated, (ii) accurate, and (iii) bold, i.e., spread out enough to be informative for decision making.…

Methodology · Statistics 2024-06-07 Adeline P. Guthrie, Christopher T. Franck

Calibrated Selective Classification

Selective classification allows models to abstain from making predictions (e.g., say "I don't know") when in doubt in order to obtain better effective accuracy. While typical selective models can be effective at producing more accurate…

Machine Learning · Computer Science 2024-06-24 Adam Fisch, Tommi Jaakkola, Regina Barzilay

Confidence Calibration of Classifiers with Many Classes

For classification models based on neural networks, the maximum predicted class probability is often used as a confidence score. This score rarely predicts well the probability of making a correct prediction and requires a post-processing…

Machine Learning · Computer Science 2024-11-07 Adrien LeCoz, Stéphane Herbin, Faouzi Adjed

On Computationally Efficient Multi-Class Calibration

Consider a multi-class labelling problem, where the labels can take values in $[k]$, and a predictor predicts a distribution over the labels. In this work, we study the following foundational question: Are there notions of multi-class…

Machine Learning · Computer Science 2024-06-11 Parikshit Gopalan, Lunjia Hu, Guy N. Rothblum

From Uncertainty to Precision: Enhancing Binary Classifier Performance through Calibration

The assessment of binary classifier performance traditionally centers on discriminative ability using metrics, such as accuracy. However, these metrics often disregard the model's inherent uncertainty, especially when dealing with sensitive…

Machine Learning · Computer Science 2024-02-13 Agathe Fernandes Machado, Arthur Charpentier, Emmanuel Flachaire, Ewen Gallic, François Hu

Unsupervised Calibration under Covariate Shift

A probabilistic model is said to be calibrated if its predicted probabilities match the corresponding empirical frequencies. Calibration is important for uncertainty quantification and decision making in safety-critical applications. While…

Machine Learning · Computer Science 2020-07-01 Anusri Pampari, Stefano Ermon

Binary Classification with Confidence Difference

Recently, learning with soft labels has been shown to achieve better performance than learning with hard labels in terms of model generalization, calibration, and robustness. However, collecting pointwise labeling confidence for all…

Machine Learning · Computer Science 2023-10-10 Wei Wang, Lei Feng, Yuchen Jiang, Gang Niu, Min-Ling Zhang, Masashi Sugiyama

Calibrated and uncertain? Evaluating uncertainty estimates in binary classification models

Rigorous statistical methods, including parameter estimation with accompanying uncertainties, underpin the validity of scientific discovery, especially in the natural sciences. With increasingly complex data models such as deep learning…

Machine Learning · Computer Science 2026-02-18 Aurora Grefsrud, Nello Blaser, Trygve Buanes

Better Uncertainty Calibration via Proper Scores for Classification and Beyond

With model trustworthiness being crucial for sensitive real-world applications, practitioners are putting more and more focus on improving the uncertainty calibration of deep neural networks. Calibration errors are designed to quantify the…

Machine Learning · Computer Science 2024-03-14 Sebastian G. Gruber, Florian Buettner

Non-Parametric Calibration for Classification

Many applications of classification methods not only require high accuracy but also reliable estimation of predictive uncertainty. However, while many current classification frameworks, in particular deep neural networks, achieve high…

Machine Learning · Computer Science 2020-02-28 Jonathan Wenger, Hedvig Kjellström, Rudolph Triebel

Calibration of Machine Learning Classifiers for Probability of Default Modelling

Binary classification is highly used in credit scoring in the estimation of probability of default. The validation of such predictive models is based both on rank ability, and also on calibration (i.e. how accurately the probabilities…

Econometrics · Economics 2017-10-25 Pedro G. Fonseca, Hugo D. Lopes

Hidden Heterogeneity: When to Choose Similarity-Based Calibration

Trustworthy classifiers are essential to the adoption of machine learning predictions in many real-world settings. The predicted probability of possible outcomes can inform high-stakes decision making, particularly when assessing the…

Machine Learning · Computer Science 2023-02-22 Kiri L. Wagstaff, Thomas G. Dietterich

Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

We introduce a framework for calibrating machine learning models so that their predictions satisfy explicit, finite-sample statistical guarantees. Our calibration algorithms work with any underlying model and (unknown) data-generating…

Machine Learning · Computer Science 2022-10-03 Anastasios N. Angelopoulos, Stephen Bates, Emmanuel J. Candès, Michael I. Jordan, Lihua Lei

Is this model reliable for everyone? Testing for strong calibration

In a well-calibrated risk prediction model, the average predicted probability is close to the true event rate for any given subgroup. Such models are reliable across heterogeneous populations and satisfy strong notions of algorithmic…

Machine Learning · Computer Science 2023-07-31 Jean Feng, Alexej Gossmann, Romain Pirracchio, Nicholas Petrick, Gene Pennello, Berkman Sahiner

Better Multi-class Probability Estimates for Small Data Sets

Many classification applications require accurate probability estimates in addition to good class separation but often classifiers are designed focusing only on the latter. Calibration is the process of improving probability estimates by…

Machine Learning · Computer Science 2020-01-31 Tuomo Alasalmi, Jaakko Suutala, Heli Koskimäki, Juha Röning

Clustered Calibration: Representation-Aware Probability Calibration via Learned Subpopulations

Ensuring that predicted probabilities align with observed frequencies is critical in high-stakes domains such as clinical decision support, autonomous driving and financial risk assessment. Existing calibration methods typically apply a…

Machine Learning · Computer Science 2026-05-26 Tomer Lavi, Bracha Shapira, Nadav Rappoport