Related papers: Evaluating model calibration in classification

Calibration tests beyond classification

Most supervised machine learning tasks are subject to irreducible prediction errors. Probabilistic predictive models address this limitation by providing probability distributions that represent a belief over plausible targets, rather than…

Machine Learning · Statistics 2022-10-25 David Widmann, Fredrik Lindsten, Dave Zachariah

On the Calibration of Probabilistic Classifier Sets

Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble learning methods, are able to model aleatoric and epistemic uncertainty. Aleatoric uncertainty is then typically quantified via the Bayes…

Machine Learning · Statistics 2023-04-20 Thomas Mortier, Viktor Bengs, Eyke Hüllermeier, Stijn Luca, Willem Waegeman

Classifier Calibration: A survey on how to assess and improve predicted class probabilities

This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its…

Machine Learning · Computer Science 2023-06-16 Telmo Silva Filho, Hao Song, Miquel Perello-Nieto, Raul Santos-Rodriguez, Meelis Kull, Peter Flach

Calibration tests in multi-class classification: A unifying framework

In safety-critical applications a probabilistic model is usually required to be calibrated, i.e., to capture the uncertainty of its predictions accurately. In multi-class classification, calibration of the most confident predictions only is…

Machine Learning · Statistics 2022-09-30 David Widmann, Fredrik Lindsten, Dave Zachariah

Calibration through the Lens of Interpretability

Calibration is a frequently invoked concept when useful label probability estimates are required on top of classification accuracy. A calibrated model is a function whose values correctly reflect underlying label probabilities. Calibration…

Machine Learning · Computer Science 2024-12-03 Alireza Torabian, Ruth Urner

Calibration through the Lens of Indistinguishability

Calibration is a classical notion from the forecasting literature which aims to address the question: how should predicted probabilities be interpreted? In a world where we only get to observe (discrete) outcomes, how should we evaluate a…

Machine Learning · Computer Science 2025-09-03 Parikshit Gopalan, Lunjia Hu

Calibrating Predictions to Decisions: A Novel Approach to Multi-Class Calibration

When facing uncertainty, decision-makers want predictions they can trust. A machine learning provider can convey confidence to decision-makers by guaranteeing their predictions are distribution calibrated -- amongst the inputs that receive…

Machine Learning · Statistics 2021-07-14 Shengjia Zhao, Michael P. Kim, Roshni Sahoo, Tengyu Ma, Stefano Ermon

Cross-calibration of probabilistic forecasts

When providing probabilistic forecasts for uncertain future events, it is common to strive for calibrated forecasts, that is, the predictive distribution should be compatible with the observed outcomes. Several notions of calibration are…

Methodology · Statistics 2015-05-21 Christof Strähl, Johanna F. Ziegel

Calibrated Selective Classification

Selective classification allows models to abstain from making predictions (e.g., say "I don't know") when in doubt in order to obtain better effective accuracy. While typical selective models can be effective at producing more accurate…

Machine Learning · Computer Science 2024-06-24 Adam Fisch, Tommi Jaakkola, Regina Barzilay

Temporal Probability Calibration

In many applications, accurate class probability estimates are required, but many types of models produce poor quality probability estimates despite achieving acceptable classification accuracy. Even though probability calibration has been…

Machine Learning · Computer Science 2020-02-18 Tim Leathart, Maksymilian Polaczuk

Calibrate: Interactive Analysis of Probabilistic Model Output

Analyzing classification model performance is a crucial task for machine learning practitioners. While practitioners often use count-based metrics derived from confusion matrices, like accuracy, many applications, such as weather…

Human-Computer Interaction · Computer Science 2022-07-29 Peter Xenopoulos, Joao Rulff, Luis Gustavo Nonato, Brian Barr, Claudio Silva

Estimating Expected Calibration Errors

Uncertainty in probabilistic classifiers' predictions is a key concern when models are used to support human decision making, in broader probabilistic pipelines or when sensitive automatic decisions have to be taken. Studies have shown that…

Machine Learning · Computer Science 2021-09-09 Nicolas Posocco, Antoine Bonnefoy

A comprehensive review of classifier probability calibration metrics

Probabilities or confidence values produced by artificial intelligence (AI) and machine learning (ML) models often do not reflect their true accuracy, with some models being under- or overconfident in their predictions. For example, if a…

Machine Learning · Computer Science 2025-04-28 Richard Oliver Lane

Scalable Utility-Aware Multiclass Calibration

Ensuring that classifiers are well-calibrated, i.e., their predictions align with observed frequencies, is a minimal and fundamental requirement for classifiers to be viewed as trustworthy. Existing methods for assessing multiclass…

Machine Learning · Computer Science 2025-10-30 Mahmoud Hegazy, Michael I. Jordan, Aymeric Dieuleveut

Unsupervised Calibration under Covariate Shift

A probabilistic model is said to be calibrated if its predicted probabilities match the corresponding empirical frequencies. Calibration is important for uncertainty quantification and decision making in safety-critical applications. While…

Machine Learning · Computer Science 2020-07-01 Anusri Pampari, Stefano Ermon

Reassessing How to Compare and Improve the Calibration of Machine Learning Models

A machine learning model is calibrated if its predicted probability for an outcome matches the observed frequency for that outcome conditional on the model prediction. This property has become increasingly important as the impact of machine…

Machine Learning · Computer Science 2025-02-25 Muthu Chidambaram, Rong Ge

Better Uncertainty Calibration via Proper Scores for Classification and Beyond

With model trustworthiness being crucial for sensitive real-world applications, practitioners are putting more and more focus on improving the uncertainty calibration of deep neural networks. Calibration errors are designed to quantify the…

Machine Learning · Computer Science 2024-03-14 Sebastian G. Gruber, Florian Buettner

Calibrated Top-1 Uncertainty estimates for classification by score based models

While the accuracy of modern deep learning models has significantly improved in recent years, the ability of these models to generate uncertainty estimates has not progressed to the same degree. Uncertainty methods are designed to provide…

Machine Learning · Statistics 2020-06-17 Adam M. Oberman, Chris Finlay, Alexander Iannantuono, Tiago Salvador

Classifier Calibration: with application to threat scores in cybersecurity

This paper explores the calibration of a classifier output score in binary classification problems. A calibrator is a function that maps the arbitrary classifier score, of a testing observation, onto $[0,1]$ to provide an estimate for the…

Machine Learning · Computer Science 2022-04-29 Waleed A. Yousef, Issa Traore, William Briguglio

Better Multi-class Probability Estimates for Small Data Sets

Many classification applications require accurate probability estimates in addition to good class separation but often classifiers are designed focusing only on the latter. Calibration is the process of improving probability estimates by…

Machine Learning · Computer Science 2020-01-31 Tuomo Alasalmi, Jaakko Suutala, Heli Koskimäki, Juha Röning