Related papers: Calibration tests beyond classification

Calibration tests in multi-class classification: A unifying framework

In safety-critical applications a probabilistic model is usually required to be calibrated, i.e., to capture the uncertainty of its predictions accurately. In multi-class classification, calibration of the most confident predictions only is…

Machine Learning · Statistics 2022-09-30 David Widmann , Fredrik Lindsten , Dave Zachariah

Evaluating model calibration in classification

Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their…

Machine Learning · Computer Science 2019-02-20 Juozas Vaicenavicius , David Widmann , Carl Andersson , Fredrik Lindsten , Jacob Roll , Thomas B. Schön

Calibration by Distribution Matching: Trainable Kernel Calibration Metrics

Calibration ensures that probabilistic forecasts meaningfully capture uncertainty by requiring that predicted probabilities align with empirical frequencies. However, many existing calibration methods are specialized for post-hoc…

Machine Learning · Computer Science 2023-11-01 Charles Marx , Sofian Zalouk , Stefano Ermon

Reassessing How to Compare and Improve the Calibration of Machine Learning Models

A machine learning model is calibrated if its predicted probability for an outcome matches the observed frequency for that outcome conditional on the model prediction. This property has become increasingly important as the impact of machine…

Machine Learning · Computer Science 2025-02-25 Muthu Chidambaram , Rong Ge

Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

We introduce a framework for calibrating machine learning models so that their predictions satisfy explicit, finite-sample statistical guarantees. Our calibration algorithms work with any underlying model and (unknown) data-generating…

Machine Learning · Computer Science 2022-10-03 Anastasios N. Angelopoulos , Stephen Bates , Emmanuel J. Candès , Michael I. Jordan , Lihua Lei

Towards reliable predictive analytics: a generalized calibration framework

Calibration is a pivotal aspect in predictive modeling, as it ensures that the predictions closely correspond with what we observe empirically. The contemporary calibration framework, however, is predominantly focused on prediction models…

Methodology · Statistics 2023-09-18 Bavo De Cock Campo

On the Calibration of Probabilistic Classifier Sets

Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble learning methods, are able to model aleatoric and epistemic uncertainty. Aleatoric uncertainty is then typically quantified via the Bayes…

Machine Learning · Statistics 2023-04-20 Thomas Mortier , Viktor Bengs , Eyke Hüllermeier , Stijn Luca , Willem Waegeman

Enforcing Calibration in Multi-Output Probabilistic Regression with Pre-rank Regularization

Probabilistic models must be well calibrated to support reliable decision-making. While calibration in single-output regression is well studied, defining and achieving multivariate calibration in multi-output regression remains considerably…

Machine Learning · Statistics 2025-10-28 Naomi Desobry , Elnura Zhalieva , Souhaib Ben Taieb

Better Uncertainty Calibration via Proper Scores for Classification and Beyond

With model trustworthiness being crucial for sensitive real-world applications, practitioners are putting more and more focus on improving the uncertainty calibration of deep neural networks. Calibration errors are designed to quantify the…

Machine Learning · Computer Science 2024-03-14 Sebastian G. Gruber , Florian Buettner

A Generalized Bayesian Approach to Model Calibration

In model development, model calibration and validation play complementary roles toward learning reliable models. In this article, we expand the Bayesian Validation Metric framework to a general calibration and validation framework by…

Methodology · Statistics 2020-08-04 Tony Tohme , Kevin Vanslette , Kamal Youcef-Toumi

Calibration through the Lens of Interpretability

Calibration is a frequently invoked concept when useful label probability estimates are required on top of classification accuracy. A calibrated model is a function whose values correctly reflect underlying label probabilities. Calibration…

Machine Learning · Computer Science 2024-12-03 Alireza Torabian , Ruth Urner

Classifier Calibration: A survey on how to assess and improve predicted class probabilities

This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its…

Machine Learning · Computer Science 2023-06-16 Telmo Silva Filho , Hao Song , Miquel Perello-Nieto , Raul Santos-Rodriguez , Meelis Kull , Peter Flach

Non-Parametric Calibration of Probabilistic Regression

The task of calibration is to retrospectively adjust the outputs from a machine learning model to provide better probability estimates on the target variable. While calibration has been investigated thoroughly in classification, it has not…

Machine Learning · Statistics 2018-06-21 Hao Song , Meelis Kull , Peter Flach

Modular Conformal Calibration

Uncertainty estimates must be calibrated (i.e., accurate) and sharp (i.e., informative) in order to be useful. This has motivated a variety of methods for recalibration, which use held-out data to turn an uncalibrated model into a…

Machine Learning · Computer Science 2022-07-06 Charles Marx , Shengjia Zhao , Willie Neiswanger , Stefano Ermon

Calibration of Neural Networks

Neural networks solving real-world problems are often required not only to make accurate predictions but also to provide a confidence level in the forecast. The calibration of a model indicates how close the estimated confidence is to the…

Neural and Evolutionary Computing · Computer Science 2023-03-21 Ruslan Vasilev , Alexander D'yakonov

Nonparametric Distribution Regression Re-calibration

A key challenge in probabilistic regression is ensuring that predictive distributions accurately reflect true empirical uncertainty. Minimizing overall prediction error often encourages models to prioritize informativeness over calibration,…

Machine Learning · Statistics 2026-02-17 Ádám Jung , Domokos M. Kelen , András A. Benczúr

Calibrated Selective Classification

Selective classification allows models to abstain from making predictions (e.g., say "I don't know") when in doubt in order to obtain better effective accuracy. While typical selective models can be effective at producing more accurate…

Machine Learning · Computer Science 2024-06-24 Adam Fisch , Tommi Jaakkola , Regina Barzilay

A comprehensive review of classifier probability calibration metrics

Probabilities or confidence values produced by artificial intelligence (AI) and machine learning (ML) models often do not reflect their true accuracy, with some models being under or over confident in their predictions. For example, if a…

Machine Learning · Computer Science 2025-04-28 Richard Oliver Lane

Recipes for Calibration Checks in Safety-Critical Applications

Safety-critical prediction systems, such as autonomous vehicles, weather forecasters, and medical monitors, commonly rely on probabilistic forecasters. These forecasters make predictions about possible future outcomes, and their quality and…

Methodology · Statistics 2026-04-30 Romeo Valentin

Non-Parametric Calibration for Classification

Many applications of classification methods not only require high accuracy but also reliable estimation of predictive uncertainty. However, while many current classification frameworks, in particular deep neural networks, achieve high…

Machine Learning · Computer Science 2020-02-28 Jonathan Wenger , Hedvig Kjellström , Rudolph Triebel