Related papers: Improving Multi-Class Calibration through Normaliz…

Classifier Calibration with ROC-Regularized Isotonic Regression

Calibration of machine learning classifiers is necessary to obtain reliable and interpretable predictions, bridging the gap between model confidence and actual probabilities. One prominent technique, isotonic regression (IR), aims at…

Machine Learning · Computer Science 2023-11-22 Eugene Berta, Francis Bach, Michael Jordan

Classifier Calibration: A survey on how to assess and improve predicted class probabilities

This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its…

Machine Learning · Computer Science 2023-06-16 Telmo Silva Filho, Hao Song, Miquel Perello-Nieto, Raul Santos-Rodriguez, Meelis Kull, Peter Flach

Uncertainty-Aware Post-Hoc Calibration: Mitigating Confidently Incorrect Predictions Beyond Calibration Metrics

Despite extensive research on neural network calibration, existing methods typically apply global transformations that treat all predictions uniformly, overlooking the heterogeneous reliability of individual predictions. Furthermore, the…

Machine Learning · Computer Science 2025-10-22 Hassan Gharoun, Mohammad Sadegh Khorshidi, Kasra Ranjbarigderi, Fang Chen, Amir H. Gandomi

Class-wise and reduced calibration methods

For many applications of probabilistic classifiers it is important that the predicted confidence vectors reflect true probabilities (one says that the classifier is calibrated). It has been shown that common models fail to satisfy this…

Machine Learning · Statistics 2022-10-10 Michael Panchenko, Anes Benmerzoug, Miguel de Benito Delgado

Calibration of Machine Learning Classifiers for Probability of Default Modelling

Binary classification is highly used in credit scoring in the estimation of probability of default. The validation of such predictive models is based both on rank ability, and also on calibration (i.e. how accurately the probabilities…

Econometrics · Economics 2017-10-25 Pedro G. Fonseca, Hugo D. Lopes

Confidence Calibration of Classifiers with Many Classes

For classification models based on neural networks, the maximum predicted class probability is often used as a confidence score. This score rarely predicts well the probability of making a correct prediction and requires a post-processing…

Machine Learning · Computer Science 2024-11-07 Adrien LeCoz, Stéphane Herbin, Faouzi Adjed

Beyond temperature scaling: Obtaining well-calibrated multiclass probabilities with Dirichlet calibration

Class probabilities predicted by most multiclass classifiers are uncalibrated, often tending towards over-confidence. With neural networks, calibration can be improved by temperature scaling, a method to learn a single corrective…

Machine Learning · Computer Science 2019-10-29 Meelis Kull, Miquel Perello-Nieto, Markus Kängsepp, Telmo Silva Filho, Hao Song, Peter Flach

Enforcing Calibration in Multi-Output Probabilistic Regression with Pre-rank Regularization

Probabilistic models must be well calibrated to support reliable decision-making. While calibration in single-output regression is well studied, defining and achieving multivariate calibration in multi-output regression remains considerably…

Machine Learning · Statistics 2025-10-28 Naomi Desobry, Elnura Zhalieva, Souhaib Ben Taieb

A Study on the Calibration of In-context Learning

Accurate uncertainty quantification is crucial for the safe deployment of machine learning models, and prior research has demonstrated improvements in the calibration of modern language models (LMs). We study in-context learning (ICL), a…

Computation and Language · Computer Science 2024-03-29 Hanlin Zhang, Yi-Fan Zhang, Yaodong Yu, Dhruv Madeka, Dean Foster, Eric Xing, Himabindu Lakkaraju, Sham Kakade

Better Multi-class Probability Estimates for Small Data Sets

Many classification applications require accurate probability estimates in addition to good class separation but often classifiers are designed focusing only on the latter. Calibration is the process of improving probability estimates by…

Machine Learning · Computer Science 2020-01-31 Tuomo Alasalmi, Jaakko Suutala, Heli Koskimäki, Juha Röning

CLEAR: Calibrated Learning for Epistemic and Aleatoric Risk

Accurate uncertainty quantification is critical for reliable predictive modeling. Existing methods typically address either aleatoric uncertainty due to measurement noise or epistemic uncertainty resulting from limited data, but not both in…

Machine Learning · Statistics 2026-03-04 Ilia Azizi, Juraj Bodik, Jakob Heiss, Bin Yu

Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Recent works have shown that deep neural networks can achieve super-human performance in a wide range of image classification tasks in the medical imaging domain. However, these works have primarily focused on classification accuracy,…

Computer Vision and Pattern Recognition · Computer Science 2020-09-10 Gongbo Liang, Yu Zhang, Xiaoqin Wang, Nathan Jacobs

On Computationally Efficient Multi-Class Calibration

Consider a multi-class labelling problem, where the labels can take values in $[k]$, and a predictor predicts a distribution over the labels. In this work, we study the following foundational question: Are there notions of multi-class…

Machine Learning · Computer Science 2024-06-11 Parikshit Gopalan, Lunjia Hu, Guy N. Rothblum

Probability Calibration Trees

Obtaining accurate and well calibrated probability estimates from classifiers is useful in many applications, for example, when minimising the expected cost of classifications. Existing methods of calibrating probability estimates are…

Machine Learning · Computer Science 2018-09-17 Tim Leathart, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer

Modular Conformal Calibration

Uncertainty estimates must be calibrated (i.e., accurate) and sharp (i.e., informative) in order to be useful. This has motivated a variety of methods for recalibration, which use held-out data to turn an uncalibrated model into a…

Machine Learning · Computer Science 2022-07-06 Charles Marx, Shengjia Zhao, Willie Neiswanger, Stefano Ermon

Calibration tests in multi-class classification: A unifying framework

In safety-critical applications a probabilistic model is usually required to be calibrated, i.e., to capture the uncertainty of its predictions accurately. In multi-class classification, calibration of the most confident predictions only is…

Machine Learning · Statistics 2022-09-30 David Widmann, Fredrik Lindsten, Dave Zachariah

Stabilized Inverse Probability Weighting via Isotonic Calibration

Inverse weighting with an estimated propensity score is widely used by estimation methods in causal inference to adjust for confounding bias. However, directly inverting propensity score estimates can lead to instability, bias, and…

Methodology · Statistics 2025-04-11 Lars van der Laan, Ziming Lin, Marco Carone, Alex Luedtke

A Large-Scale Study of Probabilistic Calibration in Neural Network Regression

Accurate probabilistic predictions are essential for optimal decision making. While neural network miscalibration has been studied primarily in classification, we investigate this in the less-explored domain of regression. We conduct the…

Machine Learning · Computer Science 2023-06-08 Victor Dheur, Souhaib Ben Taieb

Measuring Calibration in Deep Learning

Overconfidence and underconfidence in machine learning classifiers is measured by calibration: the degree to which the probabilities predicted for each class match the accuracy of the classifier on that prediction. How one measures…

Machine Learning · Computer Science 2020-08-11 Jeremy Nixon, Mike Dusenberry, Ghassen Jerfel, Timothy Nguyen, Jeremiah Liu, Linchuan Zhang, Dustin Tran

Multiclass Calibration Assessment and Recalibration of Probability Predictions via the Linear Log Odds Calibration Function

Machine-generated probability predictions are essential in modern classification tasks such as image classification. A model is well calibrated when its predicted probabilities correspond to observed event frequencies. Despite the need for…

Machine Learning · Statistics 2026-02-24 Amy Vennos, Xin Xing, Christopher T. Franck