Related papers: Calibration Error Estimation Using Fuzzy Binning

Mitigating Bias in Calibration Error Estimation

For an AI system to be reliable, the confidence it expresses in its decisions must match its accuracy. To assess the degree of match, examples are typically binned by confidence and the per-bin mean confidence and accuracy are compared.…

Machine Learning · Computer Science 2022-02-14 Rebecca Roelofs, Nicholas Cain, Jonathon Shlens, Michael C. Mozer

TCE: A Test-Based Approach to Measuring Calibration Error

This paper proposes a new metric to measure the calibration error of probabilistic binary classifiers, called test-based calibration error (TCE). TCE incorporates a novel loss function based on a statistical test to examine the extent to…

Machine Learning · Statistics 2023-06-27 Takuo Matsubara, Niek Tax, Richard Mudd, Ido Guy

Properties of the ENCE and other MAD-based calibration metrics

The Expected Normalized Calibration Error (ENCE) is a popular calibration statistic used in Machine Learning to assess the quality of prediction uncertainties for regression problems. Estimation of the ENCE is based on the binning of…

Machine Learning · Computer Science 2023-05-23 Pascal Pernot

Soft Calibration Objectives for Neural Networks

Optimal decision making requires that classifiers produce uncertainty estimates consistent with their empirical accuracy. However, deep neural networks are often under- or over-confident in their predictions. Consequently, methods have been…

Machine Learning · Computer Science 2021-12-08 Archit Karandikar, Nicholas Cain, Dustin Tran, Balaji Lakshminarayanan, Jonathon Shlens, Michael C. Mozer, Becca Roelofs

Quantifying Calibration Error in Neural Networks Through Evidence-Based Theory

Trustworthiness in neural networks is crucial for their deployment in critical applications, where reliability, confidence, and uncertainty play pivotal roles in decision-making. Traditional performance metrics such as accuracy and…

Machine Learning · Computer Science 2025-09-05 Koffi Ismael Ouattara, Ioannis Krontiris, Theo Dimitrakos, Frank Kargl

Estimating Expected Calibration Errors

Uncertainty in probabilistic classifiers' predictions is a key concern when models are used to support human decision making, in broader probabilistic pipelines or when sensitive automatic decisions have to be taken. Studies have shown that…

Machine Learning · Computer Science 2021-09-09 Nicolas Posocco, Antoine Bonnefoy

Beyond Probability Partitions: Calibrating Neural Networks with Semantic Aware Grouping

Research has shown that deep networks tend to be overly optimistic about their predictions, leading to an underestimation of prediction errors. Due to the limited nature of data, existing studies have proposed various methods based on model…

Machine Learning · Computer Science 2023-10-24 Jia-Qi Yang, De-Chuan Zhan, Le Gan

On Calibrated Model Uncertainty in Deep Learning

Uncertainty estimated by approximate posteriors in Bayesian neural networks is prone to miscalibration, which leads to overconfident predictions in critical tasks that have a clear asymmetric cost or significant losses. Here, we extend the…

Machine Learning · Computer Science 2022-06-17 Biraja Ghoshal, Allan Tucker

Assessing the Probabilistic Fit of Neural Regressors via Conditional Congruence

While significant progress has been made in specifying neural networks capable of representing uncertainty, deep networks still often suffer from overconfidence and misaligned predictive distributions. Existing approaches for measuring this…

Machine Learning · Computer Science 2025-10-24 Spencer Young, Riley Sinema, Cole Edgren, Andrew Hall, Nathan Dong, Porter Jenkins

Information-theoretic Generalization Analysis for Expected Calibration Error

While the expected calibration error (ECE), which employs binning, is widely adopted to evaluate the calibration performance of machine learning models, theoretical understanding of its estimation bias is limited. In this paper, we present…

Machine Learning · Computer Science 2025-05-27 Futoshi Futami, Masahiro Fujisawa

Soft Mean Expected Calibration Error (SMECE): A Calibration Metric for Probabilistic Labels

The Expected Calibration Error (ECE), the dominant calibration metric in machine learning, compares predicted probabilities against empirical frequencies of binary outcomes. This is appropriate when labels are binary events. However, many…

Machine Learning · Computer Science 2026-03-17 Michael Leznik

On Deep Neural Network Calibration by Regularization and its Impact on Refinement

Deep neural networks have been shown to be highly miscalibrated; often they tend to be overconfident in their predictions. This poses a significant challenge for safety-critical systems that must utilise deep neural networks (DNNs) reliably. Many…

Machine Learning · Computer Science 2022-05-05 Aditya Singh, Alessandro Bay, Biswa Sengupta, Andrea Mirabile

Bayesian Confidence Calibration for Epistemic Uncertainty Modelling

Modern neural networks have been found to be miscalibrated in terms of confidence calibration, i.e., their predicted confidence scores do not reflect the observed accuracy or precision. Recent work has introduced methods for post-hoc confidence…

Computer Vision and Pattern Recognition · Computer Science 2021-09-22 Fabian Küppers, Jan Kronenberger, Jonas Schneider, Anselm Haselhoff

Confidence Calibration and Predictive Uncertainty Estimation for Deep Medical Image Segmentation

Fully convolutional neural networks (FCNs), and in particular U-Nets, have achieved state-of-the-art results in semantic segmentation for numerous medical imaging applications. Moreover, batch normalization and Dice loss have been used…

Image and Video Processing · Electrical Eng. & Systems 2020-07-06 Alireza Mehrtash, William M. Wells, Clare M. Tempany, Purang Abolmaesumi, Tina Kapur

Clustered Calibration: Representation-Aware Probability Calibration via Learned Subpopulations

Ensuring that predicted probabilities align with observed frequencies is critical in high-stakes domains such as clinical decision support, autonomous driving and financial risk assessment. Existing calibration methods typically apply a…

Machine Learning · Computer Science 2026-05-26 Tomer Lavi, Bracha Shapira, Nadav Rappoport

Extending confidence calibration to generalised measures of variation

We propose the Variation Calibration Error (VCE) metric for assessing the calibration of machine learning classifiers. The metric can be viewed as an extension of the well-known Expected Calibration Error (ECE) which assesses the…

Machine Learning · Computer Science 2026-02-16 Andrew Thompson, Vivek Desai

Optimizing Calibration by Gaining Aware of Prediction Correctness

Model calibration aims to align confidence with prediction correctness. The Cross-Entropy (CE) loss is widely used for calibrator training, which enforces the model to increase confidence on the ground truth class. However, we find the CE…

Computer Vision and Pattern Recognition · Computer Science 2025-02-13 Yuchi Liu, Lei Wang, Yuli Zou, James Zou, Liang Zheng

Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image Classification

This paper investigates novel classifier ensemble techniques for uncertainty calibration applied to various deep neural networks for image classification. We evaluate both accuracy and calibration metrics, focusing on Expected Calibration…

Computer Vision and Pattern Recognition · Computer Science 2025-01-20 Michael Schulze, Nikolas Ebert, Laurenz Reichardt, Oliver Wasenmüller

Combining Priors with Experience: Confidence Calibration Based on Binomial Process Modeling

Confidence calibration of classification models is a technique to estimate the true posterior probability of the predicted class, which is critical for ensuring reliable decision-making in practical applications. Existing confidence…

Methodology · Statistics 2025-02-19 Jinzong Dong, Zhaohui Jiang, Dong Pan, Haoyang Yu

Measuring Calibration in Deep Learning

Overconfidence and underconfidence in machine learning classifiers are measured by calibration: the degree to which the probabilities predicted for each class match the accuracy of the classifier on that prediction. How one measures…

Machine Learning · Computer Science 2020-08-11 Jeremy Nixon, Mike Dusenberry, Ghassen Jerfel, Timothy Nguyen, Jeremiah Liu, Linchuan Zhang, Dustin Tran