English
Related papers

Related papers: Calibration Error Estimation Using Fuzzy Binning

200 papers

For an AI system to be reliable, the confidence it expresses in its decisions must match its accuracy. To assess the degree of match, examples are typically binned by confidence and the per-bin mean confidence and accuracy are compared.…

Machine Learning · Computer Science 2022-02-14 Rebecca Roelofs , Nicholas Cain , Jonathon Shlens , Michael C. Mozer

This paper proposes a new metric to measure the calibration error of probabilistic binary classifiers, called test-based calibration error (TCE). TCE incorporates a novel loss function based on a statistical test to examine the extent to…

Machine Learning · Statistics 2023-06-27 Takuo Matsubara , Niek Tax , Richard Mudd , Ido Guy

The Expected Normalized Calibration Error (ENCE) is a popular calibration statistic used in Machine Learning to assess the quality of prediction uncertainties for regression problems. Estimation of the ENCE is based on the binning of…

Machine Learning · Computer Science 2023-05-23 Pascal Pernot

Optimal decision making requires that classifiers produce uncertainty estimates consistent with their empirical accuracy. However, deep neural networks are often under- or over-confident in their predictions. Consequently, methods have been…

Trustworthiness in neural networks is crucial for their deployment in critical applications, where reliability, confidence, and uncertainty play pivotal roles in decision-making. Traditional performance metrics such as accuracy and…

Machine Learning · Computer Science 2025-09-05 Koffi Ismael Ouattara , Ioannis Krontiris , Theo Dimitrakos , Frank Kargl

Uncertainty in probabilistic classifiers predictions is a key concern when models are used to support human decision making, in broader probabilistic pipelines or when sensitive automatic decisions have to be taken. Studies have shown that…

Machine Learning · Computer Science 2021-09-09 Nicolas Posocco , Antoine Bonnefoy

Research has shown that deep networks tend to be overly optimistic about their predictions, leading to an underestimation of prediction errors. Due to the limited nature of data, existing studies have proposed various methods based on model…

Machine Learning · Computer Science 2023-10-24 Jia-Qi Yang , De-Chuan Zhan , Le Gan

Estimated uncertainty by approximate posteriors in Bayesian neural networks are prone to miscalibration, which leads to overconfident predictions in critical tasks that have a clear asymmetric cost or significant losses. Here, we extend the…

Machine Learning · Computer Science 2022-06-17 Biraja Ghoshal , Allan Tucker

While significant progress has been made in specifying neural networks capable of representing uncertainty, deep networks still often suffer from overconfidence and misaligned predictive distributions. Existing approaches for measuring this…

Machine Learning · Computer Science 2025-10-24 Spencer Young , Riley Sinema , Cole Edgren , Andrew Hall , Nathan Dong , Porter Jenkins

While the expected calibration error (ECE), which employs binning, is widely adopted to evaluate the calibration performance of machine learning models, theoretical understanding of its estimation bias is limited. In this paper, we present…

Machine Learning · Computer Science 2025-05-27 Futoshi Futami , Masahiro Fujisawa

The Expected Calibration Error (ece), the dominant calibration metric in machine learning, compares predicted probabilities against empirical frequencies of binary outcomes. This is appropriate when labels are binary events. However, many…

Machine Learning · Computer Science 2026-03-17 Michael Leznik

Deep neural networks have been shown to be highly miscalibrated. often they tend to be overconfident in their predictions. It poses a significant challenge for safety-critical systems to utilise deep neural networks (DNNs), reliably. Many…

Machine Learning · Computer Science 2022-05-05 Aditya Singh , Alessandro Bay , Biswa Sengupta , Andrea Mirabile

Modern neural networks have found to be miscalibrated in terms of confidence calibration, i.e., their predicted confidence scores do not reflect the observed accuracy or precision. Recent work has introduced methods for post-hoc confidence…

Computer Vision and Pattern Recognition · Computer Science 2021-09-22 Fabian Küppers , Jan Kronenberger , Jonas Schneider , Anselm Haselhoff

Fully convolutional neural networks (FCNs), and in particular U-Nets, have achieved state-of-the-art results in semantic segmentation for numerous medical imaging applications. Moreover, batch normalization and Dice loss have been used…

Image and Video Processing · Electrical Eng. & Systems 2020-07-06 Alireza Mehrtash , William M. Wells , Clare M. Tempany , Purang Abolmaesumi , Tina Kapur

Ensuring that predicted probabilities align with observed frequencies is critical in high-stakes domains such as clinical decision support, autonomous driving and financial risk assessment. Existing calibration methods typically apply a…

Machine Learning · Computer Science 2026-05-26 Tomer Lavi , Bracha Shapira , Nadav Rappoport

We propose the Variation Calibration Error (VCE) metric for assessing the calibration of machine learning classifiers. The metric can be viewed as an extension of the well-known Expected Calibration Error (ECE) which assesses the…

Machine Learning · Computer Science 2026-02-16 Andrew Thompson , Vivek Desai

Model calibration aims to align confidence with prediction correctness. The Cross-Entropy (CE) loss is widely used for calibrator training, which enforces the model to increase confidence on the ground truth class. However, we find the CE…

Computer Vision and Pattern Recognition · Computer Science 2025-02-13 Yuchi Liu , Lei Wang , Yuli Zou , James Zou , Liang Zheng

This paper investigates novel classifier ensemble techniques for uncertainty calibration applied to various deep neural networks for image classification. We evaluate both accuracy and calibration metrics, focusing on Expected Calibration…

Computer Vision and Pattern Recognition · Computer Science 2025-01-20 Michael Schulze , Nikolas Ebert , Laurenz Reichardt , Oliver Wasenmüller

Confidence calibration of classification models is a technique to estimate the true posterior probability of the predicted class, which is critical for ensuring reliable decision-making in practical applications. Existing confidence…

Methodology · Statistics 2025-02-19 Jinzong Dong , Zhaohui Jiang , Dong Pan , Haoyang Yu

Overconfidence and underconfidence in machine learning classifiers is measured by calibration: the degree to which the probabilities predicted for each class match the accuracy of the classifier on that prediction. How one measures…

Machine Learning · Computer Science 2020-08-11 Jeremy Nixon , Mike Dusenberry , Ghassen Jerfel , Timothy Nguyen , Jeremiah Liu , Linchuan Zhang , Dustin Tran
‹ Prev 1 2 3 10 Next ›