Related papers: Estimating Classification Confidence Using Kernel …

Mix-n-Match: Ensemble and Compositional Methods for Uncertainty Calibration in Deep Learning

This paper studies the problem of post-hoc calibration of machine learning classifiers. We introduce the following desiderata for uncertainty calibration: (a) accuracy-preserving, (b) data-efficient, and (c) high expressive power. We show…

Machine Learning · Computer Science 2020-07-01 Jize Zhang , Bhavya Kailkhura , T. Yong-Jin Han

h-calibration: Rethinking Classifier Recalibration with Probabilistic Error-Bounded Objective

Deep neural networks have demonstrated remarkable performance across numerous learning tasks but often suffer from miscalibration, resulting in unreliable probability outputs. This has inspired many recent works on mitigating…

Machine Learning · Computer Science 2025-09-23 Wenjian Huang , Guiping Cao , Jiahao Xia , Jingkun Chen , Hao Wang , Jianguo Zhang

Meta-Cal: Well-controlled Post-hoc Calibration by Ranking

In many applications, it is desirable that a classifier not only makes accurate predictions, but also outputs calibrated posterior probabilities. However, many existing classifiers, especially deep neural network classifiers, tend to be…

Machine Learning · Statistics 2021-06-24 Xingchen Ma , Matthew B. Blaschko

Calibration by Distribution Matching: Trainable Kernel Calibration Metrics

Calibration ensures that probabilistic forecasts meaningfully capture uncertainty by requiring that predicted probabilities align with empirical frequencies. However, many existing calibration methods are specialized for post-hoc…

Machine Learning · Computer Science 2023-11-01 Charles Marx , Sofian Zalouk , Stefano Ermon

Heterogeneous Calibration: A post-hoc model-agnostic framework for improved generalization

We introduce the notion of heterogeneous calibration that applies a post-hoc model-agnostic transformation to model outputs for improving AUC performance on binary classification tasks. We consider overconfident models, whose performance is…

Machine Learning · Statistics 2022-02-11 David Durfee , Aman Gupta , Kinjal Basu

Calibration Meets Reality: Making Machine Learning Predictions Trustworthy

Post-hoc calibration methods are widely used to improve the reliability of probabilistic predictions from machine learning models. Despite their prevalence, a comprehensive theoretical understanding of these methods remains elusive,…

Machine Learning · Computer Science 2025-09-30 Kristina P. Sinaga , Arjun S. Nair

How to Fix a Broken Confidence Estimator: Evaluating Post-hoc Methods for Selective Classification with Deep Neural Networks

This paper addresses the problem of selective classification for deep neural networks, where a model is allowed to abstain from low-confidence predictions to avoid potential errors. We focus on so-called post-hoc methods, which replace the…

Machine Learning · Computer Science 2025-06-23 Luís Felipe P. Cattelan , Danilo Silva

Improving Predictor Reliability with Selective Recalibration

A reliable deep learning system should be able to accurately express its confidence with respect to its predictions, a quality known as calibration. One of the most effective ways to produce reliable confidence estimates with a pre-trained…

Machine Learning · Computer Science 2024-10-10 Thomas P. Zollo , Zhun Deng , Jake C. Snell , Toniann Pitassi , Richard Zemel

Measuring Calibration in Deep Learning

Overconfidence and underconfidence in machine learning classifiers is measured by calibration: the degree to which the probabilities predicted for each class match the accuracy of the classifier on that prediction. How one measures…

Machine Learning · Computer Science 2020-08-11 Jeremy Nixon , Mike Dusenberry , Ghassen Jerfel , Timothy Nguyen , Jeremiah Liu , Linchuan Zhang , Dustin Tran

Uncertainty-Aware Post-Hoc Calibration: Mitigating Confidently Incorrect Predictions Beyond Calibration Metrics

Despite extensive research on neural network calibration, existing methods typically apply global transformations that treat all predictions uniformly, overlooking the heterogeneous reliability of individual predictions. Furthermore, the…

Machine Learning · Computer Science 2025-10-22 Hassan Gharoun , Mohammad Sadegh Khorshidi , Kasra Ranjbarigderi , Fang Chen , Amir H. Gandomi

Post-hoc Models for Performance Estimation of Machine Learning Inference

Estimating how well a machine learning model performs during inference is critical in a variety of scenarios (for example, to quantify uncertainty, or to choose from a library of available models). However, the standard accuracy estimate of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-07 Xuechen Zhang , Samet Oymak , Jiasi Chen

Instance-Wise Monotonic Calibration by Constrained Transformation

Deep neural networks often produce miscalibrated probability estimates, leading to overconfident predictions. A common approach for calibration is fitting a post-hoc calibration map on unseen validation data that transforms predicted…

Machine Learning · Computer Science 2025-07-10 Yunrui Zhang , Gustavo Batista , Salil S. Kanhere

Domain-adaptive and Subgroup-specific Cascaded Temperature Regression for Out-of-distribution Calibration

Although deep neural networks yield high classification accuracy given sufficient training data, their predictions are typically overconfident or under-confident, i.e., the prediction confidences cannot truly reflect the accuracy. Post-hoc…

Computer Vision and Pattern Recognition · Computer Science 2024-02-15 Jiexin Wang , Jiahao Chen , Bing Su

Probabilistic Calibration by Design for Neural Network Regression

Generating calibrated and sharp neural network predictive distributions for regression problems is essential for optimal decision-making in many real-world applications. To address the miscalibration issue of neural networks, various…

Machine Learning · Computer Science 2024-03-19 Victor Dheur , Souhaib Ben Taieb

CalArena: A Large-Scale Post-Hoc Calibration Benchmark

Reliable probability estimates are critical in many machine learning applications, yet modern classifiers are often poorly calibrated. Post-hoc calibration provides a simple and widely used solution, but the large number of proposed…

Machine Learning · Computer Science 2026-05-29 Eugène Berta , David Holzmüller , Francis Bach , Michael I. Jordan

Post-hoc Uncertainty Calibration for Domain Drift Scenarios

We address the problem of uncertainty calibration. While standard deep neural networks typically yield uncalibrated predictions, calibrated confidence scores that are representative of the true likelihood of a prediction can be achieved…

Machine Learning · Computer Science 2021-06-24 Christian Tomani , Sebastian Gruber , Muhammed Ebrar Erdem , Daniel Cremers , Florian Buettner

Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection

Despite their impressive predictive performance in various computer vision tasks, deep neural networks (DNNs) tend to make overly confident predictions, which hinders their widespread use in safety-critical applications. While there have…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Teodora Popordanoska , Aleksei Tiulpin , Matthew B. Blaschko

Calibration tests beyond classification

Most supervised machine learning tasks are subject to irreducible prediction errors. Probabilistic predictive models address this limitation by providing probability distributions that represent a belief over plausible targets, rather than…

Machine Learning · Statistics 2022-10-25 David Widmann , Fredrik Lindsten , Dave Zachariah

Classifier Calibration: A survey on how to assess and improve predicted class probabilities

This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its…

Machine Learning · Computer Science 2023-06-16 Telmo Silva Filho , Hao Song , Miquel Perello-Nieto , Raul Santos-Rodriguez , Meelis Kull , Peter Flach

Where are we with calibration under dataset shift in image classification?

We conduct an extensive study on the state of calibration under real-world dataset shift for image classification. Our work provides important insights on the choice of post-hoc and in-training calibration techniques, and yields practical…

Computer Vision and Pattern Recognition · Computer Science 2025-10-23 Mélanie Roschewitz , Raghav Mehta , Fabio de Sousa Ribeiro , Ben Glocker