Related papers: Risk-based Calibration for Generative Classifiers

Minimax risk classifiers with 0-1 loss

Supervised classification techniques use training samples to learn a classification rule with small expected 0-1 loss (error probability). Conventional methods enable tractable learning and provide out-of-sample generalization by using…

Machine Learning · Statistics 2023-08-21 Santiago Mazuelas, Mauricio Romero, Peter Grünwald

Decision from Suboptimal Classifiers: Excess Risk Pre- and Post-Calibration

Probabilistic classifiers are central for making informed decisions under uncertainty. Based on the maximum expected utility principle, optimal decision rules can be derived using the posterior class probabilities and misclassification…

Machine Learning · Computer Science 2025-03-25 Alexandre Perez-Lebel, Gael Varoquaux, Sanmi Koyejo, Matthieu Doutreligne, Marine Le Morvan

An In-Depth Examination of Risk Assessment in Multi-Class Classification Algorithms

Advanced classification algorithms are being increasingly used in safety-critical applications like healthcare, engineering, etc. In such applications, misclassifications made by ML algorithms can result in substantial financial or…

Machine Learning · Computer Science 2024-12-06 Disha Ghandwani, Neeraj Sarna, Yuanyuan Li, Yang Lin

Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

We introduce a framework for calibrating machine learning models so that their predictions satisfy explicit, finite-sample statistical guarantees. Our calibration algorithms work with any underlying model and (unknown) data-generating…

Machine Learning · Computer Science 2022-10-03 Anastasios N. Angelopoulos, Stephen Bates, Emmanuel J. Candès, Michael I. Jordan, Lihua Lei

Minimax Classification with 0-1 Loss and Performance Guarantees

Supervised classification techniques use training samples to find classification rules with small expected 0-1 loss. Conventional methods achieve efficient learning and out-of-sample generalization by minimizing surrogate losses over…

Machine Learning · Statistics 2021-08-12 Santiago Mazuelas, Andrea Zanoni, Aritz Perez

Ranking-Aware Calibration for Reliable Multimodal Reinforcement Learning

Reinforcement learning post-training has substantially improved the reasoning accuracy of vision-language models, yet the resulting policies remain poorly calibrated. Terminal correctness rewards provide no gradient that penalizes confident…

Machine Learning · Computer Science 2026-05-19 Peng Cui, Boyao Yang, Jun Zhu

Cross-Validation Conformal Risk Control

Conformal risk control (CRC) is a recently proposed technique that applies post-hoc to a conventional point predictor to provide calibration guarantees. Generalizing conformal prediction (CP), with CRC, calibration is ensured for a set…

Machine Learning · Computer Science 2024-05-02 Kfir M. Cohen, Sangwoo Park, Osvaldo Simeone, Shlomo Shamai

Class-wise and reduced calibration methods

For many applications of probabilistic classifiers it is important that the predicted confidence vectors reflect true probabilities (one says that the classifier is calibrated). It has been shown that common models fail to satisfy this…

Machine Learning · Statistics 2022-10-10 Michael Panchenko, Anes Benmerzoug, Miguel de Benito Delgado

Supervised classification via minimax probabilistic transformations

Conventional techniques for supervised classification constrain the classification rules considered and use surrogate losses for classification 0-1 loss. Favored families of classification rules are those that enjoy parametric…

Machine Learning · Statistics 2019-06-03 Santiago Mazuelas, Andrea Zanoni, Aritz Perez

Class-Incremental Learning with Generative Classifiers

Incrementally training deep neural networks to recognize new classes is a challenging problem. Most existing class-incremental learning methods store data or use generative replay, both of which have drawbacks, while 'rehearsal-free'…

Machine Learning · Computer Science 2023-11-10 Gido M. van de Ven, Zhe Li, Andreas S. Tolias

Learning to explore when mistakes are not allowed

Goal-Conditioned Reinforcement Learning (GCRL) provides a versatile framework for developing unified controllers capable of handling wide ranges of tasks, exploring environments, and adapting behaviors. However, its reliance on…

Machine Learning · Computer Science 2025-02-20 Charly Pecqueux-Guézénec, Stéphane Doncieux, Nicolas Perrin-Gilbert

Better Classifier Calibration for Small Data Sets

Classifier calibration does not always go hand in hand with the classifier's ability to separate the classes. There are applications where good classifier calibration, i.e. the ability to produce accurate probability estimates, is more…

Machine Learning · Computer Science 2020-05-26 Tuomo Alasalmi, Jaakko Suutala, Heli Koskimäki, Juha Röning

Active Calibration of Reachable Sets Using Approximate Pick-to-Learn

Reachability computations that rely on learned or estimated models require calibration in order to uphold confidence about their guarantees. Calibration generally involves sampling scenarios inside the reachable set. However, producing…

Systems and Control · Electrical Eng. & Systems 2026-03-27 Sampada Deglurkar, Ebonye Smith, Jingqi Li, Claire J. Tomlin

Calibration by Distribution Matching: Trainable Kernel Calibration Metrics

Calibration ensures that probabilistic forecasts meaningfully capture uncertainty by requiring that predicted probabilities align with empirical frequencies. However, many existing calibration methods are specialized for post-hoc…

Machine Learning · Computer Science 2023-11-01 Charles Marx, Sofian Zalouk, Stefano Ermon

A Conformal Risk Control Framework for Granular Word Assessment and Uncertainty Calibration of CLIPScore Quality Estimates

This study explores current limitations of learned image captioning evaluation metrics, specifically the lack of granular assessments for errors within captions, and the reliance on single-point quality estimates without considering…

Computation and Language · Computer Science 2025-06-03 Gonçalo Gomes, Bruno Martins, Chrysoula Zerva

Risk-Sensitive Policy with Distributional Reinforcement Learning

Classical reinforcement learning (RL) techniques are generally concerned with the design of decision-making policies driven by the maximisation of the expected outcome. Nevertheless, this approach does not take into consideration the…

Machine Learning · Computer Science 2023-01-02 Thibaut Théate, Damien Ernst

Evaluating probabilistic classifiers: Reliability diagrams and score decompositions revisited

A probability forecast or probabilistic classifier is reliable or calibrated if the predicted probabilities are matched by ex post observed frequencies, as examined visually in reliability diagrams. The classical binning and counting…

Methodology · Statistics 2021-08-26 Timo Dimitriadis, Tilmann Gneiting, Alexander I. Jordan

Probabilistic Scores of Classifiers, Calibration is not Enough

In binary classification tasks, accurate representation of probabilistic predictions is essential for various real-world applications such as predicting payment defaults or assessing medical risks. The model must then be well-calibrated to…

Machine Learning · Computer Science 2024-08-08 Agathe Fernandes Machado, Arthur Charpentier, Emmanuel Flachaire, Ewen Gallic, François Hu

Rethinking Early Stopping: Refine, Then Calibrate

Machine learning classifiers often produce probabilistic predictions that are critical for accurate and interpretable decision-making in various domains. The quality of these predictions is generally evaluated with proper losses, such as…

Machine Learning · Computer Science 2025-06-26 Eugène Berta, David Holzmüller, Michael I. Jordan, Francis Bach

Regularized Classification-Aware Quantization

Traditionally, quantization is designed to minimize the reconstruction error of a data source. When considering downstream classification tasks, other measures of distortion can be of interest, such as the 0-1 classification loss.…

Machine Learning · Computer Science 2021-07-22 Daniel Severo, Elad Domanovitz, Ashish Khisti