Related papers: Predicting Failures of Point Forecasts

Forecaster's Dilemma: Extreme Events and Forecast Evaluation

In public discussions of the quality of forecasts, attention typically focuses on the predictive performance in cases of extreme events. However, the restriction of conventional forecast evaluation methods to subsets of extreme observations…

Methodology · Statistics 2016-01-01 Sebastian Lerch, Thordis L. Thorarinsdottir, Francesco Ravazzolo, Tilmann Gneiting

Evaluating Weather Forecasts from a Decision Maker's Perspective

Standard weather forecast evaluations focus on the forecaster's perspective and on a statistical assessment comparing forecasts and observations. In practice, however, forecasts are used to make decisions, so it seems natural to take the…

Machine Learning · Computer Science 2025-12-18 Kornelius Raeth, Nicole Ludwig

Enabling electronic prognostics using thermal data

Prognostics is a process of assessing the extent of deviation or degradation of a product from its expected normal operating condition, and then, based on continuous monitoring, predicting the future reliability of the product. By being…

Materials Science · Physics 2007-09-13 N. Vichare, M. Pecht

Skill of data based predictions versus dynamical models -- case study on extreme temperature anomalies

We compare probabilistic predictions of extreme temperature anomalies issued by two different forecast schemes. One is a dynamical physical weather model, the other a simple data model. We recall the concept of skill scores in order to…

Applications · Statistics 2013-12-17 Stefan Siegert, Jochen Broecker, Holger Kantz

Uniform reliability tests for forecasting systems with small lead time

A long noted difficulty when assessing the reliability (or calibration) of forecasting systems is that reliability, in general, is a hypothesis not about a finite dimensional parameter but about an entire functional relationship. A…

Data Analysis, Statistics and Probability · Physics 2020-12-09 Jochen Bröcker

Beyond the Norms: Detecting Prediction Errors in Regression Models

This paper tackles the challenge of detecting unreliable behavior in regression algorithms, which may arise from intrinsic variability (e.g., aleatoric uncertainty) or modeling errors (e.g., model uncertainty). First, we formally introduce…

Machine Learning · Computer Science 2024-06-12 Andres Altieri, Marco Romanelli, Georg Pichler, Florence Alberge, Pablo Piantanida

Calibration through the Lens of Indistinguishability

Calibration is a classical notion from the forecasting literature which aims to address the question: how should predicted probabilities be interpreted? In a world where we only get to observe (discrete) outcomes, how should we evaluate a…

Machine Learning · Computer Science 2025-09-03 Parikshit Gopalan, Lunjia Hu

Variance estimation for Brier Score decomposition

The Brier Score is a widely-used criterion to assess the quality of probabilistic predictions of binary events. The expectation value of the Brier Score can be decomposed into the sum of three components called reliability, resolution, and…

Methodology · Statistics 2014-01-03 Stefan Siegert

"Calibeating": Beating Forecasters at Their Own Game

In order to identify expertise, forecasters should not be tested by their calibration score, which can always be made arbitrarily small, but rather by their Brier score. The Brier score is the sum of the calibration score and the refinement…

Theoretical Economics · Economics 2026-03-20 Dean P. Foster, Sergiu Hart

Truthful Elicitation of Imprecise Forecasts

The quality of probabilistic forecasts is crucial for decision-making under uncertainty. While proper scoring rules incentivize truthful reporting of precise forecasts, they fall short when forecasters face epistemic uncertainty about their…

Machine Learning · Computer Science 2025-07-18 Anurag Singh, Siu Lun Chau, Krikamol Muandet

Evaluating software defect prediction performance: an updated benchmarking study

Accurately predicting faulty software units helps practitioners target faulty units and prioritize their efforts to maintain software quality. Prior studies use machine-learning models to detect faulty software code. We revisit past studies…

Software Engineering · Computer Science 2019-01-08 Libo Li, Stefan Lessmann, Bart Baesens

Non-probabilistic odds and forecasting with imperfect models

Probability forecasts are intended to account for the uncertainties inherent in forecasting. It is suggested that from an end-user's point of view probability is not necessarily sufficient to reflect uncertainties that are not simply the…

Statistics Theory · Mathematics 2015-01-22 Kevin Judd

From Classification Accuracy to Proper Scoring Rules: Elicitability of Probabilistic Top List Predictions

In the face of uncertainty, the need for probabilistic assessments has long been recognized in the literature on forecasting. In classification, however, comparative evaluation of classifiers often focuses on predictions specifying a single…

Methodology · Statistics 2023-05-31 Johannes Resin

Cross-calibration of probabilistic forecasts

When providing probabilistic forecasts for uncertain future events, it is common to strive for calibrated forecasts, that is, the predictive distribution should be compatible with the observed outcomes. Several notions of calibration are…

Methodology · Statistics 2015-05-21 Christof Strähl, Johanna F. Ziegel

Interval Predictability in Discrete Event Systems

In this paper we study the problem of predictability in partially observable discrete event systems, i.e., the question whether an observer can predict the occurrence of a fault. We extend the definition of predictability to consider the…

Systems and Control · Computer Science 2015-08-05 Alban Grastien

Comparison of Uncertainty Quantification with Deep Learning in Time Series Regression

Increasingly high-stakes decisions are made using neural networks in order to make predictions. Specifically, meteorologists and hedge funds apply these techniques to time series data. When it comes to prediction, there are certain…

Machine Learning · Computer Science 2022-11-14 Levente Foldesi, Matias Valdenegro-Toro

On the Unknowable Limits to Prediction

We propose a rigorous decomposition of predictive error, highlighting that not all 'irreducible' error is genuinely immutable. Many domains stand to benefit from iterative enhancements in measurement, construct validity, and modeling. Our…

Machine Learning · Computer Science 2025-02-12 Jiani Yan, Charles Rahal

Calibrating Bayesian UNet++ for Sub-Seasonal Forecasting

Seasonal forecasting is a crucial task when it comes to detecting the extreme heat and colds that occur due to climate change. Confidence in the predictions should be reliable since a small increase in the temperatures in a year has a big…

Machine Learning · Computer Science 2024-04-05 Busra Asan, Abdullah Akgül, Alper Unal, Melih Kandemir, Gozde Unal

Making Early Predictions of the Accuracy of Machine Learning Applications

The accuracy of machine learning systems is a widely studied research topic. Established techniques such as cross-validation predict the accuracy on unseen data of the classifier produced by applying a given learning method to a given…

Machine Learning · Computer Science 2012-12-06 J. E. Smith, P. Caleb-Solly, M. A. Tahir, D. Sannen, H. van-Brussel

Estimation of Accurate and Calibrated Uncertainties in Deterministic models

In this paper we focus on the problem of assigning uncertainties to single-point predictions generated by a deterministic model that outputs a continuous variable. This problem applies to any state-of-the-art physics or engineering models…

Machine Learning · Statistics 2020-03-12 Enrico Camporeale, Algo Carè