Related papers: Backtesting forecast accuracy

Using the rejection sampling for finding tests

A new method based on the rejection sampling for finding statistical tests is proposed. This method is conceptually intuitive, easy to implement, and applicable for arbitrary dimension. To illustrate its potential applicability, three…

Methodology · Statistics 2026-03-11 Markku Kuismin

A Simple Lack-of-Fit Test for Regression Models

A simple test is proposed for examining the correctness of a given completely specified response function against unspecified general alternatives in the context of univariate regression. The usual diagnostic tools based on residuals plots…

Methodology · Statistics 2010-04-27 Jean-Baptiste Aubin, Samuela Leoni-Aubin

Spectral backtests of forecast distributions with application to risk management

We study a class of backtests for forecast distributions in which the test statistic depends on a spectral transformation that weights exceedance events by a function of the modeled probability level. The weighting scheme is specified by a…

Risk Management · Quantitative Finance 2019-07-30 Michael B. Gordy, Alexander J. McNeil

Testing against a linear regression model using ideas from shape-restricted estimation

A formal likelihood ratio hypothesis test for the validity of a parametric regression function is proposed, using a large-dimensional, nonparametric double cone alternative. For example, the test against a constant function uses the…

Methodology · Statistics 2014-06-30 Bodhisattva Sen, Mary Meyer

Bayesian Meta-Reasoning: Determining Model Adequacy from Within a Small World

This paper presents a Bayesian framework for assessing the adequacy of a model without the necessity of explicitly enumerating a specific alternate model. A test statistic is developed for tracking the performance of the model across…

Artificial Intelligence · Computer Science 2013-03-25 Kathryn Blackmond Laskey

Goodness-of-fit Testing in Linear Regression Models

Model checking plays an important role in linear regression as model misspecification seriously affects the validity and efficiency of regression analysis. In practice, model checking is often performed by subjectively evaluating the plot…

Statistics Theory · Mathematics 2019-11-19 Rok Blagus, Jakob Peterlin, Janez Stare

A Goodness-of-Fit Test for Statistical Models

Statistical modeling plays a fundamental role in understanding the underlying mechanism of massive data (statistical inference) and predicting the future (statistical prediction). Although all models are wrong, researchers try their best to…

Methodology · Statistics 2020-06-17 Hangjin Jiang

Elicitability and backtesting: Perspectives for banking regulation

Conditional forecasts of risk measures play an important role in internal risk management of financial institutions as well as in regulatory capital calculations. In order to assess forecasting performance of a risk measurement procedure,…

Risk Management · Quantitative Finance 2017-02-22 Natalia Nolde, Johanna F. Ziegel

Significance testing without truth

A popular approach to significance testing proposes to decide whether the given hypothesized statistical model is likely to be true (or false). Statistical decision theory provides a basis for this approach by requiring every significance…

Methodology · Statistics 2013-01-08 William Perkins, Mark Tygert, Rachel Ward

Estimating and evaluating counterfactual prediction models

Counterfactual prediction methods are required when a model will be deployed in a setting where treatment policies differ from the setting where the model was developed, or when a model provides predictions under hypothetical interventions…

Methodology · Statistics 2025-08-13 Christopher B. Boyer, Issa J. Dahabreh, Jon A. Steingrimsson

Pursuing a Prospective Perspective

Retrospective testing of predictive models does not consider the real-world context in which models are deployed. Prospective validation, on the other hand, enables meaningful comparisons between data generation processes by incorporating…

Machine Learning · Computer Science 2020-11-19 Steven Kearnes

Assessment of Point Process Models for Earthquake Forecasting

Models for forecasting earthquakes are currently tested prospectively in well-organized testing centers, using data collected after the models and their parameters are completely specified. The extent to which these models agree with the…

Methodology · Statistics 2013-12-23 Andrew Bray, Frederic Paik Schoenberg

A Prediction Model for System Testing Defects using Regression Analysis

This research describes the initial effort of building a prediction model for defects in system testing carried out by an independent testing team. The motivation to have such a defect prediction model is to serve as an early quality indicator…

Software Engineering · Computer Science 2014-01-24 Muhammad Dhiauddin Mohamed Suffian, Suhaimi Ibrahim

Proving prediction prudence

We study how to perform tests on samples of pairs of observations and predictions in order to assess whether or not the predictions are prudent. Prudence requires that the mean of the difference of the observation-prediction pairs can…

Risk Management · Quantitative Finance 2022-10-03 Dirk Tasche

On Determining the Distribution of a Goodness-of-Fit Test Statistic

We consider the problem of goodness-of-fit testing for a model that has at least one unknown parameter that cannot be eliminated by transformation. Examples of such problems can be as simple as testing whether a sample consists of…

Methodology · Statistics 2021-04-28 Sean van der Merwe

Test Model Coverage Analysis under Uncertainty

In model-based testing (MBT) we may have to deal with a non-deterministic model, e.g. because abstraction was applied, or because the software under test itself is non-deterministic. The same test case may then trigger multiple possible…

Software Engineering · Computer Science 2019-09-13 I. S. W. B. Prasetya, Rick Klomp

Comparative e-backtests for general risk measures

Backtesting risk measures is a central task in financial regulation. While standard backtests evaluate whether a forecasting model is statistically consistent with observed losses, regulatory practice often requires assessing the…

Methodology · Statistics 2026-03-06 Zhanyi Jiao, Qiuqi Wang, Yimiao Zhao

Pre-validation Revisited

Pre-validation is a way to build a prediction model with two datasets of significantly different feature dimensions. Previous work showed that the asymptotic distribution of the resulting test statistic for the pre-validated predictor…

Methodology · Statistics 2025-05-23 Jing Shang, Sourav Chatterjee, Trevor Hastie, Robert Tibshirani

Testing for changes in polynomial regression

We consider a nonlinear polynomial regression model in which we wish to test the null hypothesis of structural stability in the regression parameters against the alternative of a break at an unknown time. We derive the extreme value…

Statistics Theory · Mathematics 2008-10-23 Alexander Aue, Lajos Horváth, Marie Hušková, Piotr Kokoszka

Testing convex hypotheses on the mean of a Gaussian vector. Application to testing qualitative hypotheses on a regression function

In this paper we propose a general methodology, based on multiple testing, for testing that the mean of a Gaussian vector in R^n belongs to a convex set. We show that the test achieves its nominal level, and characterize a class of vectors…

Statistics Theory · Mathematics 2007-06-13 Yannick Baraud, Sylvie Huet, Beatrice Laurent