Related papers: Valid sequential inference on probability forecast…

Sequentially valid tests for forecast calibration

Forecasting and forecast evaluation are inherently sequential tasks. Predictions are often issued on a regular basis, such as every hour, day, or month, and their quality is monitored continuously. However, the classical statistical tools…

Methodology · Statistics 2022-07-04 Sebastian Arnold , Alexander Henzi , Johanna F. Ziegel

E-Values Expand the Scope of Conformal Prediction

Conformal prediction is a powerful framework for distribution-free uncertainty quantification. The standard approach to conformal prediction relies on comparing the ranks of prediction scores: under exchangeability, the rank of a future…

Machine Learning · Statistics 2025-05-07 Etienne Gauthier , Francis Bach , Michael I. Jordan

True and false discoveries with independent and sequential e-values

In this paper we use e-values in the context of multiple hypothesis testing assuming that the base tests produce independent, or sequential, e-values. Our simulation and empirical studies and theoretical considerations suggest that, under…

Methodology · Statistics 2024-08-14 Vladimir Vovk , Ruodu Wang

E-values as statistical evidence: A comparison to Bayes factors, likelihoods, and p-values

A recurring debate in the philosophy of statistics concerns what, exactly, should count as a measure of evidence for or against a given hypothesis. P-values, likelihood ratios, and Bayes factors all have their defenders. In this paper we…

Methodology · Statistics 2026-03-26 Ben Chugg , Aaditya Ramdas , Peter Grünwald

Prediction-Powered E-Values

Quality statistical inference requires a sufficient amount of data, which can be missing or hard to obtain. To this end, prediction-powered inference has risen as a promising methodology, but existing approaches are largely limited to…

Machine Learning · Statistics 2025-05-27 Daniel Csillag , Claudio José Struchiner , Guilherme Tegoni Goedert

E-values: Calibration, combination, and applications

Multiple testing of a single hypothesis and testing multiple hypotheses are usually done in terms of p-values. In this paper we replace p-values with their natural competitor, e-values, which are closely related to betting, Bayes factors,…

Statistics Theory · Mathematics 2021-10-26 Vladimir Vovk , Ruodu Wang

Confidence and discoveries with e-values

We discuss systematically two versions of confidence regions: those based on p-values and those based on e-values, a recent alternative to p-values. Both versions can be applied to multiple hypothesis testing, and in this paper we are…

Statistics Theory · Mathematics 2024-03-05 Vladimir Vovk , Ruodu Wang

Sequential Scoring Rule Evaluation for Forecast Method Selection

This paper shows that sequential statistical analysis techniques can be generalised to the problem of selecting between alternative forecasting methods using scoring rules. A return to basic principles is necessary in order to show that…

Statistics Theory · Mathematics 2025-05-15 David T. Frazier , Donald S. Poskitt

Conformal e-prediction

This paper discusses a counterpart of conformal prediction for e-values, conformal e-prediction. Conformal e-prediction is conceptually simpler and had been developed in the 1990s as a precursor of conformal prediction. When conformal…

Machine Learning · Computer Science 2025-05-20 Vladimir Vovk

P-value: A Bless or A Curse for Evidence-Based Studies?

As a convention, p-value is often computed in frequentist hypothesis testing and compared with the nominal significance level of 0.05 to determine whether or not to reject the null hypothesis. The smaller the p-value, the more significant…

Methodology · Statistics 2020-02-25 Haolun Shi , Guosheng Yin

Multiple Testing in Generalized Universal Inference

Compared to p-values, e-values provably guarantee safe, valid inference. If the goal is to test multiple hypotheses simultaneously, one can construct e-values for each individual test and then use the recently developed e-BH procedure to…

Methodology · Statistics 2024-12-03 Neil Dey , Ryan Martin , Jonathan P. Williams

Beyond Neyman-Pearson: e-values enable hypothesis testing with a data-driven alpha

A standard practice in statistical hypothesis testing is to mention the p-value alongside the accept/reject decision. We show the advantages of mentioning an e-value instead. With p-values, it is not clear how to use an extreme observation…

Methodology · Statistics 2024-04-04 Peter Grünwald

Always Valid Inference: Bringing Sequential Analysis to A/B Testing

A/B tests are typically analyzed via frequentist p-values and confidence intervals; but these inferences are wholly unreliable if users endogenously choose samples sizes by *continuously monitoring* their tests. We define *always valid*…

Statistics Theory · Mathematics 2019-07-18 Ramesh Johari , Leo Pekelis , David J. Walsh

Conformal e-testing

There is a useful counterpart of conformal prediction for e-values, called conformal e-prediction. Conformal prediction can serve as basis for testing the assumption of exchangeability, leading to conformal testing. Similarly, conformal…

Statistics Theory · Mathematics 2024-11-05 Vladimir Vovk , Ilia Nouretdinov , Alex Gammerman

Exact Anytime-valid Confidence Intervals for Contingency Tables and Beyond

E-variables are tools for retaining type-I error guarantee with optional stopping. We extend E-variables for sequential two-sample tests to general null hypotheses and anytime-valid confidence sequences. We provide implementations for…

Methodology · Statistics 2022-06-27 Rosanne Turner , Peter Grünwald

e-Values for Real-Time Residential Electricity Demand Forecast Model Selection

With the growing number of forecasting techniques and the increasing significance of forecast-based operation - particularly in the rapidly evolving energy sector - selecting the most effective forecasting model has become a critical task.…

Systems and Control · Electrical Eng. & Systems 2024-10-24 Fabian Backhaus , Karoline Brucke , Peter Ruckdeschel , Sunke Schlüters

Selective inference is easier with p-values

Selective inference is a subfield of statistics that enables valid inference after selection of a data-dependent question. In this paper, we introduce selectively dominant p-values, a class of p-values that allow practitioners to easily…

Methodology · Statistics 2024-11-22 Anav Sood

A note on p-values interpreted as plausibilities

P-values are a mainstay in statistics but are often misinterpreted. We propose a new interpretation of p-value as a meaningful plausibility, where this is to be interpreted formally within the inferential model framework. We show that, for…

Statistics Theory · Mathematics 2014-10-28 Ryan Martin , Chuanhai Liu

Divergence vs. Decision P-values: A Distinction Worth Making in Theory and Keeping in Practice

There are two distinct definitions of 'P-value' for evaluating a proposed hypothesis or model for the process generating an observed dataset. The original definition starts with a measure of the divergence of the dataset from what was…

Other Statistics · Statistics 2023-09-25 Sander Greenland

Equivalence testing for linear regression

We introduce equivalence testing procedures for linear regression analyses. Such tests can be very useful for confirming the lack of a meaningful association between a continuous outcome and a continuous or binary predictor. Specifically,…

Methodology · Statistics 2023-05-17 Harlan Campbell