Related papers: Continuous Testing: Unifying Tests and E-values

E-values: Calibration, combination, and applications

Multiple testing of a single hypothesis and testing multiple hypotheses are usually done in terms of p-values. In this paper we replace p-values with their natural competitor, e-values, which are closely related to betting, Bayes factors,…

Statistics Theory · Mathematics 2021-10-26 Vladimir Vovk , Ruodu Wang

Beyond Neyman-Pearson: e-values enable hypothesis testing with a data-driven alpha

A standard practice in statistical hypothesis testing is to mention the p-value alongside the accept/reject decision. We show the advantages of mentioning an e-value instead. With p-values, it is not clear how to use an extreme observation…

Methodology · Statistics 2024-04-04 Peter Grünwald

Safe Testing

We develop the theory of hypothesis testing based on the e-value, a notion of evidence that, unlike the p-value, allows for effortlessly combining results from several studies in the common scenario where the decision to perform a new study…

Statistics Theory · Mathematics 2023-03-13 Peter Grünwald , Rianne de Heide , Wouter Koolen

E-values as unnormalized weights in multiple testing

We study how to combine p-values and e-values, and design multiple testing procedures where both p-values and e-values are available for every hypothesis. Our results provide a new perspective on multiple testing with data-driven weights:…

Methodology · Statistics 2023-07-19 Nikolaos Ignatiadis , Ruodu Wang , Aaditya Ramdas

E-values and sequential power-one tests for monotonicity and unimodality

We develop e-values and e-processes testing the null hypothesis that a distribution over nonnegative integers is monotone, and that a distribution over integers is unimodal given a certain mode. Our e-processes lead to tests of power one…

Statistics Theory · Mathematics 2026-04-23 Hongjian Wang , Aaditya Ramdas

Sequentially valid tests for forecast calibration

Forecasting and forecast evaluation are inherently sequential tasks. Predictions are often issued on a regular basis, such as every hour, day, or month, and their quality is monitored continuously. However, the classical statistical tools…

Methodology · Statistics 2022-07-04 Sebastian Arnold , Alexander Henzi , Johanna F. Ziegel

E-Values Expand the Scope of Conformal Prediction

Conformal prediction is a powerful framework for distribution-free uncertainty quantification. The standard approach to conformal prediction relies on comparing the ranks of prediction scores: under exchangeability, the rank of a future…

Machine Learning · Statistics 2025-05-07 Etienne Gauthier , Francis Bach , Michael I. Jordan

E-values as statistical evidence: A comparison to Bayes factors, likelihoods, and p-values

A recurring debate in the philosophy of statistics concerns what, exactly, should count as a measure of evidence for or against a given hypothesis. P-values, likelihood ratios, and Bayes factors all have their defenders. In this paper we…

Methodology · Statistics 2026-03-26 Ben Chugg , Aaditya Ramdas , Peter Grünwald

Why bother with Bayesian t-tests?

Given the well-known and fundamental problems with hypothesis testing via classical (point-form) significance tests, there has been a general move to alternative approaches, often focused on the Bayesian t-test. We show that the Bayesian…

Statistics Theory · Mathematics 2022-11-07 Fintan Costello , Paul Watts

Conformal e-testing

There is a useful counterpart of conformal prediction for e-values, called conformal e-prediction. Conformal prediction can serve as basis for testing the assumption of exchangeability, leading to conformal testing. Similarly, conformal…

Statistics Theory · Mathematics 2024-11-05 Vladimir Vovk , Ilia Nouretdinov , Alex Gammerman

Continuous Quantum Hypothesis Testing

I propose a general quantum hypothesis testing theory that enables one to test hypotheses about any aspect of a physical system, including its dynamics, based on a series of observations. For example, the hypotheses can be about the…

Quantum Physics · Physics 2012-04-30 Mankei Tsang

The E-measure

We introduce the E-measure: a measure-like generalization of the E-value to a class of hypotheses. Unlike classical measures, E-measures are closed under infimums instead of addition. They arise from a compatibility axiom with logical…

Statistics Theory · Mathematics 2026-04-23 Nick W. Koning

Testing with p*-values: Between p-values, mid p-values, and e-values

We introduce the notion of p*-values (p*-variables), which generalizes p-values (p-variables) in several senses. The new notion has four natural interpretations: operational, probabilistic, Bayesian, and frequentist. A main example of a…

Statistics Theory · Mathematics 2022-02-24 Ruodu Wang

The 'Right' Extension of Type-I Error to Data-Dependent Levels

The literature on hypothesis testing with data-dependent and post-hoc significance levels relies on a particular extension of the Type-I error to data-dependent levels. Existing arguments for this extension are heuristic, and primarily…

Statistics Theory · Mathematics 2026-05-28 Nick W. Koning

Uniformly most powerful Bayesian tests

Uniformly most powerful tests are statistical hypothesis tests that provide the greatest power against a fixed null hypothesis among all tests of a given size. In this article, the notion of uniformly most powerful tests is extended to the…

Statistics Theory · Mathematics 2014-01-30 Valen E. Johnson

Test statistics and p-values

We point out that the traditional notion of test statistic is too narrow, and we propose a natural generalization that is arguably maximal. The study is restricted to simple statistical hypotheses.

Methodology · Statistics 2020-01-07 Yuri Gurevich , Vladimir Vovk

True and false discoveries with independent and sequential e-values

In this paper we use e-values in the context of multiple hypothesis testing assuming that the base tests produce independent, or sequential, e-values. Our simulation and empirical studies and theoretical considerations suggest that, under…

Methodology · Statistics 2024-08-14 Vladimir Vovk , Ruodu Wang

Optimal e-value testing for properly constrained hypotheses

Hypothesis testing via e-variables can be framed as a sequential betting game, where a player each round picks an e-variable. A good player's strategy results in an effective statistical test that rejects the null hypothesis as soon as…

Statistics Theory · Mathematics 2025-05-30 Eugenio Clerico

Multiple Testing in Generalized Universal Inference

Compared to p-values, e-values provably guarantee safe, valid inference. If the goal is to test multiple hypotheses simultaneously, one can construct e-values for each individual test and then use the recently developed e-BH procedure to…

Methodology · Statistics 2024-12-03 Neil Dey , Ryan Martin , Jonathan P. Williams

Valid sequential inference on probability forecast performance

Probability forecasts for binary events play a central role in many applications. Their quality is commonly assessed with proper scoring rules, which assign forecasts a numerical score such that a correct forecast achieves a minimal…

Methodology · Statistics 2022-07-04 Alexander Henzi , Johanna F. Ziegel