Related papers: Bayes, E-values and Testing

Anytime-valid sequential testing for elicitable functionals via supermartingales

We design sequential tests for a large class of nonparametric null hypotheses based on elicitable and identifiable functionals. Such functionals are defined in terms of scoring functions and identification functions, which are ideal…

Statistics Theory · Mathematics 2023-06-06 Philippe Casgrain , Martin Larsson , Johanna Ziegel

E-values as statistical evidence: A comparison to Bayes factors, likelihoods, and p-values

A recurring debate in the philosophy of statistics concerns what, exactly, should count as a measure of evidence for or against a given hypothesis. P-values, likelihood ratios, and Bayes factors all have their defenders. In this paper we…

Methodology · Statistics 2026-03-26 Ben Chugg , Aaditya Ramdas , Peter Grünwald

Towards E-Value Based Stopping Rules for Bayesian Deep Ensembles

Bayesian Deep Ensembles (BDEs) represent a powerful approach for uncertainty quantification in deep learning, combining the robustness of Deep Ensembles (DEs) with flexible multi-chain MCMC. While DEs are affordable in most deep learning…

Machine Learning · Computer Science 2026-04-21 Emanuel Sommer , Rickmer Schulte , Sarah Deubner , Julius Kobialka , David Rügamer

E-values for Adaptive Clinical Trials: Anytime-Valid Monitoring in Practice

Adaptive clinical trials rely on interim analyses, flexible stopping, and data-dependent design modifications that complicate statistical guarantees when fixed-horizon test statistics are repeatedly inspected or reused after adaptations.…

Methodology · Statistics 2026-02-09 Alexandra Sokolova , Vadim Sokolov

Valid sequential inference on probability forecast performance

Probability forecasts for binary events play a central role in many applications. Their quality is commonly assessed with proper scoring rules, which assign forecasts a numerical score such that a correct forecast achieves a minimal…

Methodology · Statistics 2022-07-04 Alexander Henzi , Johanna F. Ziegel

Admissible anytime-valid sequential inference must rely on nonnegative martingales

Confidence sequences, anytime p-values (called p-processes in this paper), and e-processes all enable sequential inference for composite and nonparametric classes of distributions at arbitrary stopping times. Examining the literature, one…

Statistics Theory · Mathematics 2022-11-08 Aaditya Ramdas , Johannes Ruf , Martin Larsson , Wouter Koolen

Performance of Test Supermartingale Confidence Intervals for the Success Probability of Bernoulli Trials

Given a composite null hypothesis H, test supermartingales are non-negative supermartingales with respect to H with initial value 1. Large values of test supermartingales provide evidence against H. As a result, test supermartingales are an…

Statistics Theory · Mathematics 2020-03-27 Peter Wills , Emanuel Knill , Kevin Coakley , Yanbao Zhang

E-values and sequential power-one tests for monotonicity and unimodality

We develop e-values and e-processes testing the null hypothesis that a distribution over nonnegative integers is monotone, and that a distribution over integers is unimodal given a certain mode. Our e-processes lead to tests of power one…

Statistics Theory · Mathematics 2026-04-23 Hongjian Wang , Aaditya Ramdas

Test Martingales for bounded random variables

Given a random sample from a random variable $T$ which is bounded from above, $T\le\tau$ a.s., we define processes that are positive supermartingales if $E(T)\ge\mu$. Such processes are called test martingales. Tests of the supermartingale…

Methodology · Statistics 2018-02-20 Harrie Hendriks

Sequentially valid tests for forecast calibration

Forecasting and forecast evaluation are inherently sequential tasks. Predictions are often issued on a regular basis, such as every hour, day, or month, and their quality is monitored continuously. However, the classical statistical tools…

Methodology · Statistics 2022-07-04 Sebastian Arnold , Alexander Henzi , Johanna F. Ziegel

A Sequential Test for Log-Concavity

On observing a sequence of i.i.d.\ data with distribution $P$ on $\mathbb{R}^d$, we ask the question of how one can test the null hypothesis that $P$ has a log-concave density. This paper proves one interesting negative and positive result:…

Statistics Theory · Mathematics 2023-01-10 Aditya Gangrade , Alessandro Rinaldo , Aaditya Ramdas

Anytime Valid Tests of Conditional Independence Under Model-X

We propose a sequential, anytime-valid method to test the conditional independence of a response $Y$ and a predictor $X$ given a random vector $Z$. The proposed test is based on e-statistics and test martingales, which generalize likelihood…

Methodology · Statistics 2023-02-22 Peter Grünwald , Alexander Henzi , Tyron Lardy

Asymptotically Log-Optimal Bayes-Assisted Confidence Sequences for Bounded Means

Confidence sequences based on test martingales provide time-uniform uncertainty quantification for the mean of bounded IID observations without parametric distributional assumptions. Their practical efficiency, however, depends strongly on…

Machine Learning · Statistics 2026-05-12 Valentin Kilian , Stefano Cortinovis , François Caron

Power comparison of sequential testing by betting procedures

In this paper, we derive power guarantees of some sequential tests for bounded mean under general alternatives. We focus on testing procedures using nonnegative supermartingales which are anytime valid and consider alternatives which…

Statistics Theory · Mathematics 2025-10-15 Amaury Durand , Olivier Wintenberger

E-Values Expand the Scope of Conformal Prediction

Conformal prediction is a powerful framework for distribution-free uncertainty quantification. The standard approach to conformal prediction relies on comparing the ranks of prediction scores: under exchangeability, the rank of a future…

Machine Learning · Statistics 2025-05-07 Etienne Gauthier , Francis Bach , Michael I. Jordan

Test Martingales for bounded random variables

Given a positive random variable $X$, $X\ge0$ a.s., a null hypothesis $H_0:E(X)\le\mu$ and a random sample of infinite size of $X$, we construct test supermartingales for $H_0$, i.e. positive processes that are supermartingale if the null…

Methodology · Statistics 2021-09-21 Harrie Hendriks

Sequential Nonparametric Testing with the Law of the Iterated Logarithm

We propose a new algorithmic framework for sequential hypothesis testing with i.i.d. data, which includes A/B testing, nonparametric two-sample testing, and independence testing as special cases. It is novel in several ways: (a) it takes…

Machine Learning · Statistics 2016-03-03 Akshay Balsubramani , Aaditya Ramdas

Adaptive clinical trials based on design-optimal e-values with automatic curtailment: An application to single-arm trials with binary data

The e-value is gaining traction as a robust alternative to p-values and Bayes factors for quantifying statistical evidence. e-values are a promising method for adaptive clinical trials due to their anytime-validity: e-values ensure type I…

Methodology · Statistics 2026-05-28 Stef Baas , Judith ter Schure , Joost van Rosmalen

Test Martingales, Bayes Factors and $p$-Values

A nonnegative martingale with initial value equal to one measures evidence against a probabilistic hypothesis. The inverse of its value at some stopping time can be interpreted as a Bayes factor. If we exaggerate the evidence by considering…

Statistics Theory · Mathematics 2011-06-17 Glenn Shafer , Alexander Shen , Nikolai Vereshchagin , Vladimir Vovk

Sequential Monte-Carlo testing by betting

In a Monte-Carlo test, the observed dataset is fixed, and several resampled or permuted versions of the dataset are generated in order to test a null hypothesis that the original dataset is exchangeable with the resampled/permuted ones.…

Methodology · Statistics 2025-05-05 Lasse Fischer , Aaditya Ramdas