YEAST: Yet Another Sequential Test

Alexey Kurennoy; Majed Dodin; Tural Gurbanov; Ana Peleteiro Ramallo

YEAST: Yet Another Sequential Test

Methodology 2025-10-08 v5

Authors: Alexey Kurennoy , Majed Dodin , Tural Gurbanov , Ana Peleteiro Ramallo

Abstract

Online evaluation of machine learning models is typically conducted through A/B experiments. Sequential statistical tests are valuable tools for analysing these experiments, as they enable researchers to stop data collection early without increasing the risk of false discoveries. However, existing sequential tests either limit the number of interim analyses or suffer from low statistical power. In this paper, we introduce a novel sequential test designed for continuous monitoring of A/B experiments. We validate our method using semi-synthetic simulations and demonstrate that it outperforms current state-of-the-art sequential testing approaches. Our method is derived using a new technique that inverts a bound on the probability of threshold crossing, based on a classical maximal inequality.

Keywords

multiple testing hypothesis testing reliability analysis

Cite

@article{arxiv.2406.16523,
  title  = {YEAST: Yet Another Sequential Test},
  author = {Alexey Kurennoy and Majed Dodin and Tural Gurbanov and Ana Peleteiro Ramallo},
  journal= {arXiv preprint arXiv:2406.16523},
  year   = {2025}
}

Comments

18 pages, 1 figure, 7 tables

YEAST: Yet Another Sequential Test

Abstract

Keywords

Cite

Comments

Related papers