English

YEAST: Yet Another Sequential Test

Methodology 2025-10-08 v5

Abstract

Online evaluation of machine learning models is typically conducted through A/B experiments. Sequential statistical tests are valuable tools for analysing these experiments, as they enable researchers to stop data collection early without increasing the risk of false discoveries. However, existing sequential tests either limit the number of interim analyses or suffer from low statistical power. In this paper, we introduce a novel sequential test designed for continuous monitoring of A/B experiments. We validate our method using semi-synthetic simulations and demonstrate that it outperforms current state-of-the-art sequential testing approaches. Our method is derived using a new technique that inverts a bound on the probability of threshold crossing, based on a classical maximal inequality.

Keywords

Cite

@article{arxiv.2406.16523,
  title  = {YEAST: Yet Another Sequential Test},
  author = {Alexey Kurennoy and Majed Dodin and Tural Gurbanov and Ana Peleteiro Ramallo},
  journal= {arXiv preprint arXiv:2406.16523},
  year   = {2025}
}

Comments

18 pages, 1 figure, 7 tables