Classifier-Based Nonparametric Sequential Hypothesis Testing

Statistics Theory; Information Theory (math.IT); 2026-03-23; v1

Abstract

We consider the problem of constructing sequential power-one tests where the null and alternative classes are specified indirectly through historical or offline data. More specifically, given an offline dataset consisting of observations from $L+1$ distributions $\{P_0, P_1, \ldots, P_L\}$, and a new unlabeled data stream $\{X_t: t \geq 1\} \overset{\text{i.i.d.}}{\sim} P_\theta$, the goal is to decide between the null $H_0: \theta = 0$ and the alternative $H_1: \theta \in [L] := \{1, \ldots, L\}$. Our main methodological contribution is a general approach for designing a level-$\alpha$ power-one test for this problem using a multi-class classifier trained on the given offline dataset. Working under a mild "separability" condition on the distributions and the trained classifier, we obtain an upper bound on the expected stopping time of our proposed level-$\alpha$ test, and then show that in general this bound cannot be improved. In addition to rejecting the null, we show that our procedure can also identify the true underlying distribution almost surely. We then establish a sufficient condition to ensure the required separability of the classifier, and provide some converse results to investigate the role of the size of the offline dataset and the family of classifiers among classifier-based tests that satisfy the level-$\alpha$ power-one criterion. Finally, we present an extension of our analysis to the case of a mismatch between the training and testing distributions, and illustrate an application to sequential change detection. Empirical results using both synthetic and real data support our theoretical results.
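
For readers less familiar with the terminology, the level-$\alpha$ power-one requirement mentioned above can be written out as follows. This is the standard formulation from the sequential testing literature, offered as a reading aid rather than as the paper's own statement; the symbols $\tau$ (the stopping time at which the test rejects), $\mathbb{P}_\theta$ (the law of the stream when $X_t \sim P_\theta$), and $W_t$ are notation assumed here and do not appear in the abstract.

```latex
% Level-alpha: under the null, the test stops (rejects) with probability at most alpha.
% Power-one: under every alternative, the test stops almost surely.
\mathbb{P}_{0}(\tau < \infty) \le \alpha
\qquad \text{and} \qquad
\mathbb{P}_{\theta}(\tau < \infty) = 1 \quad \text{for every } \theta \in [L].

% One common route to the level-alpha guarantee is Ville's inequality:
% if (W_t)_{t \ge 0} is a nonnegative supermartingale under H_0 with W_0 = 1, then
\mathbb{P}_{0}\bigl(\exists\, t \ge 1 : W_t \ge 1/\alpha\bigr) \le \alpha .
```

Under this reading, stopping at the first time $W_t \ge 1/\alpha$ yields a level-$\alpha$ test for any such process $W_t$; whether the paper's classifier-based statistic is built this way is not stated in the abstract.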

Cite

@article{arxiv.2603.20135,
  title  = {Classifier-Based Nonparametric Sequential Hypothesis Testing},
  author = {Chia-Yu Hsu and Shubhanshu Shekhar},
  journal= {arXiv preprint arXiv:2603.20135},
  year   = {2026}
}