English

Testing for Conditional Independence in Binary Single-Index Models

Methodology 2025-12-23 v1

Abstract

We wish to test whether a real-valued variable ZZ has explanatory power, in addition to a multivariate variable XX, for a binary variable YY. Thus, we are interested in testing the hypothesis P(Y=1X,Z)=P(Y=1X)\mathbb{P}(Y=1\, | \, X,Z)=\mathbb{P}(Y=1\, | \, X), based on nn i.i.d.\ copies of (X,Y,Z)(X,Y,Z). In order to avoid the curse of dimensionality, we follow the common approach of assuming that the dependence of both YY and ZZ on XX is through a single-index XβX^\top\beta only. Splitting the sample on both YY-values, we construct a two-sample empirical process of transformed ZZ-variables, after splitting the XX-space into parallel strips. Studying this two-sample empirical process is challenging: it does not converge weakly to a standard Brownian bridge, but after an appropriate normalization it does. We use this result to construct distribution-free tests.

Keywords

Cite

@article{arxiv.2512.19641,
  title  = {Testing for Conditional Independence in Binary Single-Index Models},
  author = {John H. J. Einmahl and Denis Kojevnikov and Bas J. M. Werker},
  journal= {arXiv preprint arXiv:2512.19641},
  year   = {2025}
}