English

Deploying the Conditional Randomization Test in High Multiplicity Problems

Methodology 2022-04-08 v3

Abstract

This paper introduces the sequential CRT, which is a variable selection procedure that combines the conditional randomization test (CRT) and Selective SeqStep+. Valid p-values are constructed via the flexible CRT, which are then ordered and passed through the selective SeqStep+ filter to produce a list of discoveries. We develop theory guaranteeing control on the false discovery rate (FDR) even though the p-values are not independent. We show in simulations that our novel procedure indeed controls the FDR and are competitive with -- and sometimes outperform -- state-of-the-art alternatives in terms of power. Finally, we apply our methodology to a breast cancer dataset with the goal of identifying biomarkers associated with cancer stage.

Keywords

Cite

@article{arxiv.2110.02422,
  title  = {Deploying the Conditional Randomization Test in High Multiplicity Problems},
  author = {Shuangning Li and Emmanuel J. Candès},
  journal= {arXiv preprint arXiv:2110.02422},
  year   = {2022}
}