Deploying the Conditional Randomization Test in High Multiplicity Problems
Methodology
2022-04-08 v3
Abstract
This paper introduces the sequential CRT, a variable selection procedure that combines the conditional randomization test (CRT) with Selective SeqStep+. The flexible CRT is used to construct valid p-values, which are then ordered and passed through the Selective SeqStep+ filter to produce a list of discoveries. We develop theory guaranteeing control of the false discovery rate (FDR) even though the p-values are not independent. We show in simulations that our novel procedure indeed controls the FDR and is competitive with, and sometimes outperforms, state-of-the-art alternatives in terms of power. Finally, we apply our methodology to a breast cancer dataset with the goal of identifying biomarkers associated with cancer stage.
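The filtering step described in the abstract can be illustrated with a minimal sketch of the Selective SeqStep+ rule as it is usually stated: given p-values in a fixed order, a threshold c, and a target level q, find the largest k for which (1 + #{j <= k : p_j > c}) / max(#{j <= k : p_j <= c}, 1) <= q (1 - c) / c, then report every j <= k with p_j <= c. The function name `selective_seqstep_plus` and the default values of `q` and `c` are assumptions made for illustration. The sketch takes the p-values as already computed and already ordered; it does not reproduce how the paper constructs CRT p-values or chooses the ordering.

```python
import numpy as np

def selective_seqstep_plus(pvals, q=0.1, c=0.5):
    """Illustrative Selective SeqStep+ filter (hypothetical helper).

    pvals: p-values listed in the order in which they should be examined.
    q:     target FDR level (default chosen for illustration only).
    c:     p-value threshold in (0, 1) (default chosen for illustration only).
    Returns the 0-based positions of the selected hypotheses.
    """
    p = np.asarray(pvals, dtype=float)
    below = p <= c
    # Running counts of p-values above and at-or-below the threshold c.
    n_above = np.cumsum(~below)
    n_below = np.cumsum(below)
    # Estimated false discovery proportion after each prefix of the list.
    ratio = (1 + n_above) / np.maximum(n_below, 1)
    ok = np.nonzero(ratio <= (1 - c) / c * q)[0]
    if ok.size == 0:
        return np.array([], dtype=int)
    k_hat = ok[-1]  # last prefix (0-based) that satisfies the bound
    # Select hypotheses up to k_hat whose p-value is at most c.
    return np.nonzero(below[: k_hat + 1])[0]

# Toy input: twenty small p-values followed by two large ones.
toy_pvals = [0.01] * 20 + [0.9, 0.8]
selected = selective_seqstep_plus(toy_pvals, q=0.2, c=0.5)
```

The cumulative sums evaluate the stopping criterion for every prefix in one pass, so the cost is linear in the number of hypotheses once the p-values are available.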
Cite
@article{arxiv.2110.02422,
title = {Deploying the Conditional Randomization Test in High Multiplicity Problems},
author = {Shuangning Li and Emmanuel J. Candès},
journal = {arXiv preprint arXiv:2110.02422},
year = {2022}
}