Simultaneous critical values for $t$-tests in very high dimensions
Abstract
This article considers the problem of multiple hypothesis testing using -tests. The observed data are assumed to be independently generated conditional on an underlying and unknown two-state hidden model. We propose an asymptotically valid data-driven procedure to find critical values for rejection regions controlling the -familywise error rate (-FWER), false discovery rate (FDR) and the tail probability of false discovery proportion (FDTP) by using one-sample and two-sample -statistics. We only require a finite fourth moment plus some very general conditions on the mean and variance of the population by virtue of the moderate deviations properties of -statistics. A new consistent estimator for the proportion of alternative hypotheses is developed. Simulation studies support our theoretical results and demonstrate that the power of a multiple testing procedure can be substantially improved by using critical values directly, as opposed to the conventional -value approach. Our method is applied in an analysis of the microarray data from a leukemia cancer study that involves testing a large number of hypotheses simultaneously.
Cite
@article{arxiv.1102.2046,
title = {Simultaneous critical values for $t$-tests in very high dimensions},
author = {Hongyuan Cao and Michael R. Kosorok},
journal= {arXiv preprint arXiv:1102.2046},
year = {2011}
}
Comments
Published in at http://dx.doi.org/10.3150/10-BEJ272 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)