To How Many Simultaneous Hypothesis Tests Can Normal, Student'stor Bootstrap Calibration Be Applied?

1 December 2007

journal article
Published by Taylor & Francis in Journal of the American Statistical Association

Vol. 102 (480) , 1282-1288
https://doi.org/10.1198/016214507000000969

Abstract

In the analysis of microarray data, and in some other contemporary statistical problems, it is not uncommon to apply hypothesis tests in a highly simultaneous way. The number, N say, of tests used can be much larger than the sample sizes, n, to which the tests are applied, yet we wish to calibrate the tests so that the overall level of the simultaneous test is accurate. Often the sampling distribution is quite different for each test, so there may not be an opportunity to combine data across samples. In this setting, how large can N be, as a function of n, before level accuracy becomes poor? Here we answer this question in cases where the statistic under test is of Student's t type. We show that if either the normal or Student t distribution is used for calibration, then the level of the simultaneous test is accurate provided that log N increases at a strictly slower rate than n ^1/3 as n diverges. On the other hand, if bootstrap methods are used for calibration, then we may choose log N almost as large as n ^1/2 and still achieve asymptotic-level accuracy. The implications of these results are explored both theoretically and numerically.

Keywords

All Related Versions

Version 1, 2007-01-01, RePEc (Unconfirmed version)

This publication has 2 references indexed in Scilit:

Removing intensity effects and identifying significant genes for Affymetrix arrays in macrophage migration inhibitory factor-suppressed neuroblastoma cells
Proceedings of the National Academy of Sciences, 2005
A Two-Way Semilinear Model for Normalization and Analysis of cDNA Microarray Data
Journal of the American Statistical Association, 2005