To how many simultaneous hypothesis tests can normal, Student's t or bootstrap calibrations be applied

Fan, Jianqing, Hall, Peter and Yao, Qiwei ORCID: 0000-0003-2065-8486 (2007) To how many simultaneous hypothesis tests can normal, Student's t or bootstrap calibrations be applied. Journal of the American Statistical Association, 102 (480). pp. 1282-1288. ISSN 0162-1459

Identification Number: 10.1198/016214507000000969

Abstract

In the analysis of microarray data, and in some other contemporary statistical problems, it is not uncommon to apply hypothesis tests in a highly simultaneous way. The number, N say, of tests used can be much larger than the sample sizes, n, to which the tests are applied, yet we wish to calibrate the tests so that the overall level of the simultaneous test is accurate. Often the sampling distribution is quite different for each test, so there may not be an opportunity to combine data across samples. In this setting, how large can N be, as a function of n, before level accuracy becomes poor? Here we answer this question in cases where the statistic under test is of Student's t type. We show that if either the normal or Student's t distribution is used for calibration, then the level of the simultaneous test is accurate provided that log N increases at a strictly slower rate than n^{1/3} as n diverges. On the other hand, if bootstrap methods are used for calibration, then we may choose log N almost as large as n^{1/2} and still achieve asymptotic-level accuracy. The implications of these results are explored both theoretically and numerically.
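
The question the abstract poses can be illustrated with a small Monte Carlo experiment. The sketch below is not the authors' code or their simulation design. It assumes Exponential(1) data as an example of a skewed sampling distribution, a Bonferroni adjustment as the way of controlling the overall level, and hypothetical helper names (`t_statistic`, `estimate_fwer`). It estimates how often at least one of N true null hypotheses is rejected when each Student's t statistic is calibrated against the normal distribution.

```python
import math
import random
from statistics import NormalDist


def t_statistic(sample, mu0):
    """One-sample Student's t statistic for H0: mean == mu0."""
    n = len(sample)
    mean = sum(sample) / n
    var = sum((x - mean) ** 2 for x in sample) / (n - 1)
    return math.sqrt(n) * (mean - mu0) / math.sqrt(var)


def estimate_fwer(n, num_tests, alpha=0.05, reps=200, seed=0):
    """Monte Carlo estimate of the family-wise error rate under the global null.

    Assumptions of this sketch (not taken from the paper):
    each of the num_tests samples holds n Exponential(1) draws, whose true
    mean is 1, and each two-sided test uses the normal critical value at the
    Bonferroni-adjusted level alpha / num_tests.
    """
    rng = random.Random(seed)
    # Two-sided normal critical value for per-test level alpha / num_tests.
    crit = NormalDist().inv_cdf(1 - alpha / (2 * num_tests))
    family_errors = 0
    for _ in range(reps):
        for _ in range(num_tests):
            sample = [rng.expovariate(1.0) for _ in range(n)]
            if abs(t_statistic(sample, 1.0)) > crit:
                family_errors += 1
                break  # one false rejection already counts as a family-wise error
    return family_errors / reps
```

Calling `estimate_fwer(n=10, num_tests=100)` and then repeating with a larger `n` gives a rough sense of how the gap between the estimated and nominal level depends on the sample size for a fixed number of tests. The asymptotic rates in the abstract do not translate directly into finite-sample thresholds, so the numbers from such a run are only indicative.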

Item Type:	Article
Official URL:	http://www.amstat.org/publications/JASA/
Additional Information:	© 2008 American Statistical Association
Divisions:	Statistics
Subjects:	H Social Sciences > HA Statistics
Date Deposited:	13 Jun 2008 09:15
Last Modified:	09 May 2025 14:07
URI:	http://eprints.lse.ac.uk/id/eprint/5399
