Statistical significance for genomewide studies
Top Cited Papers
- 25 July 2003
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 100 (16) , 9440-9445
- https://doi.org/10.1073/pnas.1530509100
Abstract
With the increase in genomewide experiments and the sequencing of multiple genomes, the analysis of large data sets has become commonplace in biology. It is often the case that thousands of features in a genomewide data set are tested against some null hypothesis, where a number of features are expected to be significant. Here we propose an approach to measuring statistical significance in these genomewide studies based on the concept of the false discovery rate. This approach offers a sensible balance between the number of true and false positives that is automatically calibrated and easily interpreted. In doing so, a measure of statistical significance called the q value is associated with each tested feature. The q value is similar to the well known p value, except it is a measure of significance in terms of the false discovery rate rather than the false positive rate. Our approach avoids a flood of false positive results, while offering a more liberal criterion than what has been used in genome scans for linkage.Keywords
This publication has 23 references indexed in Scilit:
- Outlier Detection and False Discovery Rates for Whole-Genome DNA MatchingJournal of the American Statistical Association, 2003
- From patterns to pathways: gene expression data analysis comes of ageNature Genetics, 2002
- Transcriptional Regulatory Networks in Saccharomyces cerevisiaeScience, 2002
- A Direct Approach to False Discovery RatesJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Operating Characteristics and Extensions of the False Discovery Rate ProcedureJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Empirical bayes methods and false discovery rates for microarraysGenetic Epidemiology, 2002
- Significance analysis of microarrays applied to the ionizing radiation responseProceedings of the National Academy of Sciences, 2001
- On the Adaptive Control of the False Discovery Rate in Multiple Testing With Independent StatisticsJournal of Educational and Behavioral Statistics, 2000
- Biochemistry and genetics of eukaryotic mismatch repair.Genes & Development, 1996
- Genetic dissection of complex traits: guidelines for interpreting and reporting linkage resultsNature Genetics, 1995