Critical Review of Published Microarray Studies for Cancer Outcome and Guidelines on Statistical Analysis and Reporting
Top Cited Papers
Open Access
- 17 January 2007
- journal article
- guideline
- Published by Oxford University Press (OUP) in JNCI Journal of the National Cancer Institute
- Vol. 99 (2) , 147-157
- https://doi.org/10.1093/jnci/djk018
Abstract
Both the validity and the reproducibility of microarray-based clinical research have been challenged. There is a need for critical review of the statistical analysis and reporting in published microarray studies that focus on cancer-related clinical outcomes. Studies published through 2004 in which microarray-based gene expression profiles were analyzed for their relation to a clinical cancer outcome were identified through a Medline search followed by hand screening of abstracts and full text articles. Studies that were eligible for our analysis addressed one or more outcomes that were either an event occurring during follow-up, such as death or relapse, or a therapeutic response. We recorded descriptive characteristics for all the selected studies. A critical review of outcome-related statistical analyses was undertaken for the articles published in 2004. Ninety studies were identified, and their descriptive characteristics are presented. Sixty-eight (76%) were published in journals of impact factor greater than 6. A detailed account of the 42 studies (47%) published in 2004 is reported. Twenty-one (50%) of them contained at least one of the following three basic flaws: 1) in outcome-related gene finding, an unstated, unclear, or inadequate control for multiple testing; 2) in class discovery, a spurious claim of correlation between clusters and clinical outcome, made after clustering samples using a selection of outcome-related differentially expressed genes; or 3) in supervised prediction, a biased estimation of the prediction accuracy through an incorrect cross-validation procedure. The most common and serious mistakes and misunderstandings recorded in published studies are described and illustrated. Based on this analysis, a proposal of guidelines for statistical analysis and reporting for clinical microarray studies, presented as a checklist of “Do's and Don'ts,” is provided.Keywords
This publication has 23 references indexed in Scilit:
- Roadmap for Developing and Validating Therapeutically Relevant Genomic ClassifiersJournal of Clinical Oncology, 2005
- Prediction error estimation: a comparison of resampling methodsBioinformatics, 2005
- Prediction of cancer outcome with microarrays: a multiple random validation strategyPublished by Elsevier ,2005
- Controlling the number of false discoveries: application to high-dimensional genomic dataJournal of Statistical Planning and Inference, 2004
- Limitations of the Odds Ratio in Gauging the Performance of a Diagnostic, Prognostic, or Screening MarkerAmerican Journal of Epidemiology, 2004
- Rules of evidence for cancer molecular-marker discovery and validationNature Reviews Cancer, 2004
- Predictive ability of DNA microarrays for cancer outcomes and correlates: an empirical assessmentThe Lancet, 2003
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003
- The use and analysis of microarray dataNature Reviews Drug Discovery, 2002
- More powerful procedures for multiple significance testingStatistics in Medicine, 1990