Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays
- 10 June 2005
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 102 (25) , 8961-8965
- https://doi.org/10.1073/pnas.0502674102
Abstract
Global analysis of gene expression by using DNA microarrays is employed increasingly to search for differences in biological properties between normal and diseased tissue. In such studies, expression that deviates from defined thresholds commonly is used for creating genetic signatures that characterize disease vs. normality. Although it is axiomatic that the threshold parameters applied to microarray analysis will alter the contents of such genetic signatures, the extent to which threshold choice can affect the fundamental conclusions made from microarray-based studies has not been elucidated. We used gabriel (Genetic Analysis By Rules Incorporating Expert Logic), a platform of knowledge-based algorithms for the global analysis of gene expression, together with conventional statistical approaches, to examine the sensitivity of conclusions to threshold choice in recently published microarray-based studies. An analysis of the effects of threshold decisions in one of these studies [Ramaswamy, S., Ross, K. N., Lander, E. S. & Golub, T. R. (2003) Nat. Genet. 33, 49–54], which arrived at the important conclusion that the metastatic potential of primary tumors is encoded by the bulk of cells in the tumor, is the focus of this article. We discovered that support for this conclusion highly depends on the threshold used to create gene expression signatures. We also found that threshold choice dramatically affected the gene function categories represented nonrandomly in signatures. Our results suggest that the robustness of biological conclusions made by using microarray analysis should be routinely assessed by examining the validity of the conclusions by using a range of threshold parameters.Keywords
This publication has 27 references indexed in Scilit:
- A molecular signature of metastasis in primary solid tumorsNature Genetics, 2002
- Nonparametric methods for identifying differentially expressed genes in microarray dataBioinformatics, 2002
- A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experimentsBioinformatics, 2002
- Analysis of DNA microarrays using algorithms that employ rule-based expert knowledgeProceedings of the National Academy of Sciences, 2002
- Gene expression profiling predicts clinical outcome of breast cancerNature, 2002
- Global analysis of growth phase responsive gene expression and regulation of antibiotic biosynthetic pathways inStreptomyces coelicolorusing DNA microarraysGenes & Development, 2001
- Rich probabilistic models for gene expressionBioinformatics, 2001
- Computational analysis of microarray dataNature Reviews Genetics, 2001
- Support vector machine classification and validation of cancer tissue samples using microarray expression dataBioinformatics, 2000
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000