Statistical modeling of large microarray data sets to identify stimulus-response profiles
- 8 May 2001
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 98 (10) , 5631-5636
- https://doi.org/10.1073/pnas.101013198
Abstract
A statistical modeling approach is proposed for use in searching large microarray data sets for genes that have a transcriptional response to a stimulus. The approach is unrestricted with respect to the timing, magnitude or duration of the response, or the overall abundance of the transcript. The statistical model makes an accommodation for systematic heterogeneity in expression levels. Corresponding data analyses provide gene-specific information, and the approach provides a means for evaluating the statistical significance of such information. To illustrate this strategy we have derived a model to depict the profile expected for a periodically transcribed gene and used it to look for budding yeast transcripts that adhere to this profile. Using objective criteria, this method identifies 81% of the known periodic transcripts and 1,088 genes, which show significant periodicity in at least one of the three data sets analyzed. However, only one-quarter of these genes show significant oscillations in at least two data sets and can be classified as periodic with high confidence. The method provides estimates of the mean activation and deactivation times, induced and basal expression levels, and statistical measures of the precision of these estimates for each periodic transcript.Keywords
This publication has 17 references indexed in Scilit:
- Singular value decomposition for genome-wide expression data processing and modelingProceedings of the National Academy of Sciences, 2000
- Fundamental patterns underlying gene expression profiles: Simplicity from complexityProceedings of the National Academy of Sciences, 2000
- Array of hopeNature Genetics, 1999
- Comprehensive Identification of Cell Cycle–regulated Genes of the YeastSaccharomyces cerevisiaeby Microarray HybridizationMolecular Biology of the Cell, 1998
- Exploring the Metabolic and Genetic Control of Gene Expression on a Genomic ScaleScience, 1997
- α-Factor synchronization of budding yeastPublished by Elsevier ,1997
- Parallel human genome analysis: microarray-based expression monitoring of 1000 genes.Proceedings of the National Academy of Sciences, 1996
- Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA MicroarrayScience, 1995
- Light-Directed, Spatially Addressable Parallel Chemical SynthesisScience, 1991
- Longitudinal data analysis using generalized linear modelsBiometrika, 1986