Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach
Open Access
- 29 June 2005
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 6 (1) , 165
- https://doi.org/10.1186/1471-2105-6-165
Abstract
In testing for differential gene expression involving multiple serial analysis of gene expression (SAGE) libraries, it is critical to account for both between-library and within-library variation. Several methods have been proposed, including the t test, the t_w test, and an overdispersed logistic regression approach. The merits of these tests, however, have not been fully evaluated, and it remains an open question whether further improvements can be made. In this article, we introduce an overdispersed log-linear model approach to analyzing SAGE data; we evaluate and compare its performance with three other tests: the two-sample t test, the t_w test, and a test based on overdispersed logistic linear regression. Analysis of simulated and real datasets shows that both the log-linear and logistic overdispersion methods generally perform better than the t and t_w tests; the log-linear method is further found to perform better than the logistic method, showing equal or higher statistical power over a range of parameter values and with different data distributions. Overdispersed log-linear models provide an attractive and reliable framework for analyzing SAGE experiments involving multiple libraries. For convenience, an implementation of this method is available through a user-friendly web interface at http://www.cbcb.duke.edu/sage.
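To make the idea of an overdispersed log-linear model concrete, the following is a minimal sketch for a single tag compared across two groups of libraries. It is not the estimator used in the article: it assumes a Poisson log-linear model with library size as an offset and a single constant dispersion factor estimated from the Pearson chi-square statistic (a quasi-Poisson simplification). The function name `quasi_poisson_two_group` and all of its parameters are hypothetical and chosen only for illustration.

```python
import math


def quasi_poisson_two_group(counts_a, sizes_a, counts_b, sizes_b):
    """Compare one tag between two groups of SAGE libraries.

    Assumed model (illustrative, not the article's exact method):
        log(mu_i) = log(n_i) + b0 + b1 * group_i,   Var(y_i) = phi * mu_i
    where y_i is the tag count and n_i the total tag count of library i.
    Returns (log_ratio, phi, statistic, df).
    """
    df = len(counts_a) + len(counts_b) - 2  # residual degrees of freedom
    if df < 1:
        raise ValueError("need at least three libraries in total")

    total_a, total_b = sum(counts_a), sum(counts_b)
    if total_a == 0 or total_b == 0:
        raise ValueError("each group needs at least one observed tag")

    # With only a group indicator and an offset, the Poisson maximum
    # likelihood fit is the pooled proportion within each group.
    rate_a = total_a / sum(sizes_a)
    rate_b = total_b / sum(sizes_b)
    log_ratio = math.log(rate_b / rate_a)  # estimate of b1

    # Pearson chi-square measures between-library variation beyond Poisson.
    pearson = 0.0
    for counts, sizes, rate in ((counts_a, sizes_a, rate_a),
                                (counts_b, sizes_b, rate_b)):
        for y, n in zip(counts, sizes):
            mu = n * rate
            pearson += (y - mu) ** 2 / mu

    # Dispersion factor; floored at 1 so the test is never more liberal
    # than the plain Poisson model (an assumption of this sketch).
    phi = max(1.0, pearson / df)

    # Poisson standard error of b1, inflated by the dispersion factor.
    se = math.sqrt(phi * (1.0 / total_a + 1.0 / total_b))
    return log_ratio, phi, log_ratio / se, df


# Hypothetical counts for one tag in two libraries per group.
result = quasi_poisson_two_group([5, 15], [1000, 1000], [20, 20], [1000, 1000])
print(result)
```

The returned statistic would typically be referred to a t distribution with `df` degrees of freedom. A dispersion factor well above 1 indicates that libraries within a group vary more than Poisson sampling alone would predict, which is the between-library variation the abstract says must be accounted for.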