A statistical perspective on gene expression data analysis
- 8 January 2003
- journal article
- review article
- Published by Wiley in Statistics in Medicine
- Vol. 22 (3) , 481-499
- https://doi.org/10.1002/sim.1350
Abstract
Rapid advances in biotechnology have resulted in an increasing interest in the use of oligonucleotide and spotted cDNA gene expression microarrays for medical research. These arrays are being widely used to understand the underlying genetic structure of various diseases, with the ultimate goal to provide better diagnosis, prevention and cure. This technology allows for measurement of expression levels from several thousands of genes simultaneously, thus resulting in an enormous amount of data. The role of the statistician is critical to the successful design of gene expression studies, and the analysis and interpretation of the resulting voluminous data. This paper discusses hypotheses common to gene expression studies, and describes some of the statistical methods suitable for addressing these hypotheses. S‐plus and SAS codes to perform the statistical methods are provided. Gene expression data from an unpublished oncologic study is used to illustrate these methods. Copyright © 2003 John Wiley & Sons, Ltd.Keywords
This publication has 26 references indexed in Scilit:
- Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression DataJournal of the American Statistical Association, 2002
- In Vivo Regulation of Human Skeletal Muscle Gene Expression by Thyroid HormoneGenome Research, 2002
- Three Cell Wall Mannoproteins Facilitate the Uptake of Iron in Saccharomyces cerevisiaePublished by Elsevier ,2001
- Experimental design for gene expression microarraysBiostatistics, 2001
- Gene-Expression Profiles in Hereditary Breast CancerNew England Journal of Medicine, 2001
- On Differential Variability of Expression Ratios: Improving Statistical Inference about Gene Expression Changes from Microarray DataJournal of Computational Biology, 2001
- Distinct types of diffuse large B-cell lymphoma identified by gene expression profilingNature, 2000
- [12] DNA arrays for analysis of gene expressionPublished by Elsevier ,1999
- High density synthetic oligonucleotide arraysNature Genetics, 1999
- Multiple Tests with Discrete DistributionsThe American Statistician, 1997