Gene-set analysis and reduction
Open Access
- 4 October 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Briefings in Bioinformatics
- Vol. 10 (1) , 24-34
- https://doi.org/10.1093/bib/bbn042
Abstract
Gene-set analysis aims to identify differentially expressed gene sets (pathways) by a phenotype in DNA microarray studies. We review here important methodological aspects of gene-set analysis and illustrate them with varying performance of several methods proposed in the literature. We emphasize the importance of distinguishing between ‘self-contained’ versus ‘competitive’ methods, following Goeman and Bühlmann. We also discuss reducing a gene set to its subset, consisting of ‘core members’ that chiefly contribute to the statistical significance of the differential expression of the initial gene set by phenotype. Significance analysis of microarray for gene-set reduction (SAM-GSR) can be used for an analytical reduction of gene sets to their core subsets. We apply SAM-GSR on a microarray dataset for identifying biological gene sets (pathways) whose gene expressions are associated with p53 mutation in cancer cell lines. Codes to implement SAM-GSR in the statistical package R can be downloaded from http://www.ualberta.ca/~yyasui/homepage.html.Keywords
This publication has 28 references indexed in Scilit:
- Comparative evaluation of gene-set analysis methodsBMC Bioinformatics, 2007
- Improving gene set analysis of microarray data by SAM-GSBMC Bioinformatics, 2007
- A method for computing the overall statistical significance of a treatment effect among a group of genesBMC Bioinformatics, 2006
- A multivariate approach for integrating genome-wide expression data and biological knowledgeBioinformatics, 2006
- Thousands of samples are needed to generate a robust gene list for predicting outcome in cancerProceedings of the National Academy of Sciences, 2006
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- Integrated modeling of clinical and gene expression information for personalized prediction of disease outcomesProceedings of the National Academy of Sciences, 2004
- PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetesNature Genetics, 2003
- Multiple Hypothesis Testing in Microarray ExperimentsStatistical Science, 2003
- Significance analysis of microarrays applied to the ionizing radiation responseProceedings of the National Academy of Sciences, 2001