Improved scoring of functional groups from gene expression data by decorrelating GO graph structure
Top Cited Papers
- 10 April 2006
- journal article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 22 (13) , 1600-1607
- https://doi.org/10.1093/bioinformatics/btl140
Abstract
Motivation: The result of a typical microarray experiment is a long list of genes with corresponding expression measurements. This list is only the starting point for a meaningful biological interpretation. Modern methods identify relevant biological processes or functions from gene expression data by scoring the statistical significance of predefined functional gene groups, e.g. based on Gene Ontology (GO). We develop methods that increase the explanatory power of this approach by integrating knowledge about relationships between the GO terms into the calculation of the statistical significance. Results: We present two novel algorithms that improve GO group scoring using the underlying GO graph topology. The algorithms are evaluated on real and simulated gene expression data. We show that both methods eliminate local dependencies between GO terms and point to relevant areas in the GO graph that remain undetected with state-of-the-art algorithms for scoring functional terms. A simulation study demonstrates that the new methods exhibit a higher level of detecting relevant biological terms than competing methods. Availability: topgo.bioinf.mpi-inf.mpg.de Contact: alexa@mpi-sb.mpg.de Supplementary Information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 13 references indexed in Scilit:
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- Ontological analysis of gene expression data: current tools, limitations, and open problemsBioinformatics, 2005
- Distinct gene expression profiles determine molecular treatment response in childhood acute lymphoblastic leukemiaBlood, 2005
- The Gene Ontology CategorizerBioinformatics, 2004
- A graph-theoretic approach to testing associations between disparate sources of functional genomics dataBioinformatics, 2004
- Gene expression profile of adult T-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survivalBlood, 2004
- GOstat: find statistically overrepresented Gene Ontologies within a group of genesBioinformatics, 2004
- FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genesBioinformatics, 2004
- Global functional profiling of gene expression☆☆This work was funded in part by a Sun Microsystems grant awarded to S.D., NIH Grant HD36512 to S.A.K., a Wayne State University SOM Dean’s Post-Doctoral Fellowship, and an NICHD Contraception and Infertility Loan to G.C.O. Support from the WSU MCBI mode is gratefully appreciated.Genomics, 2003
- The control of the false discovery rate in multiple testing under dependencyThe Annals of Statistics, 2001