Comparing functional annotation analyses with Catmap
Open Access
- 9 December 2004
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 5 (1) , 193
- https://doi.org/10.1186/1471-2105-5-193
Abstract
Background: Ranked gene lists from microarray experiments are usually analysed by assigning significance to predefined gene categories, e.g., based on functional annotations. Tools performing such analyses are often restricted to a category score based on a cutoff in the ranked list and a significance calculation based on random gene permutations as null hypothesis. Results: We analysed three publicly available data sets, in each of which samples were divided in two classes and genes ranked according to their correlation to class labels. We developed a program, Catmap (available for download at http://bioinfo.thep.lu.se/Catmap), to compare different scores and null hypotheses in gene category analysis, using Gene Ontology annotations for category definition. When a cutoff-based score was used, results depended strongly on the choice of cutoff, introducing an arbitrariness in the analysis. Comparing results using random gene permutations and random sample permutations, respectively, we found that the assigned significance of a category depended strongly on the choice of null hypothesis. Compared to sample label permutations, gene permutations gave much smaller p-values for large categories with many coexpressed genes. Conclusions: In gene category analyses of ranked gene lists, a cutoff independent score is preferable. The choice of null hypothesis is very important; random gene permutations does not work well as an approximation to sample label permutations.Keywords
This publication has 24 references indexed in Scilit:
- Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrep-resented upstream motifsBMC Bioinformatics, 2004
- ACID: a database for microarray clone informationBioinformatics, 2004
- GOstat: find statistically overrepresented Gene Ontologies within a group of genesBioinformatics, 2004
- Iterative Group Analysis (iGA): A simple tool to enhance sensitivity and facilitate interpretation of microarray experimentsBMC Bioinformatics, 2004
- PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetesNature Genetics, 2003
- Global functional profiling of gene expression☆☆This work was funded in part by a Sun Microsystems grant awarded to S.D., NIH Grant HD36512 to S.A.K., a Wayne State University SOM Dean’s Post-Doctoral Fellowship, and an NICHD Contraception and Infertility Loan to G.C.O. Support from the WSU MCBI mode is gratefully appreciated.Genomics, 2003
- Profiling Gene Expression Using Onto-ExpressGenomics, 2002
- Gene expression profiling predicts clinical outcome of breast cancerNature, 2002
- Comprehensive Identification of Cell Cycle–regulated Genes of the YeastSaccharomyces cerevisiaeby Microarray HybridizationMolecular Biology of the Cell, 1998
- Individual Comparisons by Ranking MethodsBiometrics Bulletin, 1945