GO PaD: the Gene Ontology Partition Database
Open Access
- 10 November 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (suppl_1) , D322-D327
- https://doi.org/10.1093/nar/gkl799
Abstract
Gene Ontology (GO) has been widely used to infer functional significance associated with sets of genes in order to automate discoveries within large-scale genetic studies. A level in GO's direct acyclic graph structure is often assumed to be indicative of its terms' specificities, although other work has suggested this assumption does not hold. Unfortunately, quantitative analysis of biological functions based on nodes at the same level (as is common in gene enrichment analysis tools) can lead to incorrect conclusions as well as missed discoveries due to inefficient use of available information. This paper addresses these using an informational theoretic approach encoded in the GO Partition Database that guarantees to maximize information for gene enrichment analysis. The GO Partition Database was designed to feature ontology partitions with GO terms of similar specificity. The GO partitions comprise varying numbers of nodes and present relevant information theoretic statistics, so researchers can choose to analyze datasets at arbitrary levels of specificity. The GO Partition Database, featuring GO partition sets for functional analysis of genes from human and 10 other commonly studied organisms with a total of 131 972 genes, is available on the internet at: Author Webpage. The site also includes an online tutorial.Keywords
This publication has 11 references indexed in Scilit:
- The Gene Ontology (GO) project in 2006Nucleic Acids Research, 2006
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- EcoCyc: a comprehensive database resource for Escherichia coliNucleic Acids Research, 2004
- FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genesBioinformatics, 2004
- GeneInfoViz: constructing and visualizing gene relation networks.2004
- MIPS: analysis and annotation of proteins from whole genomesNucleic Acids Research, 2004
- The Gene Ontology (GO) database and informatics resourceNucleic Acids Research, 2004
- DAVID: Database for Annotation, Visualization, and Integrated DiscoveryGenome Biology, 2003
- YPDTM, PombePDTM and WormPDTM: model organism volumes of the BioKnowledgeTM Library, an integrated resource for protein informationNucleic Acids Research, 2001
- A Mathematical Theory of CommunicationBell System Technical Journal, 1948