In silicogene function prediction using ontology-based pattern identification
Open Access
- 5 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (7) , 1237-1245
- https://doi.org/10.1093/bioinformatics/bti111
Abstract
Motivation: With the emergence of genome-wide expression profiling data sets, the guilt by association (GBA) principle has been a cornerstone for deriving gene functional interpretations in silico. Given the limited success of traditional methods for producing clusters of genes with great amounts of functional similarity, new data-mining algorithms are required to fully exploit the potential of high-throughput genomic approaches. Results: Ontology-based pattern identification (OPI) is a novel data-mining algorithm that systematically identifies expression patterns that best represent existing knowledge of gene function. Instead of relying on a universal threshold of expression similarity to define functionally related groups of genes, OPI finds the optimal analysis settings that yield gene expression patterns and gene lists that best predict gene function using the principle of GBA. We applied OPI to a publicly available gene expression data set on the life cycle of the malarial parasite Plasmodium falciparum and systematically annotated genes for 320 functional categories based on current Gene Ontology annotations. An ontology-based hierarchical tree of the 320 categories provided a systems-wide biological view of this important malarial parasite. Availability: A web accessible P. falciparum e-annotation database containing the results of this study can be accessed online at http://carrier.gnf.org/publications/OPI Contact:zhou@gnf.orgKeywords
This publication has 20 references indexed in Scilit:
- Quantifying the relationship between co-expression, co-regulation and gene functionBMC Bioinformatics, 2004
- Iterative Group Analysis (iGA): A simple tool to enhance sensitivity and facilitate interpretation of microarray experimentsBMC Bioinformatics, 2004
- Microarrays--Guilt by AssociationScience, 2003
- Transformation and normalization of oligonucleotide microarray dataBioinformatics, 2003
- Discovery of Gene Function by Expression Profiling of the Malaria Parasite Life CycleScience, 2003
- PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetesNature Genetics, 2003
- Gene expression data preprocessingBioinformatics, 2003
- Monitoring the chromosome 2 intraerythrocytic transcriptome of Plasmodium falciparum using oligonucleotide arrays.The American Journal of Tropical Medicine and Hygiene, 2002
- Knowledge-based analysis of microarray gene expression data by using support vector machinesProceedings of the National Academy of Sciences, 2000
- Cluster analysis and display of genome-wide expression patternsProceedings of the National Academy of Sciences, 1998