Berkeley PHOG: PhyloFacts orthology group prediction web server
Open Access
- 12 May 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (suppl_2) , W84-W89
- https://doi.org/10.1093/nar/gkp373
Abstract
Ortholog detection is essential in functional annotation of genomes, with applications to phylogenetic tree construction, prediction of protein–protein interaction and other bioinformatics tasks. We present here the PHOG web server employing a novel algorithm to identify orthologs based on phylogenetic analysis. Results on a benchmark dataset from the TreeFam-A manually curated orthology database show that PHOG provides a combination of high recall and precision competitive with both InParanoid and OrthoMCL, and allows users to target different taxonomic distances and precision levels through the use of tree-distance thresholds. For instance, OrthoMCL-DB achieved 76% recall and 66% precision on this dataset; at a slightly higher precision (68%) PHOG achieves 10% higher recall (86%). InParanoid achieved 87% recall at 24% precision on this dataset, while a PHOG variant designed for high recall achieves 88% recall at 61% precision, increasing precision by 37% over InParanoid. PHOG is based on pre-computed trees in the PhyloFacts resource, and contains over 366 K orthology groups with a minimum of three species. Predicted orthologs are linked to GO annotations, pathway information and biological literature. The PHOG web server is available at http://phylofacts.berkeley.edu/orthologs/.Keywords
This publication has 25 references indexed in Scilit:
- TreeFam: 2008 UpdateNucleic Acids Research, 2007
- PhylomeDB: a database for genome-wide collections of gene phylogeniesNucleic Acids Research, 2007
- eggNOG: automated construction and annotation of orthologous groups of genesNucleic Acids Research, 2007
- Automated Protein Subfamily Identification and ClassificationPLoS Computational Biology, 2007
- Berkeley Phylogenomics Group web servers: resources for structural phylogenomic analysisNucleic Acids Research, 2007
- FlowerPower: clustering proteins into domain architecture classes for phylogenomic inference of protein functionBMC Ecology and Evolution, 2007
- Functional Classification Using Phylogenomic InferencePLoS Computational Biology, 2006
- Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databasesBioinformatics, 2005
- Automatic clustering of orthologs and in-paralogs from pairwise species comparisonsJournal of Molecular Biology, 2001
- Errors in genome annotationTrends in Genetics, 1999