Predicting protein functional sites with phylogenetic motifs
- 30 November 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 58 (2) , 309-320
- https://doi.org/10.1002/prot.20321
Abstract
In this report, we demonstrate that phylogenetic motifs, sequence regions conserving the overall familial phylogeny, represent a promising approach to protein functional site prediction. Across our structurally and functionally heterogeneous data set, phylogenetic motifs consistently correspond to functional sites defined by both surface loops and active site clefts. Additionally, the partially buried prosthetic group regions of cytochrome P450 and succinate dehydrogenase are identified as phylogenetic motifs. In nearly all instances, phylogenetic motifs are structurally clustered, despite little overall sequence proximity, around key functional site features. Based on calculated false‐positive expectations and standard motif identification methods, we show that phylogenetic motifs are generally conserved in sequence. This result implies that they can be considered motifs in the traditional sense as well. However, there are instances where phylogenetic motifs are not (overall) well conserved in sequence. This point is enticing, because it implies that phylogenetic motifs are able to identify key sequence regions that traditional motif‐based approaches would not. Further, phylogenetic motif results are also shown to be consistent with evolutionary trace results, and bootstrapping is used to demonstrate tree significance. Proteins 2005.Keywords
This publication has 48 references indexed in Scilit:
- Phylogenomic inference of protein molecular function: advances and challengesBioinformatics, 2004
- ELM server: a new resource for investigating short functional sites in modular eukaryotic proteinsNucleic Acids Research, 2003
- Motif-based construction of a functional map for mammalian olfactory receptorsGenomics, 2003
- Automatic Methods for Predicting Functionally Important ResiduesJournal of Molecular Biology, 2003
- Finding important sites in protein sequencesProceedings of the National Academy of Sciences, 2002
- Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins11Edited by J. ThorntonJournal of Molecular Biology, 2001
- Evolutionary trace analysis of TGF-β and related growth factors: implications for site-directed mutagenesisProtein Engineering, Design and Selection, 2000
- The quest to deduce protein function from sequence: the role of pattern databasesThe International Journal of Biochemistry & Cell Biology, 1999
- An Evolutionary Trace Method Defines Binding Surfaces Common to Protein FamiliesJournal of Molecular Biology, 1996
- A method to predict functional residues in proteinsNature Structural & Molecular Biology, 1995