Functionality of System Components: Conservation of Protein Function in Protein Feature Space
Open Access
- 1 January 2003
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 13 (11) , 2444-2449
- https://doi.org/10.1101/gr.1190803
Abstract
Many protein features useful for prediction of protein function can be predicted from sequence, including posttranslational modifications, subcellular localization, and physical/chemical properties. We show here that such protein features are more conserved among orthologs than paralogs, indicating they are crucial for protein function and thus subject to selective pressure. This means that a function prediction method based on sequence-derived features may be able to discriminate between proteins with different function even when they have highly similar structure. Also, such a method is likely to perform well on organisms other than the one on which it was trained. We evaluate the performance of such a method, ProtFun, which relies on protein features as its sole input, and show that the method gives similar performance for most eukaryotes and performs much better than anticipated on archaea and bacteria. From this analysis, we conclude that for the posttranslational modifications studied, both the cellular use and the sequence motifs are conserved within Eukarya.
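The ortholog-versus-paralog comparison described in the abstract can be illustrated with a minimal sketch. It assumes each protein is represented by a numeric vector of predicted features and uses Euclidean distance as the measure of conservation; the abstract does not state which measure the authors used, so that choice is an assumption. The protein identifiers, feature values, and function names below are hypothetical and are not data or code from the paper or from ProtFun.

```python
import math

def feature_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mean_pair_distance(features, pairs):
    """Average feature-space distance over a list of (id, id) protein pairs."""
    distances = [feature_distance(features[p], features[q]) for p, q in pairs]
    return sum(distances) / len(distances)

# Hypothetical, made-up feature vectors (illustration only, not from the paper).
features = {
    "human_A": [0.9, 0.1, 0.5],
    "mouse_A": [0.8, 0.1, 0.5],
    "human_B": [0.2, 0.7, 0.4],
    "mouse_B": [0.2, 0.6, 0.4],
}

# Orthologs: the same gene in different species.
ortholog_pairs = [("human_A", "mouse_A"), ("human_B", "mouse_B")]
# Paralogs: duplicated genes within one species.
paralog_pairs = [("human_A", "human_B"), ("mouse_A", "mouse_B")]

ortholog_mean = mean_pair_distance(features, ortholog_pairs)
paralog_mean = mean_pair_distance(features, paralog_pairs)

# A smaller mean distance means the features are more conserved.
print("orthologs:", ortholog_mean)
print("paralogs:", paralog_mean)
```

With these toy values the ortholog pairs sit closer together in feature space than the paralog pairs, which is the pattern the abstract reports for real proteins.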