Global mapping of the protein structure space and application in structure-based inference of protein function
- 10 February 2005
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 102 (10), 3651-3656
- https://doi.org/10.1073/pnas.0409772102
Abstract
We have constructed a map of the "protein structure space" by using the pairwise structural similarity scores calculated for all nonredundant protein structures determined experimentally. As expected, proteins with similar structures clustered together in the map and the overall distribution of structural classes of this map followed closely that of the map of the "protein fold space" we have reported previously. Consequently, proteins sharing similar molecular functions also were found to colocalize in the protein structure space map, pointing toward a previously undescribed scheme for structure-based functional inference for remote homologues based on the proximity in the map of the protein structure space. We found that this scheme consistently outperformed other predictions made by using either the raw scores or normalized Z-scores of pairwise DALI structure alignment.
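The general idea in the abstract (embed structures in a low-dimensional map from pairwise scores, then infer function from map neighbors) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the article: the use of classical (Torgerson) multidimensional scaling as the embedding, the conversion of similarity to distance as `1 - similarity`, the helper names `classical_mds` and `nearest_neighbor_label`, and the toy similarity matrix and function labels.

```python
import numpy as np


def classical_mds(dist, n_dims=3):
    """Embed points from a symmetric distance matrix via classical MDS.

    Illustrative stand-in for a map-building step; the article's actual
    procedure is not specified in the abstract.
    """
    n = dist.shape[0]
    # Double-center the squared distances to obtain a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_dims]
    # Clip small negative eigenvalues caused by non-Euclidean input.
    top_vals = np.clip(eigvals[order], 0.0, None)
    return eigvecs[:, order] * np.sqrt(top_vals)


def nearest_neighbor_label(coords, labels, query):
    """Return the label of the map point closest to `query` (excluding itself)."""
    d = np.linalg.norm(coords - coords[query], axis=1)
    d[query] = np.inf
    return labels[int(np.argmin(d))]


# Hypothetical toy data: four structures with made-up similarity scores in [0, 1].
similarity = np.array([
    [1.0, 0.9, 0.2, 0.1],
    [0.9, 1.0, 0.3, 0.2],
    [0.2, 0.3, 1.0, 0.8],
    [0.1, 0.2, 0.8, 1.0],
])
distance = 1.0 - similarity  # assumed similarity-to-distance conversion
labels = ["kinase", "kinase", "protease", "protease"]  # made-up annotations

coords = classical_mds(distance, n_dims=2)
predicted = nearest_neighbor_label(coords, labels, query=0)
print(predicted)
```

In this sketch the prediction for a query structure comes from its position in the map rather than from a single pairwise score, which is the contrast the abstract draws with predictions based on raw scores or Z-scores.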