SCOPmap: Automated assignment of protein structures to evolutionary superfamilies
Open Access
- 14 December 2004
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 5 (1) , 197
- https://doi.org/10.1186/1471-2105-5-197
Abstract
Background: Inference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database. Results: The general strategy employed by this algorithm is to combine the results of several existing sequence and structure comparison tools applied to a query protein of known structure in order to find the homologs already classified in SCOP database and thus determine classification assignments. The algorithm is able to map domains within newly solved structures to the appropriate SCOP superfamily level with ~95% accuracy. Examples of correctly mapped remote homologs are discussed. The algorithm is also capable of identifying potential evolutionary relationships not specified in the SCOP database, thus helping to make it better. The strategy of the mapping algorithm is not limited to SCOP and can be applied to any other evolutionary-based classification scheme as well. SCOPmap is available for download. Conclusion: The SCOPmap program is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies.Keywords
This publication has 55 references indexed in Scilit:
- RNA Synthesis in a Cage—Structural Studies of Reovirus Polymerase λ3Cell, 2002
- Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structureJournal of Molecular Biology, 2001
- Crystal structure of the MJ0490 gene product of the hyperthermophilic archaebacterium Methanococcus jannaschii, a novel member of the Lactate/Malate family of dehydrogenasesJournal of Molecular Biology, 2001
- Crystal structure of NADH-dependent ferredoxin reductase component in biphenyl dioxygenaseJournal of Molecular Biology, 2000
- The Protein Data BankNucleic Acids Research, 2000
- The N-terminal domain of the human Rad51 protein binds DNA: structure and a DNA binding surface as revealed by NMRJournal of Molecular Biology, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- Structural Features can be Unconserved in Proteins with Similar Folds: An Analysis of Side-chain to Side-chain Contacts Secondary Structure and AccessibilityJournal of Molecular Biology, 1994
- Structure of the C-terminal domain of the ribosomal protein from Escherichia coli at 1.7 ÅJournal of Molecular Biology, 1987