The ConSurf‐HSSP database: The mapping of evolutionary conservation among homologs onto PDB structures
- 21 December 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 58 (3) , 610-617
- https://doi.org/10.1002/prot.20305
Abstract
The HSSP (Homology‐Derived Secondary Structure of Proteins) database provides multiple sequence alignments (MSAs) for proteins of known three‐dimensional (3D) structure in the Protein Data Bank (PDB). The database also contains an estimate of the degree of evolutionary conservation at each amino acid position. This estimate, which is based on the relative entropy, correlates with the functional importance of the position; evolutionarily conserved positions (i.e., positions with limited variability and low entropy) are occasionally important to maintain the 3D structure and biological function(s) of the protein. We recently developed the Rate4Site algorithm for scoring amino acid conservation based on their calculated evolutionary rate. This algorithm takes into account the phylogenetic relationships between the homologs and the stochastic nature of the evolutionary process. Here we present the ConSurf‐HSSP database of Rate4Site estimates of the evolutionary rates of the amino acid positions, calculated using HSSP's MSAs. The database provides precalculated evolutionary rates for nearly all of the PDB. These rates are projected, using a color code, onto the protein structure, and can be viewed online using the ConSurf server interface. To exemplify the database, we analyzed in detail the conservation pattern obtained for pyruvate kinase and compared the results with those observed using the relative entropy scores of the HSSP database. It is reassuring to know that the main functional region of the enzyme is detectable using both conservation scores. Interestingly, the ConSurf‐HSSP calculations mapped additional functionally important regions, which are moderately conserved and were overlooked by the original HSSP estimate. The ConSurf‐HSSP database is available online (http://consurf‐hssp.tau.ac.il). Proteins 2005.Keywords
This publication has 48 references indexed in Scilit:
- In silico identification of functional protein interfacesComparative and Functional Genomics, 2003
- Computational methods for the prediction of protein interactionsCurrent Opinion in Structural Biology, 2002
- Inference of functional regions in proteins by quantification of evolutionary constraintsProceedings of the National Academy of Sciences, 2002
- Evolutionary predictions of binding surfaces and interactionsCurrent Opinion in Structural Biology, 2002
- Conservation helps to identify biologically relevant crystal contactsJournal of Molecular Biology, 2001
- Identification of protein oligomerization states by analysis of interface conservationProceedings of the National Academy of Sciences, 2001
- The Protein Data BankNucleic Acids Research, 2000
- Protein-protein interfaces: Analysis of amino acid conservation in homodimersProteins-Structure Function and Bioinformatics, 2000
- The HSSP database of protein structure-sequence alignments and family profilesNucleic Acids Research, 1998
- Database of homology‐derived protein structures and the structural meaning of sequence alignmentProteins-Structure Function and Bioinformatics, 1991