Detection of native‐like models for amino acid sequences of unknown three‐dimensional structure in a data base of known protein conformations
- 1 July 1992
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 13 (3) , 258-271
- https://doi.org/10.1002/prot.340130308
Abstract
We present an approach which can be used to identify native-like folds in a data base of protein conformations in the absence of any sequence homology to proteins in the data base. The method is based on a knowledge-based force field derived from a set of known protein conformations. A given sequence is mounted on all conformations in the data base andthe associated energies are calculated. Using several conformations and sequences from the globin family we show that the native conformation is identified correctly. In fact the resolution of the force field is high enough to discriminate between a native fold and several closely related conformations. We then apply the procedure to several globins of known sequence but unknown three dimensional structure. The homology of these sequences to globins of known structures in the data base ranges from 49 to 17%. Withone exception we find that for all globin sequences one of the known globinfolds is identified as the most favorable conformation. These results are obtained using a force field derived from a data base devoid of globins of known structure. We briefly discuss useful applications in protein structurlresearch and future development of our approach.Keywords
This publication has 29 references indexed in Scilit:
- Mandelate racemase and muconate lactonizing enzyme are mechanistically distinct and structurally homologousNature, 1990
- Calculation of conformational ensembles from potentials of mena forceJournal of Molecular Biology, 1990
- Between objectivity and subjectivityNature, 1990
- Aplysia limacina myoglobinJournal of Molecular Biology, 1989
- Refinement of a molecular model for lamprey hemoglobin from Petromyzon marinusJournal of Molecular Biology, 1985
- CHARMM: A program for macromolecular energy, minimization, and dynamics calculationsJournal of Computational Chemistry, 1983
- Amino acid sequence of the smallest polypeptide chain containing heme of extracellular hemoglobin from the polychaete tylorrhynchus heterochaetusBiochimica et Biophysica Acta (BBA) - Protein Structure and Molecular Enzymology, 1982
- Structure of erythrocruorin in different ligand states refined at 1·4 Å resolutionJournal of Molecular Biology, 1979
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- The amino acid sequence of leghaemoglobin I from root nodules of broad bean (Vicia faba L.)FEBS Letters, 1975