An evaluation of the performance of an automated procedure for comparative modelling of protein tertiary structure
- 1 July 1993
- journal article
- research article
- Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection
- Vol. 6 (5) , 501-512
- https://doi.org/10.1093/protein/6.5.501
Abstract
A 3-D model of a protein can be constructed from its amino acid sequence and the 3-D structures of one or more homologues by annealing three sets of fragments: the structurally conserved regions, structurally variable regions and the side chains. The method encoded in the computer program COMPOSER was assessed by generating 3-D models of eight proteins whose crystal structures are already known and for which 3-D structures of homologues are available. In the structurally conserved regions, differences between modelled and X-ray structures are smaller than the differences between the X-ray structures of the modelled protein and the homologues used to build the model. When several homologues are used, the contributions of the known structures are weighted, preferably by the square of sequence similarity; this is especially important when the similarities of the homologues to the modelled structure differ greatly. The ‘collar’ extension approach, in which a similar region of different length in a homologue is used to extend the framework, can result in a more accurate model. If known homologues comprise more than one related group of proteins and they are both distantly related to the unknown, then alignment of the sequence to be modelled with each group of homologues facilitates identification of structurally conserved regions of the unknown and leads to an improved model. Models have root mean square differences (r.m.s.d.s) with the structures defined by X-ray analysis of between 0.73 and 1.56 Å for all Cα atoms, for seven of the eight models. For the model of mucor pepsin, where the closest homologue has 33% sequence identity and 20% of the residues are in structurally variable regions, the r.m.s.d. for the framework region is 1.71 Å and the r.m.s.d. for all Cα atoms is 3.47 Å.
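The weighting and scoring ideas in the abstract can be illustrated with a minimal Python sketch. It is not COMPOSER's implementation, which the abstract does not describe in detail. It assumes that the homologue structures are already superposed, that their Cα atoms are listed in topologically equivalent order, and that sequence similarity is given as a fraction between 0 and 1. The function names and the example numbers are hypothetical.

```python
import math


def squared_similarity_weights(similarities):
    """Return normalised weights proportional to the square of each similarity."""
    squares = [s * s for s in similarities]
    total = sum(squares)
    return [sq / total for sq in squares]


def weighted_framework(structures, weights):
    """Weighted average of C-alpha coordinates over several homologues.

    Assumption: each structure is a list of (x, y, z) tuples for equivalent
    residues, and all structures are already superposed.
    """
    n_atoms = len(structures[0])
    framework = []
    for i in range(n_atoms):
        point = tuple(
            sum(w * s[i][k] for w, s in zip(weights, structures))
            for k in range(3)
        )
        framework.append(point)
    return framework


def rmsd(coords_a, coords_b):
    """Root mean square difference between two equivalent, superposed sets."""
    squared = sum(
        (a[k] - b[k]) ** 2
        for a, b in zip(coords_a, coords_b)
        for k in range(3)
    )
    return math.sqrt(squared / len(coords_a))


# Hypothetical example: two homologues with 60% and 30% similarity.
weights = squared_similarity_weights([0.6, 0.3])
homologues = [
    [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0)],
    [(1.0, 0.0, 0.0), (4.8, 0.0, 0.0)],
]
framework = weighted_framework(homologues, weights)
print(weights, framework, rmsd(framework, homologues[0]))
```

Squaring the similarities makes the closer homologue dominate: with similarities of 0.6 and 0.3, plain normalisation would split the contribution two to one, whereas squared weighting splits it four to one. This is in line with the abstract's remark that the weighting matters most when the similarities differ greatly.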