An evaluation of the performance of an automated procedure for comparative modelling of protein tertiary structure

1 July 1993

journal article
research article
Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection

Vol. 6 (5) , 501-512
https://doi.org/10.1093/protein/6.5.501

Abstract

A 3-D model of a protein can be constructed from its amino acid sequence and the 3-D structures of one or more homologues by annealing three sets of fragments: the structurally conserved regions, structurally variable regions and the side chains. The method encoded in the computer program COMPOSER was assessed by generating 3-D models of eight proteins whose crystal structures are already known and for which 3-D structures of homologues are available. In the structurally conserved regions, differences between modelled and X-ray structures are smaller than the differences between the X-ray structures of the modelled protein and the homologues used to build the model. When several homologues are used, the contributions of the known structures are weighted, preferably by the square of sequence similarity; this is especially important when the similarities of the homologues to the modelled structure differ greatly. The ‘collar’ extension approach, in which a similar region of different length in a homologue is used to extend the framework, can result in a more accurate model. If known homologues comprise more than one related group of proteins and they are both distantly related to the unknown, then alignment of the sequence to be modelled with each group of homologues facilitates identification of structurally conserved regions of the unknown and leads to an improved model. Models have root mean square differences (r.m.s.d.s) with the structures defined by X-ray analysis of between 0.73 and 1.56 Å for all Cα atoms, for seven of the eight models. For the model of mucor pepsin, where the closest homologue has 33% sequence identity and 20% of the residues are in structurally variable regions, the r.m.s.d. for the framework region is 1.71 Å and the r.m.s.d. for all Cα atoms is 3.47 Å.
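
The two quantitative ideas in the abstract, weighting homologues by the square of sequence similarity and scoring a model by its r.m.s.d. over Cα atoms, can be illustrated with a minimal sketch. This is not the COMPOSER implementation. It assumes the homologue Cα coordinates have already been superposed and restricted to topologically equivalent positions, that similarity is given as a fraction between 0 and 1, and that the function names `weighted_framework` and `rmsd` are hypothetical.

```python
import math


def weighted_framework(homologues, similarities):
    """Average pre-superposed C-alpha coordinates across homologues.

    Each homologue is a list of (x, y, z) tuples of equal length.
    Each homologue's weight is its sequence similarity squared
    (assumption: similarity is a fraction in the range 0..1).
    """
    weights = [s ** 2 for s in similarities]
    total = sum(weights)
    n_atoms = len(homologues[0])
    framework = []
    for i in range(n_atoms):
        point = tuple(
            sum(w * h[i][axis] for w, h in zip(weights, homologues)) / total
            for axis in range(3)
        )
        framework.append(point)
    return framework


def rmsd(coords_a, coords_b):
    """Root mean square difference between two equal-length coordinate
    lists that are assumed to be superposed already (no fitting is done)."""
    if len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must have the same length")
    squared = sum(
        (a - b) ** 2
        for p, q in zip(coords_a, coords_b)
        for a, b in zip(p, q)
    )
    return math.sqrt(squared / len(coords_a))


# Toy data: two homologues of two C-alpha atoms each (hypothetical values).
close_homologue = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
distant_homologue = [(1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]

framework = weighted_framework(
    [close_homologue, distant_homologue], similarities=[0.8, 0.4]
)
deviation = rmsd(framework, close_homologue)
```

With similarities of 0.8 and 0.4 the squared weights are 0.64 and 0.16, so the closer homologue contributes four times as much as the distant one rather than twice as much. This is one way to see why squaring matters most when the similarities differ greatly.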
