Networks of Coevolving Sites in Structural and Functional Domains of Serpin Proteins
Open Access
- 27 April 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 22 (7) , 1627-1634
- https://doi.org/10.1093/molbev/msi157
Abstract
Amino acids do not occur randomly in proteins; rather, their occurrence at any given site is strongly influenced by the amino acid composition at other sites, the structural and functional aspects of the region of the protein in which they occur, and the evolutionary history of the protein. The goal of our research study is to identify networks of coevolving sites within the serpin proteins (serine protease inhibitors) and classify them as being caused by structural-functional constraints or by evolutionary history. To address this, a matrix of pairwise normalized mutual information (NMI) values was computed among amino acid sites for the serpin proteins. The NMI matrix was partitioned into orthogonal patterns of amino acid variability by factor analysis. Each common factor pattern was interpreted as having phylogenetic and/or structural-functional explanations. In addition, we used a bootstrap factor analysis technique to limit the effects of phylogenetic history on our factor patterns. Our results show an extensive network of correlations among amino acid sites in key functional regions (reactive center loop, shutter, and breach). Additionally, we have discovered long-range coevolution for packed amino acids within the serpin protein core. Lastly, we have discovered a group of serpin sites which coevolve in the hydrophobic core region (s5B and s4B) and appear to represent sites important for formation of the “native” instead of the “latent” serpin structure. This research provides a better understanding on how protein structure evolves; in particular, it elucidates the selective forces creating coevolution among protein sites.Keywords
This publication has 41 references indexed in Scilit:
- Covarion Structure in Plastid Genome Evolution: A New Statistical TestMolecular Biology and Evolution, 2005
- Mutation of the Highly Conserved Tryptophan in the Serpin Breach Region Alters the Inhibitory Mechanism of Plasminogen Activator Inhibitor-1Biochemistry, 2003
- Detection of conserved physico-chemical characteristics of proteins by analyzing clusters of positions with co-ordinated substitutionsBioinformatics, 2001
- Phylogenetic Analyses of Amino Acid Variation in the Serpin ProteinsMolecular Biology and Evolution, 2001
- The Protein Data Bank and the challenge of structural genomics.Nature Structural & Molecular Biology, 2000
- Correlations Among Amino Acid Sites in bHLH Protein Domains: An Information Theoretic AnalysisMolecular Biology and Evolution, 2000
- Familial dementia caused by polymerization of mutant neuroserpinNature, 1999
- Positional Dependence, Cliques, and Predictive Motifs in the bHLH Protein DomainJournal of Molecular Evolution, 1999
- An analysis of simultaneous variation in protein structuresProtein Engineering, Design and Selection, 1997
- Covariation of residues in the homeodomain sequence familyProtein Science, 1995