Mutual Information in Protein Multiple Sequence Alignments Reveals Two Classes of Coevolving Positions
- 22 April 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Biochemistry
- Vol. 44 (19) , 7156-7165
- https://doi.org/10.1021/bi050293e
Abstract
Information theory was used to identify nonconserved coevolving positions in multiple sequence alignments from a variety of protein families. Coevolving positions in these alignments fall into two general categories. One set is composed of positions that coevolve with only one or two other positions. These positions often display direct amino acid side-chain interactions with their coevolving partner. The other set comprises positions that coevolve with many others and are frequently located in regions critical for protein function, such as active sites and surfaces involved in intermolecular interactions and recognition. We find that coevolving positions are more likely to change protein function when mutated than are positions showing little coevolution. These results imply that information theory may be applied generally to find coevolving, nonconserved positions that are part of functional sites in uncharacterized protein families. We propose that these coevolving positions compose an important subset of the positions in an alignment, and may be as important to the structure and function of the protein family as are highly conserved positions.Keywords
This publication has 5 references indexed in Scilit:
- The Jalview Java alignment editorBioinformatics, 2004
- The COG database: an updated version includes eukaryotesBMC Bioinformatics, 2003
- Do Proteins Learn to Evolve? The Hopfield Network as a Basis for the Understanding of Protein EvolutionJournal of Theoretical Biology, 2000
- Alanine-scanning Mutagenesis of the ∊ Subunit of the F1-F0 ATP Synthase from Escherichia coli Reveals Two Classes of MutantsPublished by Elsevier ,1995
- Crystal Structure of Myxococcus xanthus Nucleoside Diphosphate Kinase and its Interaction with a Nucleotide Substrate at 2·0 Å ResolutionJournal of Molecular Biology, 1993