Algorithmic complexity and thermodynamics of sequence-structure relationships in proteins
- 1 October 1997
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review E
- Vol. 56 (4) , 4545-4552
- https://doi.org/10.1103/physreve.56.4545
Abstract
The information contained in a protein’s amino acid sequence dictates its three-dimensional structure. In this situation a frozen or embedded structure, the sequence, contains information that ultimately influences a thermodynamic entity, the protein structure. The interplay between information and thermodynamics is explored by considering the algorithmic complexity and Kolmogorov’s universal probability of the sequence and of the structure. It is shown that the algorithmic complexity of a microstate of a polymer is given by its configurational entropy. Using this result and a lattice protein model, a quantitative estimate of the information contained in a protein’s structure is made. This is compared to the information content of the sequence. The information content of the sequence is approximately 2.5 bits per amino acid, while the content in the structure is approximately 0.5 bits per amino acid. It is estimated that virtually all the information contained in the protein structure is shared with the sequence. A deeper connection can be made between the shared information content and the thermodynamic entropy governing the system. Using Kolmogorov’s universal probability, it is possible to establish statistical-mechanical relationships for objects without resorting to a probabilistic ensemble formalism. This allows the thermodynamics of microstates of objects of known configurations to be determined. Using this formalism, the connection between sequence information and the structural thermodynamics of a protein can be made. This connection has strong implications for how protein sequences evolve over evolutionary time and demonstrates that this evolution is constrained by the thermodynamic evolution of the protein structure.Keywords
This publication has 13 references indexed in Scilit:
- Elements of Information TheoryPublished by Wiley ,2001
- The Shannon information entropy of protein sequencesBiophysical Journal, 1996
- Algorithmic complexity of a proteinPhysical Review E, 1996
- Algorithmic randomness and physical entropyPhysical Review A, 1989
- Thermodynamic cost of computation, algorithmic complexity and the information metricNature, 1989
- The Neutral Theory of Molecular EvolutionPublished by Cambridge University Press (CUP) ,1983
- Treatment of disordered and ordered systems of polymer chains by lattice methodsProceedings of the National Academy of Sciences, 1982
- Principles that Govern the Folding of Protein ChainsScience, 1973
- Information Theory and Statistical Mechanics. IIPhysical Review B, 1957
- Information Theory and Statistical MechanicsPhysical Review B, 1957