RNA folding and combinatory landscapes
- 1 March 1993
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review E
- Vol. 47 (3) , 2083-2099
- https://doi.org/10.1103/physreve.47.2083
Abstract
In this paper we view the folding of polynucleotide (RNA) sequences as a map that assigns to each sequence a minimum-free-energy pattern of base pairings, known as secondary structure. Considering only the free energy leads to an energy landscape over the sequence space. Taking into account structure generates a less visualizable nonscalar ‘‘landscape,’’ where a sequence space is mapped into a space of discrete ‘‘shapes.’’ We investigate the statistical features of both types of landscapes by computing autocorrelation functions, as well as distributions of energy and structure distances, as a function of distance in sequence space. RNA folding is characterized by very short structure correlation lengths compared to the diameter of the sequence space. The correlation lengths depend strongly on the size and the pairing rules of the underlying nucleotide alphabet. Our data suggest that almost every minimum-free-energy structure is found within a small neighborhood of any random sequence. The interest in such landscapes results from the fact that they govern natural and artificial processes of optimization by mutation and selection. Simple statistical model landscapes, like Kauffman and Levin’s n-k model [J. Theor. Biol. 128, 11 (1987)], are often used as a proxy for understanding realistic landscapes, like those induced by RNA folding. We make a detailed comparison between the energy landscapes derived from RNA folding and those obtained from the n-k model. We derive autocorrelation functions for several variants of the n-k model, and briefly summarize work on its fine structure. The comparison leads to an estimate for k=7–8, independent of n, where n is the chain length. While the scaling behaviors agree, the fine structure is considerably different in the two cases. The reason is seen to be the extremely high frequency of neutral neighbors, that is, neighbors with identical energy (or structure), in the RNA case.Keywords
This publication has 25 references indexed in Scilit:
- Towards a general theory of adaptive walks on rugged landscapesPublished by Elsevier ,2006
- Correlation structure of the landscape of the graph-bipartitioning problemJournal of Physics A: General Physics, 1992
- The landscape of the traveling salesman problemPhysics Letters A, 1992
- Correlated and uncorrelated fitness landscapes and how to tell the differenceBiological Cybernetics, 1990
- Population dynamics in a spin-glass model of chemical evolutionJournal of Molecular Evolution, 1989
- Low autocorrelation binary sequences : statistical mechanics and configuration space analysisJournal de Physique, 1987
- Mean-Field Equations for the Matching and the Travelling Salesman ProblemsEurophysics Letters, 1986
- Application of statistical mechanics to NP-complete problems in combinatorial optimisationJournal of Physics A: General Physics, 1986
- Random-energy model: An exactly solvable model of disordered systemsPhysical Review B, 1981
- Solvable Model of a Spin-GlassPhysical Review Letters, 1975