Chaos game representation of coding regions of human globin genes and alcohol dehydrogenase genes of phylogenetically divergent species
- 1 September 1992
- journal article
- research article
- Published by Springer Nature in Journal of Molecular Evolution
- Vol. 35 (3) , 261-269
- https://doi.org/10.1007/bf00178602
Abstract
Chaos game representation (CGR) is a novel holistic approach that provides a visual image of a DNA sequence quite different from the traditional linear arrangement of nucleotides. Although it is known that CGR patterns depict base composition and sequentiality, the biological significance of the specific features of each pattern is not understood. To systematically examine these features, we have examined the coding sequences of 7 human globin genes and 29 relatively conserved alcohol dehydrogenase (Adh) genes from phylogenetically divergent species. The CGRs of human globin cDNAs were similar to one another and to the entire human globin gene complex. Interestingly, human globin CGRs were also strikingly similar to human Adh CGRs. Adh CGRs were similar for genes of the same or closely related species but were different for relatively conserved Adh genes from distantly related species. Dinucleotide frequencies may account for the self-similar pattern that is characteristic of vertebrate CGRs and the genome-specific features of CGR patterns. Mutational frequencies of dinucleotides may vary among genome types. The special features of CG dinucleotides of vertebrates represent such an example. The CGR patterns examined thus far suggest that the evolution of a gene and its coding sequence should not be examined in isolation. Consideration should be given to genome-specific differential mutation rates for different dinucleotides or specific oligonucleotides.Keywords
This publication has 11 references indexed in Scilit:
- Molecular evolution of the zinc-containing long-chain alcohol dehydrogenase genes.Molecular Biology and Evolution, 1990
- Chaos game representation of gene structureNucleic Acids Research, 1990
- The GenBank®genetic sequence data bankNucleic Acids Research, 1988
- A comprehensive set of sequence analysis programs for the VAXNucleic Acids Research, 1984
- 5-Methylcytosine in Eukaryotic DNAScience, 1981
- DNA methylation and the frequency of CpG in animal DNANucleic Acids Research, 1980
- Molecular basis of base substitution hotspots in Escherichia coliNature, 1978
- Doublet frequency analysis of fractionated vertebrate nuclear DNAJournal of Molecular Biology, 1976
- Simple mathematical models with very complicated dynamicsNature, 1976
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970