Over-representation of the disease associated (CAG) and (CGG) repeats in the human genome
Open Access
- 1 January 1994
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 22 (9), 1735-1740
- https://doi.org/10.1093/nar/22.9.1735
Abstract
Expansion of trimer repeats has recently been described as a new type of human mutation. Of the 64 possible trimer compositions, only the CGG and CAG repeats have been implicated in genetic diseases. This study intends to address two questions: (1) What makes the CGG and CAG repeats unique? (2) Could other trimer repeats be involved in this type of mutation? By computer analysis of trimer and hexamer frequency distributions in approximately 10 Mb of human DNA, twenty trimer motifs (ten complementary pairs) have been identified that are the most likely to be expanded. The frequency distribution study also indicated that the expanded trimer motif in Fragile-X syndrome is GGC instead of CGG. DNA linguistics studies revealed that the GGC/GCC and CAG/CTG repeats were over-represented in the human genome. Further analysis of base composition suggested that the CCA/TGG repeats may be involved in the trimer expansion mutation since they possessed many similar characteristics to GGC/GCC and CAG/CTG. The computer-aided sequence analysis studies reported here may help to understand the molecular mechanisms of trimer repeat expansion.
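
The kind of trimer frequency analysis the abstract describes can be illustrated with a short Python sketch. This is not the authors' method: the abstract does not specify their statistical model, so the sketch assumes the simplest possible baseline, in which the expected count of a trimer is derived from single-base frequencies alone. The function names (`reverse_complement`, `trimer_counts`, `observed_over_expected`) are hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import product

# All 64 possible trimers over the DNA alphabet.
ALL_TRIMERS = ["".join(p) for p in product("ACGT", repeat=3)]

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement, e.g. CAG -> CTG."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def trimer_counts(seq):
    """Count overlapping trimers, skipping windows with non-ACGT symbols."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - 2):
        kmer = seq[i:i + 3]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts


def observed_over_expected(seq):
    """Ratio of observed trimer counts to counts expected from base
    composition alone (an assumed zeroth-order model, for illustration).
    Trimers whose expected count is zero are omitted."""
    seq = seq.upper()
    base_counts = Counter(b for b in seq if b in "ACGT")
    total_bases = sum(base_counts.values())
    if total_bases == 0:
        return {}
    freq = {b: base_counts[b] / total_bases for b in "ACGT"}
    counts = trimer_counts(seq)
    total_trimers = sum(counts.values())
    ratios = {}
    for kmer in ALL_TRIMERS:
        expected = total_trimers * freq[kmer[0]] * freq[kmer[1]] * freq[kmer[2]]
        if expected > 0:
            ratios[kmer] = counts[kmer] / expected
    return ratios
```

A ratio well above 1 for a motif and its reverse complement (for example CAG and CTG) would flag the pair as over-represented under this simplified baseline. A model that conditions on neighboring bases, such as a Markov chain, would give a stricter test.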