CpG Mutation Rates in the Human Genome Are Highly Dependent on Local GC Content
Open Access
- 10 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 22 (3) , 650-658
- https://doi.org/10.1093/molbev/msi043
Abstract
CpG dinucleotides mutate at a high rate because cytosine is vulnerable to deamination, cytosines in CpG dinucleotides are often methylated, and deamination of 5-methylcytosine (5mC) produces thymidine. Previous experiments have shown that DNA melting is the rate-limiting step in cytosine deamination. Here we show, through the analysis of human single-nucleotide polymorphisms (SNPs), that the mutation rate produced by 5mC deamination is highly dependent on local GC content. In fact, linear regression analysis showed that the log10 of the 5mC mutation rates (inferred from SNP frequencies) had slopes of −3 when graphed with respect to the GC content of neighboring sequences. This is the ideal slope that would be expected if the correlation between CpG underrepresentation and GC content had been solely caused by DNA melting. Moreover, this same result was obtained regardless of the SNP locations (all SNPs versus only SNPs in noncoding intergenic regions, excluding CpG islands) and regardless of the lengths over which GC content was calculated (SNP sequences with a modal length of 564 bp versus genomic contigs with a modal length of 163 kb). Several alternative interpretations are discussed.Keywords
This publication has 59 references indexed in Scilit:
- Distinct Changes of Genomic Biases in Nucleotide Substitution at the Time of Mammalian RadiationMolecular Biology and Evolution, 2003
- DNA Sequence Variation of Homo sapiensCold Spring Harbor Symposia on Quantitative Biology, 2003
- GenBankNucleic Acids Research, 2002
- Genetic variation of recent Alu insertions in human populationsJournal of Molecular Evolution, 1996
- THE HUMAN GENOME: Organization and Evolutionary HistoryAnnual Review of Genetics, 1995
- CpG islands, genes and isochores in the genomes of vertebratesGene, 1991
- Genetic exchange between endogenous and exogenous LINE-1 repetitive elements in mouse cellsNucleic Acids Research, 1990
- In Physiological Salt Conditions the Core Proteins of the Nucleosomes in Large Chromatin Fragments Denature at 73 °C and the DNA Unstacks at 85 °CJournal of Biological Chemistry, 1989
- The Mosaic Genome of Warm-Blooded VertebratesScience, 1985
- Increased G+C content of DNA stabilises methyl CpG dinucieotidesNucleic Acids Research, 1984