Majority of divergence between closely related DNA samples is due to indels
Open Access
- 2 April 2003
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 100 (8) , 4661-4665
- https://doi.org/10.1073/pnas.0330964100
Abstract
It was recently shown that indels are responsible for more than twice as many unmatched nucleotides as are base substitutions between samples of chimpanzee and human DNA. A larger sample has now been examined and the result is similar. The number of indels is ≈1/12th of the number of base substitutions and the average length of the indels is 36 nt, including indels up to 10 kb. The ratio (Ru) of unpaired nucleotides attributable to indels to those attributable to substitutions is 3.0 for this 2 million-nt chimp DNA sample compared with human. There is similar evidence of a large value of Ru for sea urchins from the polymorphism of a sample of Strongylocentrotus purpuratus DNA (Ru = 3–4). Other work indicates that similarly, per nucleotide affected, large differences are seen for indels in the DNA polymorphism of the plant Arabidopsis thaliana (Ru = 51). For the insect Drosophila melanogaster a high value of Ru (4.5) has been determined. For the nematode Caenorhabditis elegans the polymorphism data are incomplete but high values of Ru are likely. Comparison of two strains of Escherichia coli O157:H7 shows a preponderance of indels. Because these six examples are from very distant systematic groups the implication is that in general, for alignments of closely related DNA, indels are responsible for many more unmatched nucleotides than are base substitutions. Human genetic evidence suggests that indels are a major source of gene defects, indicating that indels are a significant source of evolutionary change.Keywords
This publication has 25 references indexed in Scilit:
- Human Diallelic Insertion/Deletion PolymorphismsAmerican Journal of Human Genetics, 2002
- Genomewide Comparison of DNA Sequences between Humans and ChimpanzeesAmerican Journal of Human Genetics, 2002
- Arabidopsis Map-Based Cloning in the Post-Genome EraPlant Physiology, 2002
- Strains of Escherichia coli O157:H7 Differ Primarily by Insertions or Deletions, Not Single-Nucleotide PolymorphismsJournal of Bacteriology, 2002
- Diversification of Escherichia coli genomes: are bacteriophages the major contributors?Trends in Microbiology, 2001
- Genome sequence of enterohaemorrhagic Escherichia coli O157:H7Nature, 2001
- Complete Genome Sequence of Enterohemorrhagic Eschelichia coli O157:H7 and Genomic Comparison with a Laboratory Strain K-12DNA Research, 2001
- Molecular Definition of Pericentric Inversion Breakpoints Occurring during the Evolution of Humans and ChimpanzeesGenomics, 1998
- The size distribution of insertions and deletions in human and rodent pseudogenes suggests the logarithmic gap penalty for sequence alignmentJournal of Molecular Evolution, 1995
- DNA Divergence Among HominoidsEvolution, 1989