Many species in one: DNA barcoding overestimates the number of species when nuclear mitochondrial pseudogenes are coamplified
Top Cited Papers
- 9 September 2008
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 105 (36) , 13486-13491
- https://doi.org/10.1073/pnas.0803076105
Abstract
Nuclear mitochondrial pseudogenes (numts) are nonfunctional copies of mtDNA in the nucleus that have been found in major clades of eukaryotic organisms. They can be easily coamplified with orthologous mtDNA by using conserved universal primers; however, this is especially problematic for DNA barcoding, which attempts to characterize all living organisms by using a short fragment of the mitochondrial cytochromecoxidase I (COI) gene. Here, we study the effect of numts on DNA barcoding based on phylogenetic and barcoding analyses of numt and mtDNA sequences in two divergent lineages of arthropods: grasshoppers and crayfish. Single individuals from both organisms have numts of the COI gene, many of which are highly divergent from orthologous mtDNA sequences, and DNA barcoding analysis incorrectly overestimates the number of unique species based on the standard metric of 3% sequence divergence. Removal of numts based on a careful examination of sequence characteristics, including indels, in-frame stop codons, and nucleotide composition, drastically reduces the incorrect inferences of the number of unique species, but even such rigorous quality control measures fail to identify certain numts. We also show that the distribution of numts is lineage-specific and the presence of numts cannot be knowna priori. Whereas DNA barcoding strives for rapid and inexpensive generation of molecular species tags, we demonstrate that the presence of COI numts makes this goal difficult to achieve when numts are prevalent and can introduce serious ambiguity into DNA barcoding.Keywords
This publication has 33 references indexed in Scilit:
- DNA barcoding cannot reliably identify species of the blowfly genusProtocalliphora(Diptera: Calliphoridae)Proceedings Of The Royal Society B-Biological Sciences, 2007
- Problems with DNA barcodes for species delimitation: ‘Ten species’ ofAstraptes fulgeratorreassessed (Lepidoptera: Hesperiidae)Systematics and Biodiversity, 2006
- Identification of Birds through DNA BarcodesPLoS Biology, 2004
- Origin of intra-individual variation in PCR-amplified mitochondrial cytochrome oxidase I of Thrips tabaci (Thysanoptera: Thripidae): mitochondrial heteroplasmy or nuclear integration?Hereditas, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- NUMTs in Sequenced Eukaryotic GenomesMolecular Biology and Evolution, 2004
- Biological identifications through DNA barcodesProceedings Of The Royal Society B-Biological Sciences, 2003
- Mitochondrial pseudogenes: evolution's misplaced witnessesPublished by Elsevier ,2001
- Recent stable insertion of mitochondrial DNA into an Arabidopsis polyubiquitin gene by nonhomologous recombination.Plant Cell, 1993
- Mitochondrial DNA sequences in the nuclear genome of a locustNature, 1983