The complete nucleotide sequence of the cassava (Manihot esculenta) chloroplast genome and the evolution of atpF in Malpighiales: RNA editing and multiple losses of a group II intron
- 24 January 2008
- journal article
- research article
- Published by Springer Nature in Theoretical and Applied Genetics
- Vol. 116 (5) , 723-737
- https://doi.org/10.1007/s00122-007-0706-y
Abstract
The complete sequence of the chloroplast genome of cassava (Manihot esculenta, Euphorbiaceae) has been determined. The genome is 161,453 bp in length and includes a pair of inverted repeats (IR) of 26,954 bp. The genome includes 128 genes; 96 are single copy and 16 are duplicated in the IR. There are four rRNA genes and 30 distinct tRNAs, seven of which are duplicated in the IR. The infA gene is absent; expansion of IRb has duplicated 62 amino acids at the 3′ end of rps19 and a number of coding regions have large insertions or deletions, including insertions within the 23S rRNA gene. There are 17 intron-containing genes in cassava, 15 of which have a single intron while two (clpP, ycf3) have two introns. The usually conserved atpF group II intron is absent and this is the first report of its loss from land plant chloroplast genomes. The phylogenetic distribution of the atpF intron loss was determined by a PCR survey of 251 taxa representing 34 families of Malpighiales and 16 taxa from closely related rosids. The atpF intron is not only missing in cassava but also from closely related Euphorbiaceae and other Malpighiales, suggesting that there have been at least seven independent losses. In cassava and all other sequenced Malphigiales, atpF gene sequences showed a strong association between C-to-T substitutions at nucleotide position 92 and the loss of the intron, suggesting that recombination between an edited mRNA and the atpF gene may be a possible mechanism for the intron loss.Keywords
This publication has 97 references indexed in Scilit:
- Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patternsProceedings of the National Academy of Sciences, 2007
- Expression of cholera toxin B–proinsulin fusion protein in lettuce and tobacco chloroplasts – oral administration protects against development of insulitis in non‐obese diabetic micePlant Biotechnology Journal, 2007
- Transgene containment by maternal inheritance: Effective or elusive?Proceedings of the National Academy of Sciences, 2007
- Starch grain evidence for the preceramic dispersals of maize and root crops into tropical dry and humid forests of PanamaProceedings of the National Academy of Sciences, 2007
- The complete nucleotide sequence of the coffee (Coffea arabicaL.) chloroplast genome: organization and implications for biotechnology and phylogenetic relationships amongst angiospermsPlant Biotechnology Journal, 2007
- Comparative chloroplast genomics: analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthusBMC Genomics, 2007
- Arabidopsis Orthologs of Maize Chloroplast Splicing Factors Promote Splicing of Orthologous and Species-Specific Group II IntronsPlant Physiology, 2006
- Where Next for Genome Sequencing?Science, 2006
- An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IIBotanical Journal of the Linnean Society, 2003
- Molecular strategies for gene containment in transgenic cropsNature Biotechnology, 2002