The role of insertions/deletions in the evolution of the intergenic region betweenpsbA andtrnH in the chloroplast genome

Abstract
TrnH and the intergenic region betweentrnH andpsbA of the chloroplast genomes of alfalfa (Medicago sativa), Fabaceae, and petunia (Petunia hybrida), Solanaceae, were sequenced and compared to published sequences of that region from other members of those families. A striking feature of these comparisons is the occurrence of insertions/deletions between short, nearly perfect AT-rich direct repeats. The directionality of these mutations in the petunia, tobacco andNicotiana debneyi lineages within the Solanaceae cannot be discerned. However, we present several alternative hypotheses that are consistent with Goodspeed's 1954 evolutionary treatment of the genusNicotiana and family Solanaceae. Within the Fabaceae, the major size differences in the intergenic region between alfalfa, pea and soybean are due to insertions/ deletions between direct repeats. The alfalfa intergenic region has an inverted repeat stem-loop structure of 210 bases directly 5′ totrnH. This structure is an insert relative to the liverwort.Marchantia polymorpha. Portions of the insert are found also in pea and soybean as well as in published sequences from other dicots representing diverse orders: petunia, tobacco,N. debneyi (Scrophulariales), spinach (Caryophyllales), and Brassica napus (Capparales). Some of the regions of the insert that are missing in these plants appear to have resulted from deletions of sequences between different imperfect direct repeats within, or 5′ to and within the insert. Other deletions are not flanked by repeated sequences. A shrot insert flanked by imperfect direct repeats inB. napus occurs just witin the longer alfalfa insert suggesting that both alfalfa andB. napus have remnants of an even longer insert relative toM. polymorpha. From these analyses we hypothesize the insertion of a stem-loop structure into anM. polymorpha-like ancestral land plant, followd by deletions of sequences, often between different imperfect direct repeats within and upstream of the insert, leading to thepsbA-trnH intergenic sequences represented by the present-day plants examined.