Abundance, Distribution, and Transcriptional Activity of Repetitive Elements in the Maize Genome
Open Access
- 20 September 2001
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 11 (10) , 1660-1676
- https://doi.org/10.1101/gr.188201
Abstract
Long terminal repeat (LTR) retrotransposons have been shown to make up much of the maize genome. Although these elements are known to be prevalent in plant genomes of a middle-to-large size, little information is available on the relative proportions composed by specific families of elements in a single genome. We sequenced a library of randomly sheared genomic DNA from maize to characterize this genome. BLAST analysis of these sequences demonstrated that the maize genome is composed of diverse sequences that represent numerous families of retrotransposons. The largest families contain the previously described elements Huck, Ji, and Opie. Approximately 5% of the sequences are predicted to encode proteins. The genomic abundance of 16 families of elements was estimated by hybridization to an array of 10,752 maize bacterial artificial chromosome (BAC) clones. Comparisons of the number of elements present on individual BACs indicated that retrotransposons are in general randomly distributed across the maize genome. A second library was constructed that was selected to contain sequences hypomethylated in the maize genome. Sequence analysis of this library indicated that retroelements abundant in the genome are poorly represented in hypomethylated regions. Fifty-six retroelement sequences corresponding to the integrase and reverse transcriptase domains were isolated from ∼407,000 maize expressed sequence tags (ESTs). Phylogenetic analysis of these and the genomic retroelement sequences indicated that elements most abundant in the genome are less abundant at the transcript level than are more rare retrotransposons. Additional phylogenies also demonstrated that rice and maize retrotransposon families are frequently more closely related to each other than to families within the same species. An analysis of the GC content of the maize genomic library and that of maize ESTs did not support recently published data that the gene space in maize is found within a narrow GC range, but does indicate that genic sequences have a higher GC content than intergenic sequences (52% vs. 47% GC).Keywords
This publication has 68 references indexed in Scilit:
- Initial sequencing and analysis of the human genomeNature, 2001
- Generation and Analysis of 25 Mb of Genomic DNA from the Pufferfish Fugu rubripes by Sequence ScanningGenome Research, 1999
- Gypsy‐like retrotransposons are widespread in the plant kingdomThe Plant Journal, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Do Plants Have a One-Way Ticket to Genomic Obesity?Plant Cell, 1997
- Characterization of four dispersed repetitive DNA sequences fromZea maysand their use in constructing contiguous DNA fragments using YAC clonesGenome, 1996
- Complete Sequence of the Maize Chloroplast Genome: Gene Content, Hotspots of Divergence and Fine Tuning of Genetic Information by Transcript EditingJournal of Molecular Biology, 1995
- Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genomeNature, 1993
- Nuclear DNA content of some important plant speciesPlant Molecular Biology Reporter, 1991
- THE ISOCHORE ORGANIZATION OF THE HUMAN GENOMEAnnual Review of Genetics, 1989