Quality assessment of maize assembled genomic islands (MAGIs) and large-scale experimental verification of predicted genes
- 15 August 2005
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 102 (34) , 12282-12287
- https://doi.org/10.1073/pnas.0503394102
Abstract
Recent sequencing efforts have targeted the gene-rich regions of the maize ( Zea mays L.) genome. We report the release of an improved assembly of maize assembled genomic islands (MAGIs). The 114,173 resulting contigs have been subjected to computational and physical quality assessments. Comparisons to the sequences of maize bacterial artificial chromosomes suggest that at least 97% (160 of 165) of MAGIs are correctly assembled. Because the rates at which junction-testing PCR primers for genomic survey sequences (90-92%) amplify genomic DNA are not significantly different from those of control primers (≈91%), we conclude that a very high percentage of genic MAGIs accurately reflect the structure of the maize genome. EST alignments, ab initio gene prediction, and sequence similarity searches of the MAGIs are available at the Iowa State University MAGI web site. This assembly contains 46,688 ab initio predicted genes. The expression of almost half (628 of 1,369) of a sample of the predicted genes that lack expression evidence was validated by RT-PCR. Our analyses suggest that the maize genome contains between ≈33,000 and ≈54,000 expressed genes. Approximately 5% (32 of 628) of the maize transcripts discovered do not have detectable paralogs among maize ESTs or detectable homologs from other species in the GenBank NR nucleotide/protein database. Analyses therefore suggest that this assembly of the maize genome contains approximately 350 previously uncharacterized expressed genes. We hypothesize that these “orphans” evolved quickly during maize evolution and/or domestication.Keywords
This publication has 23 references indexed in Scilit:
- Sorghum Genome Sequencing by Methylation FiltrationPLoS Biology, 2005
- Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene SpacePlant Physiology, 2004
- Sequence composition, organization, and evolution of the core Triticeae genomeThe Plant Journal, 2004
- Types and Frequencies of Sequencing Errors in Methyl-Filtered and High C0t Maize Genome Survey SequencesPlant Physiology, 2004
- Widespread Paleopolyploidy in Model Plant Species Inferred from Age Distributions of Duplicate Genes[W]Plant Cell, 2004
- Sequencing the maize genomeCurrent Opinion in Plant Biology, 2004
- O-Glucosylation of cis-Zeatin in Maize. Characterization of Genes, Enzymes, and Endogenous CytokininsPlant Physiology, 2003
- A Draft Sequence of the Rice Genome ( Oryza sativa L. ssp. indica )Science, 2002
- National Science Foundation-Sponsored Workshop Report. Maize Genome Sequencing ProjectPlant Physiology, 2001
- DNA sequence quality trimming and vector removalBioinformatics, 2001