Sorghum Genome Sequencing by Methylation Filtration
Open Access
- 4 January 2005
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Biology
- Vol. 3 (1) , e13
- https://doi.org/10.1371/journal.pbio.0030013
Abstract
Sorghum bicolor is a close relative of maize and is a staple crop in Africa and much of the developing world because of its superior tolerance of arid growth conditions. We have generated sequence from the hypomethylated portion of the sorghum genome by applying methylation filtration (MF) technology. The evidence suggests that 96% of the genes have been sequence tagged, with an average coverage of 65% across their length. Remarkably, this level of gene discovery was accomplished after generating a raw coverage of less than 300 megabases of the 735-megabase genome. MF preferentially captures exons and introns, promoters, microRNAs, and simple sequence repeats, and minimizes interspersed repeats, thus providing a robust view of the functional parts of the genome. The sorghum MF sequence set is beneficial to research on sorghum and is also a powerful resource for comparative genomics among the grasses and across the entire plant kingdom. Thousands of hypothetical gene predictions in rice and Arabidopsis are supported by the sorghum dataset, and genomic similarities highlight evolutionarily conserved regions that will lead to a better understanding of rice and Arabidopsis.Keywords
This publication has 57 references indexed in Scilit:
- Sequencing the maize genomeCurrent Opinion in Plant Biology, 2004
- MicroRNAs in plantsGenes & Development, 2002
- Microsatellites are preferentially associated with nonrepetitive DNA in plant genomesNature Genetics, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- Variability in CpNpG methylation in higher plant genomesGene, 1997
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997
- mCCG methylation in angiospermsThe Plant Journal, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Key Features of Cereal Genome Organization as Revealed by the Use of Cytosine Methylation-Sensitive Restriction EndonucleasesGenomics, 1993
- Identification of protein coding regions by database similarity searchNature Genetics, 1993