Whole Genome Sequence Comparisons and “Full-Length” cDNA Sequences: A Combined Approach to Evaluate and Improve Arabidopsis Genome Annotation
Open Access
- 1 March 2004
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (3) , 406-413
- https://doi.org/10.1101/gr.1515604
Abstract
To evaluate the existing annotation of the Arabidopsis genome further, we generated a collection of evolutionary conserved regions (ecores) between Arabidopsis and rice. The ecore analysis provides evidence that the gene catalog of Arabidopsis is not yet complete, and that a number of these annotations require re-examination. To improve the Arabidopsis genome annotation further, we used a novel “full-length” enriched cDNA collection prepared from several tissues. An additional 1931 genes were covered by new “full-length” cDNA sequences, raising the number of annotated genes with a corresponding “full-length” cDNA sequence to about 14,000. Detailed comparisons between these “full-length” cDNA sequences and annotated genes show that this resource is very helpful in determining the correct structure of genes, in particular, those not yet supported by “full-length” cDNAs. In addition, a total of 326 genomic regions not included previously in the Arabidopsis genome annotation were detected by this cDNA resource, providing clues for new gene discovery. Because, as expected, the two data sets only partially overlap, their combination produces very useful information for improving the Arabidopsis genome annotation.Keywords
This publication has 30 references indexed in Scilit:
- Empirical Analysis of Transcriptional Activity in the Arabidopsis GenomeScience, 2003
- Improving the Arabidopsis genome annotation using maximal transcript alignment assembliesNucleic Acids Research, 2003
- Alternative splicing and proteome diversity in plants: the tip of the iceberg has just emergedTrends in Plant Science, 2003
- Assessing the Drosophila melanogaster and Anopheles
gambiae Genome Annotations Using Genome-Wide Sequence ComparisonsGenome Research, 2003
- Systematic Discovery of New Genes in the Saccharomyces cerevisiae GenomeGenome Research, 2003
- RIKEN Arabidopsis full-length cDNA databaseTrends in Plant Science, 2002
- Functional Annotation of a Full-Length Arabidopsis cDNA CollectionScience, 2002
- A Draft Sequence of the Rice Genome ( Oryza sativa L. ssp. japonica )Science, 2002
- In Search of the First Flower: A Jurassic Angiosperm, Archaefructus , from Northeast ChinaScience, 1998
- Normalization and subtraction: two approaches to facilitate gene discovery.Genome Research, 1996