The Use of MPSS for Whole-Genome Transcriptional Analysis in Arabidopsis
Open Access
- 2 August 2004
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (8) , 1641-1653
- https://doi.org/10.1101/gr.2275604
Abstract
We have generated 36,991,173 17-base sequence “signatures” representing transcripts from the model plant Arabidopsis. These data were derived by massively parallel signature sequencing (MPSS) from 14 libraries and comprised 268,132 distinct sequences. Comparable data were also obtained with 20-base signatures. We developed a method for handling these data and for comparing these signatures to the annotated Arabidopsis genome. As part of this procedure, 858,019 potential or “genomic” signatures were extracted from the Arabidopsis genome and classified based on the position and orientation of the signatures relative to annotated genes. A comparison of genomic and expressed signatures matched 67,735 signatures predicted to be derived from distinct transcripts and expressed at significant levels. Expressed signatures were derived from the sense strand of at least 19,088 of 29,084 annotated genes. A comparison of the genomic and expression signatures demonstrated that ∼7.7% of genomic signatures were underrepresented in the expression data. These genomic signatures contained one of 20 four-base words that were consistently associated with reduced MPSS abundances. More than 89% of the sum of the expressed signature abundances matched the Arabidopsis genome, and many of the unmatched signatures found in high abundances were predicted to match to previously uncharacterized transcripts.
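The extraction of potential "genomic" signatures described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes that each signature begins at a GATC site (the DpnII recognition sequence commonly used to anchor MPSS signatures, which the abstract itself does not state) and that both strands are scanned. The function names, the `ANCHOR` constant, and the output tuple layout are hypothetical and chosen only for illustration.

```python
import re

SIGNATURE_LENGTH = 17  # signature length given in the abstract
ANCHOR = "GATC"        # ASSUMPTION: DpnII anchor site, not stated in the abstract

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def genomic_signatures(sequence, length=SIGNATURE_LENGTH, anchor=ANCHOR):
    """Yield (position, strand, signature) for every anchored window.

    position is the 0-based start of the window in forward-strand
    coordinates; strand is "+" or "-". Windows that run off the end of
    the sequence or contain non-ACGT characters are skipped.
    """
    sequence = sequence.upper()
    n = len(sequence)
    # A zero-width lookahead lets re.finditer report overlapping anchor sites.
    pattern = re.compile("(?=%s)" % re.escape(anchor))
    for strand, seq in (("+", sequence), ("-", reverse_complement(sequence))):
        for match in pattern.finditer(seq):
            start = match.start()
            signature = seq[start:start + length]
            if len(signature) < length or set(signature) - set("ACGT"):
                continue
            # Convert reverse-strand offsets back to forward coordinates.
            position = start if strand == "+" else n - start - length
            yield position, strand, signature
```

Each yielded tuple could then be compared against annotated gene coordinates to classify a signature by position and orientation, and looked up in a table of expressed signatures to find genomic matches.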