De Novo Assembly of Chickpea Transcriptome Using Short Reads for Gene Discovery and Marker Identification
Open Access
- 7 January 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in DNA Research
- Vol. 18 (1), 53-63
- https://doi.org/10.1093/dnares/dsq028
Abstract
Chickpea ranks third among food legume crops in world production. However, the genomic resources available for chickpea are still very limited. In the present study, the transcriptome of chickpea was sequenced with short reads on the Illumina Genome Analyzer platform. We assessed the effect of sequence quality, various assembly parameters and assembly programs on the final assembly output. Using Velvet followed by Oases with optimal parameters, we assembled ∼107 million high-quality trimmed reads into a non-redundant set of 53 409 transcripts (≥100 bp), representing about 28 Mb of unique transcriptome sequence. The transcripts had an average length of 523 bp and an N50 length of 900 bp, with a coverage of 25.7 rpkm (reads per kilobase per million). At the protein level, a total of 45 636 (85.5%) chickpea transcripts showed significant similarity with unigenes/predicted proteins from other legumes or sequenced plant genomes. Functional categorization revealed the conservation of genes involved in various biological processes in chickpea. In addition, we identified simple sequence repeat motifs in transcripts. The chickpea transcript set generated here provides a resource for gene discovery and development of functional molecular markers. In addition, the strategy for de novo assembly of transcriptome data presented here will be helpful in other similar transcriptome studies.
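The summary statistics quoted in the abstract (average length, N50 and rpkm) can be illustrated with a minimal Python sketch. It assumes the common definitions: N50 as the length at which the longest transcripts together cover at least half of the assembled bases, and rpkm as reads per kilobase of transcript per million mapped reads. The function names and the toy numbers are hypothetical and are not taken from the study, and the abstract does not state exactly how the authors computed these values.

```python
def n50(lengths):
    """Return N50: walking from the longest transcript down, the length at
    which the running total first reaches half of all assembled bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("lengths must be non-empty")


def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Toy transcript lengths in bp (illustrative only, not data from the paper).
toy_lengths = [100, 200, 300, 400, 1000]
mean_length = sum(toy_lengths) / len(toy_lengths)

print("mean length:", mean_length)
print("N50:", n50(toy_lengths))
print("rpkm:", rpkm(read_count=50, transcript_length_bp=1000,
                    total_mapped_reads=1_000_000))
```

Because N50 weights long transcripts more heavily than the mean does, an assembly can report an N50 well above its average length, as in the 900 bp versus 523 bp figures above.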