Comparative analyses of six solanaceous transcriptomes reveal a high degree of sequence conservation and species-specific transcripts
Open Access
- 14 September 2005
- journal article
- research article
- Published by Springer Nature in BMC Genomics
- Vol. 6 (1) , 124
- https://doi.org/10.1186/1471-2164-6-124
Abstract
Background: The Solanaceae is a family of closely related species with diverse phenotypes that have been exploited for agronomic purposes. Previous studies involving a small number of genes suggested sequence conservation across the Solanaceae. The availability of large collections of Expressed Sequence Tags (ESTs) for the Solanaceae now provides the opportunity to assess sequence conservation and divergence on a genomic scale. Results: All available ESTs and Expressed Transcripts (ETs), 449,224 sequences for six Solanaceae species (potato, tomato, pepper, petunia, tobacco and Nicotiana benthamiana), were clustered and assembled into gene indices. Examination of gene ontologies revealed that the transcripts within the gene indices encode a similar suite of biological processes. Although the ESTs and ETs were derived from a variety of tissues, 55–81% of the sequences had significant similarity at the nucleotide level with sequences among the six species. Putative orthologs could be identified for 28–58% of the sequences. This high degree of sequence conservation was supported by expression profiling using heterologous hybridizations to potato cDNA arrays that showed similar expression patterns in mature leaves for all six solanaceous species. 16–19% of the transcripts within the six Solanaceae gene indices did not have matches among Solanaceae, Arabidopsis, rice or 21 other plant gene indices. Conclusion: Results from this genome scale analysis confirmed a high level of sequence conservation at the nucleotide level of the coding sequence among Solanaceae. Additionally, the results indicated that part of the Solanaceae transcriptome is likely to be unique for each species.Keywords
This publication has 41 references indexed in Scilit:
- The Institute for Genomic Research Osa1 Rice Genome Annotation DatabasePlant Physiology, 2005
- Gene expression profiling of potato responses to cold, heat, and salt stressFunctional & Integrative Genomics, 2005
- The Genomes of Oryza sativa: A History of DuplicationsPLoS Biology, 2005
- Analysis of the transcriptional complexity of Arabidopsis thaliana by massively parallel signature sequencingNature Biotechnology, 2004
- Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray ExperimentsStatistical Applications in Genetics and Molecular Biology, 2004
- Comparative Analyses of Potato Expressed Sequence Tag LibrariesPlant Physiology, 2003
- Cross-Referencing Eukaryotic Genomes: TIGR Orthologous Gene Alignments (TOGA)Genome Research, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Nuclear DNA content of some important plant speciesPlant Molecular Biology Reporter, 1991