TomatEST database: in silico exploitation of EST data to explore expression patterns in tomato species
Open Access
- 16 November 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (Database), D901-D905
- https://doi.org/10.1093/nar/gkl921
Abstract
TomatEST is a secondary database integrating expressed sequence tag (EST)/cDNA sequence information from different libraries of multiple tomato species. Redundant EST collections from each species are organized into clusters (gene indices). A cluster consists of one or multiple contigs. Multiple contigs in a cluster represent alternatively transcribed forms of a gene. The set of stand-alone EST sequences (singletons) and contigs, representing all the computationally defined ‘Transcript Indices’, is annotated according to similarity to entries in protein and RNA family databases. Sequence function description is integrated with the Gene Ontologies and the Enzyme Commission identifiers for a standard classification of gene products and for the mapping of the expressed sequences onto metabolic pathways. Information on the origin of the ESTs, on their structural features, on clusters and contigs, as well as on functional annotations is accessible via a user-friendly web interface. Specific facilities in the database allow Transcript Indices from a query to be automatically classified into Enzyme classes and metabolic pathways. The ‘on the fly’ mapping onto the metabolic maps is integrated in the analytical tools. The TomatEST database website is freely available at .
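The organization described in the abstract (clusters made of contigs, singletons, and Enzyme Commission annotation of the resulting Transcript Indices) can be illustrated with a minimal Python sketch. This is not the TomatEST schema or its code: the class names, field names, identifiers such as `CL1_CTG1`, and the EC assignments are all hypothetical, chosen only to show how Transcript Indices might be grouped by top-level enzyme class.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Optional


@dataclass
class TranscriptIndex:
    """A contig or a singleton (hypothetical model, not the TomatEST schema)."""
    identifier: str
    est_ids: List[str]
    ec_number: Optional[str] = None  # e.g. "1.1.1.1"; None if no enzyme annotation

    @property
    def is_singleton(self) -> bool:
        # A singleton is a stand-alone EST that was not assembled into a contig.
        return len(self.est_ids) == 1


@dataclass
class Cluster:
    """A gene index: one or more contigs built from redundant ESTs."""
    identifier: str
    contigs: List[TranscriptIndex] = field(default_factory=list)

    @property
    def has_alternative_transcripts(self) -> bool:
        # Several contigs in one cluster are read as alternative transcripts.
        return len(self.contigs) > 1


def group_by_enzyme_class(indices: Iterable[TranscriptIndex]) -> Dict[str, List[str]]:
    """Group annotated Transcript Indices by the first field of their EC number."""
    classes: Dict[str, List[str]] = {}
    for ti in indices:
        if ti.ec_number is None:
            continue  # unannotated entries are skipped
        top_level = ti.ec_number.split(".")[0]
        classes.setdefault(top_level, []).append(ti.identifier)
    return classes


# Illustrative data only; identifiers and EC assignments are made up.
cluster = Cluster(
    "CL1",
    contigs=[
        TranscriptIndex("CL1_CTG1", ["EST_A", "EST_B"], ec_number="1.1.1.1"),
        TranscriptIndex("CL1_CTG2", ["EST_C", "EST_D"], ec_number="1.1.1.1"),
    ],
)
singletons = [
    TranscriptIndex("EST_E", ["EST_E"], ec_number="3.2.1.4"),
    TranscriptIndex("EST_F", ["EST_F"]),
]

# Transcript Indices are the union of all contigs and all singletons.
transcript_indices = cluster.contigs + singletons
by_class = group_by_enzyme_class(transcript_indices)
```

In this sketch the first field of an EC number stands in for the enzyme class (1 for oxidoreductases, 3 for hydrolases, and so on); mapping onto metabolic pathways would need an external pathway resource and is left out.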