TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)
Open Access
- 1 January 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (Database) , D445-D451
- https://doi.org/10.1093/nar/gkl770
Abstract
The TBestDB database contains ∼370 000 clustered expressed sequence tag (EST) sequences from 49 organisms, covering a taxonomically broad range of poorly studied, mainly unicellular eukaryotes, and includes experimental information, consensus sequences, gene annotations and metabolic pathway predictions. Most of these ESTs have been generated by the Protist EST Program, a collaboration among six Canadian research groups. EST sequences are read from trace files up to a minimum quality cut-off, vector and linker sequence is masked, and the ESTs are clustered using phrap. The resulting consensus sequences are automatically annotated by using the AutoFACT program. The datasets are automatically checked for clustering errors due to chimerism and potential cross-contamination between organisms, and suspect data are flagged in or removed from the database. Access to data deposited in TBestDB by individual users can be restricted to those users for a limited period. With this first report on TBestDB, we open the database to the research community for free processing, annotation, interspecies comparisons and GenBank submission of EST data generated in individual laboratories. For instructions on submission to TBestDB, contact tbestdb@bch.umontreal.ca. The database can be queried at .Keywords
This publication has 35 references indexed in Scilit:
- dictyBase, the model organism database for Dictyostelium discoideumNucleic Acids Research, 2006
- CryptoDB: a Cryptosporidium bioinformatics resource updateNucleic Acids Research, 2006
- TcruziDB: an integrated, post-genomics community resource for Trypanosoma cruziNucleic Acids Research, 2006
- Multiple Metabolic Roles for the Nonphotosynthetic Plastid of the Green Alga Prototheca wickerhamiiEukaryotic Cell, 2005
- The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene OntologyNucleic Acids Research, 2004
- The COG database: an updated version includes eukaryotesBMC Bioinformatics, 2003
- The Closest Unicellular Relatives of AnimalsCurrent Biology, 2002
- KEGG: Kyoto Encyclopedia of Genes and GenomesNucleic Acids Research, 2000
- Mitochondrial EvolutionScience, 1999
- Base-Calling of Automated Sequencer Traces Using Phred. II. Error ProbabilitiesGenome Research, 1998