trEST, trGEN and Hits: access to databases of predicted protein sequences
- 1 January 2001
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 29 (1) , 148-151
- https://doi.org/10.1093/nar/29.1.148
Abstract
High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent efficient and effective searches of HTG and EST data for protein sequence homologies by standard search methods. Here, we briefly describe three newly developed resources that should make discovery of interesting genes in these sequence classes easier in the future, especially to biologists not having access to a powerful local bioinformatics environment. trEST and trGEN are regularly regenerated databases of hypothetical protein sequences predicted from EST and HTG sequences, respectively. Hits is a web-based data retrieval and analysis system providing access to precomputed matches between protein sequences (including sequences from trEST and trGEN) and patterns and profiles from Prosite and Pfam. The three resources can be accessed via the Hits home page (http://hits.isb-sib.ch).Keywords
This publication has 14 references indexed in Scilit:
- The InterPro database, an integrated documentation resource for protein families, domains and functional sitesNucleic Acids Research, 2001
- Dotlet: diagonal plots in a Web browserBioinformatics, 2000
- Searching the expressed sequence tag (EST) databases: panning for genes.Briefings in Bioinformatics, 2000
- The Pfam Protein Families DatabaseNucleic Acids Research, 2000
- SMART: a web-based tool for the study of genetically mobile domainsNucleic Acids Research, 2000
- CAP3: A DNA Sequence Assembly ProgramGenome Research, 1999
- ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences.1999
- Pieces of the puzzle: expressed sequence tags and the catalog of human genesJournal of Molecular Medicine, 1997
- A flexible motif search technique based on generalized profilesComputers & Chemistry, 1996
- Hidden Markov Models in Computational BiologyJournal of Molecular Biology, 1994