ProTISA: a comprehensive resource for translation initiation site annotation in prokaryotic genomes
Open Access
- 16 October 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (Database) , D114-D119
- https://doi.org/10.1093/nar/gkm799
Abstract
Correct annotation of translation initiation site (TIS) is essential for both experiments and bioinformatics studies of prokaryotic translation initiation mechanism as well as understanding of gene regulation and gene structure. Here we describe a comprehensive database ProTISA, which collects TIS confirmed through a variety of available evidences for prokaryotic genomes, including Swiss-Prot experiments record, literature, conserved domain hits and sequence alignment between orthologous genes. Moreover, by combining the predictions from our recently developed TIS post-processor, ProTISA provides a refined annotation for the public database RefSeq. Furthermore, the database annotates the potential regulatory signals associated with translation initiation at the TIS upstream region. As of July 2007, ProTISA includes 440 microbial genomes with more than 390 000 confirmed TISs. The database is available at http://mech.ctb.pku.edu.cn/protisaKeywords
This publication has 25 references indexed in Scilit:
- Large-Scale Identification of N-Terminal Peptides in the Halophilic Archaea Halobacterium salinarum and Natronomonas pharaonisJournal of Proteome Research, 2007
- WebLogo: A Sequence Logo Generator: Figure 1Genome Research, 2004
- GS-Finder: a program to find bacterial gene start sites with a self-training methodThe International Journal of Biochemistry & Cell Biology, 2003
- EasyGene – a prokaryotic gene finder that ranks ORFs by statistical significanceBMC Bioinformatics, 2003
- Correlations between Shine-Dalgarno Sequences and Gene Features Such as Predicted Expression Levels and Operon StructuresJournal of Bacteriology, 2002
- Leaderless mRNAs in bacteria: surprises in ribosomal recruitment and translational controlMolecular Microbiology, 2002
- A probabilistic method for identifying start codons in bacterial genomesBioinformatics, 2001
- GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regionsNucleic Acids Research, 2001
- EcoGene: a genome sequence database for Escherichia coli K-12Nucleic Acids Research, 2000
- Compilation and analysis of DNA sequences associated with apparent streptomycete promotersNucleic Acids Research, 1992