PlantProm: a database of plant promoter sequences
- 1 January 2003
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (1) , 114-117
- https://doi.org/10.1093/nar/gkg041
Abstract
PlantProm DB, a plant promoter database, is an annotated, non-redundant collection of proximal promoter sequences for RNA polymerase II with experimentally determined transcription start site(s), TSS, from various plant species. The first release (2002.01) of PlantProm DB contains 305 entries including 71, 220 and 14 promoters from monocot, dicot and other plants, respectively. It provides DNA sequence of the promoter regions (-200 : +51) with TSS on the fixed position +201, taxonomic/promoter type classification of promoters and Nucleotide Frequency Matrices (NFM) for promoter elements: TATA-box, CCAAT-box and TSS-motif (Inr). Analysis of TSS-motifs revealed that their composition is different in dicots and monocots, as well as for TATA and TATA-less promoters. The database serves as learning set in developing plant promoter prediction programs. One such program (TSSP) based on discriminant analysis has been created by Softberry Inc. and the application of a support ftp: vector machine approach for promoter identification is under development. PlantProm DB is available at http://mendel.cs.rhul.ac.uk/ and http://www.softberry.com/.Keywords
This publication has 13 references indexed in Scilit:
- A Draft Sequence of the Rice Genome ( Oryza sativa L. ssp. japonica )Science, 2002
- A Draft Sequence of the Rice Genome ( Oryza sativa L. ssp. indica )Science, 2002
- TRANSCompel(R): a database on composite regulatory elements in eukaryotic genesNucleic Acids Research, 2002
- PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequencesNucleic Acids Research, 2002
- The Eukaryotic Promoter Database, EPD: new entry types and links to gene expression dataNucleic Acids Research, 2002
- Transcription Regulatory Regions Database (TRRD): its status in 2002Nucleic Acids Research, 2002
- The TRANSFAC system on gene expression regulationNucleic Acids Research, 2001
- Object-oriented Transcription Factors Database (ooTFD)Nucleic Acids Research, 2000
- Plant cis-acting regulatory DNA elements (PLACE) database: 1999Nucleic Acids Research, 1999
- Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragmentsJournal of Molecular Biology, 1992