Identification of ribosome binding sites inEscherichia coliusing neural network models
- 1 January 1995
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 23 (9) , 1632-1639
- https://doi.org/10.1093/nar/23.9.1632
Abstract
This study investigated the use of neural networks in the identification of Escherichia coli ribosome binding sites. The recognition of these sites based on primary sequence data is difficult due to the multiple determinants that define them. Additionally, secondary structure plays a significant role in the determination of the site and this information is difficult to include in the models. Efforts to solve this problem have so far yielded poor results. A new compilation of E. coli ribosome binding sites was generated for this study. Feedforward backpropagation networks were applied to their identification. Perceptrons were also applied, since they have been the previous best method since 1982. Evaluation of performance for all the neural networks and perceptrons was determined by ROC analysis. The neural network provided significant improvement in the recognition of these sites when compared with the previous best method, finding less than half the number of false positives when both models were adjusted to find an equal number of actual sites. The best neural network used an input window of 101 nucleotides and a single hidden layer of 9 units. Both the neural network and the perceptron trained on the new compilation performed better than the original perceptron published by Stormo et al. in 1982.Keywords
This publication has 45 references indexed in Scilit:
- Prediction of the disulfide-bonding state of cysteine in proteinsProtein Engineering, Design and Selection, 1990
- Predicting surface exposure of amino acids from protein sequenceProtein Engineering, Design and Selection, 1990
- Improvements in protein secondary structure prediction by an enhanced neural networkJournal of Molecular Biology, 1990
- Evaluation of neural network performance by receiver operating characteristic (ROC) analysis: examples from the biotechnology domainComputer Methods and Programs in Biomedicine, 1990
- Neural Network Models for Promoter RecognitionJournal of Biomolecular Structure and Dynamics, 1989
- Consensus methods for finding and ranking DNA binding sitesJournal of Molecular Biology, 1989
- ESCHERICHIA-COLI PROMOTERS .2. A SPACING CLASS-DEPENDENT PROMOTER SEARCH PROTOCOL1989
- An additional ribosome-binding site on mRNA of highly expressed genes and a bifunctional site on the colicin fragment of 16S rRNA fromEscherichia coli: important determinants of the efficiency of translation-initiationNucleic Acids Research, 1989
- The GenBank®genetic sequence data bankNucleic Acids Research, 1988
- Targeted random mutagenesis: the use of ambiguously synthesized oligonucleotides to mutagenize sequences immediately 5‘ of an ATG initiation codonNucleic Acids Research, 1983