Prediction of locally optimal splice sites in plant pre-mRNA with applications to gene identification in Arabidopsis thaliana genomic DNA
Open Access
- 1 October 1998
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 26 (20), 4748-4757
- https://doi.org/10.1093/nar/26.20.4748
Abstract
Prediction of splice site selection and efficiency from sequence inspection is of fundamental interest (testing the current knowledge of requisite sequence features) and practical importance (genome annotation, design of mutant or transgenic organisms). In plants, the dominant variables affecting splice site selection and efficiency include the degree of matching to the extended splice site consensus and the local gradient of U- and G+C-composition (introns being U-rich and exons G+C-rich). We present a novel method for splice site prediction, which was trained specifically for maize and Arabidopsis thaliana. The method extends our previous algorithm based on logitlinear models by considering three variables simultaneously: intrinsic splice site strength, local optimality and fit with respect to the overall splice pattern prediction. We show that the method considerably improves prediction specificity without compromising the high degree of sensitivity required in gene prediction algorithms. Applications to gene identification are illustrated for Arabidopsis and suggest that successful methods must combine scoring for splice sites, coding potential and similarity with potential homologs in non-trivial ways. A WWW version of the SplicePredictor program is available at http://gnomic.stanford.edu/~volker/SplicePredictor.html/
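To make two of the ideas in the abstract concrete, the following Python sketch scores candidate donor sites by the local U and G+C composition gradient and then keeps only sites that are locally optimal. It is an illustration under stated assumptions, not the SplicePredictor algorithm: the function names, the window and radius sizes, and the logistic coefficients are all hypothetical, and the sketch omits both matching to the extended splice site consensus and the fit to the overall splice pattern.

```python
import math


def fraction(seq, letters):
    """Fraction of positions in seq holding one of `letters` (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in letters for base in seq) / len(seq)


def donor_features(seq, i, window=50):
    """Composition contrasts around a candidate donor site whose GU starts at index i."""
    exon_side = seq[max(0, i - window):i]   # upstream of the GU, presumed exon
    intron_side = seq[i:i + window]         # from the GU onward, presumed intron
    # Introns are expected to be U-rich and exons G+C-rich, so both
    # contrasts should be positive at a plausible exon/intron boundary.
    u_gradient = fraction(intron_side, "U") - fraction(exon_side, "U")
    gc_gradient = fraction(exon_side, "GC") - fraction(intron_side, "GC")
    return u_gradient, gc_gradient


# Hypothetical coefficients chosen for illustration only; they are NOT
# the trained parameters of any published model.
INTERCEPT, W_U, W_GC = -2.0, 6.0, 4.0


def donor_score(seq, i, window=50):
    """Logistic score in (0, 1) computed from the two composition contrasts."""
    u_gradient, gc_gradient = donor_features(seq, i, window)
    z = INTERCEPT + W_U * u_gradient + W_GC * gc_gradient
    return 1.0 / (1.0 + math.exp(-z))


def locally_optimal_donors(seq, radius=20, window=50):
    """Return (index, score) for GU sites that no GU site within `radius` outscores."""
    rna = seq.upper().replace("T", "U")     # accept DNA or RNA input
    scored = {
        i: donor_score(rna, i, window)
        for i in range(len(rna) - 1)
        if rna[i:i + 2] == "GU"
    }
    return [
        (i, s)
        for i, s in sorted(scored.items())
        if all(s >= t for j, t in scored.items() if abs(j - i) <= radius)
    ]
```

For example, on a toy sequence made of a G+C-only stretch followed by a U-rich stretch, such as `"GCC" * 10 + "GTTTTATTTT" * 3`, the three GU dinucleotides at indices 30, 40 and 50 are all candidates, but only the one at the composition boundary (index 30) survives the local optimality filter with `radius=20`. A fuller treatment would add a consensus-matching term to the score and a global step that selects a consistent set of donor and acceptor sites.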