SpliceMachine: predicting splice sites from high-dimensional local context representations
Open Access
- 25 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (8) , 1332-1338
- https://doi.org/10.1093/bioinformatics/bti166
Abstract
Motivation: In this age of complete genome sequencing, finding the location and structure of genes is crucial for further molecular research. The accurate prediction of intron boundaries largely facilitates the correct prediction of gene structure in nuclear genomes. Many tools for localizing these boundaries on DNA sequences have been developed and are available to researchers through the internet. Nevertheless, these tools still make many false positive predictions. Results: This manuscript presents a novel publicly available splice site prediction tool named SpliceMachine that (i) shows state-of-the-art prediction performance on Arabidopsis thaliana and human sequences, (ii) performs a computationally fast annotation and (iii) can be trained by the user on its own data. Availability: Results, figures and software are available at http://www.bioinformatics.psb.ugent.be/supplementary_data/ Contact: sven.degroeve@psb.ugent.be; yves.vandepeer@psb.ugent.beKeywords
This publication has 11 references indexed in Scilit:
- Current methods of gene prediction, their strengths and weaknessesNucleic Acids Research, 2002
- A computational analysis of sequence features involved in recognition of short intronsProceedings of the National Academy of Sciences, 2001
- GeneSplicer: a new computational method for splice site predictionNucleic Acids Research, 2001
- Gene structure prediction by spliced alignment of genomic DNA with protein sequences: increased accuracy by differential splice site scoringJournal of Molecular Biology, 2000
- Modeling splice sites with Bayes networksBioinformatics, 2000
- Evaluation of gene prediction software using a genomic data set: application to Arabidopsis thalianasequencesBioinformatics, 1999
- Wrappers for feature subset selectionArtificial Intelligence, 1997
- A branch point consensus from Arabidopsis found by non-circular analysis allows for better prediction of acceptor sitesNucleic Acids Research, 1997
- Improved Splice Site Detection in GenieJournal of Computational Biology, 1997
- Splice site prediction in Arabidopsis thaliana pre-mRNA by combining local and global sequence informationNucleic Acids Research, 1996