Accurate identification of alternatively spliced exons using support vector machine
Open Access
- 5 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (7) , 897-901
- https://doi.org/10.1093/bioinformatics/bti132
Abstract
Motivation: Alternative splicing is a major component of the regulatory action on mammalian transcriptomes. It is estimated that over half of all human genes have more than one splice variant. Previous studies have shown that alternatively spliced exons possess several features that distinguish them from constitutively spliced ones. Recently, we have demonstrated that such features can be used to distinguish alternative from constitutive exons. In the current study, we used advanced machine learning methods to generate robust classifier of alternative exons. Results: We extracted several hundred local sequence features of constitutive as well as alternative exons. Using feature selection methods we find seven attributes that are dominant for the task of classification. Several less informative features help to slightly increase the performance of the classifier. The classifier achieves a true positive rate of 50% for a false positive rate of 0.5%. This result enables one to reliably identify alternatively spliced exons in exon databases that are believed to be dominated by constitutive exons. Availability: Upon request from the authors. Contact:gideon@mta.ac.ilKeywords
This publication has 27 references indexed in Scilit:
- A Non-EST-Based Method for Exon-Skipping PredictionGenome Research, 2004
- Mismatch string kernels for discriminative protein classificationBioinformatics, 2004
- Sequence Information for the Splicing of Human Pre-mRNA Identified by Support Vector Machine ClassificationGenome Research, 2003
- Intronic Sequences Flanking Alternatively Spliced Exons Are Conserved Between Human and MouseGenome Research, 2003
- Selecting for Functional Alternative Splices in ESTsGenome Research, 2002
- Alternative pre-mRNA splicing and proteome expansion in metazoansNature, 2002
- Listening to silence and understanding nonsense: exonic mutations that affect splicingNature Reviews Genetics, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- A Discriminative Framework for Detecting Remote Protein HomologiesJournal of Computational Biology, 2000
- A compensatory base change in U1 snRNA suppresses a 5′ splice site mutationCell, 1986