Prediction of mRNA polyadenylation sites by support vector machine
Open Access
- 26 July 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 22 (19) , 2320-2325
- https://doi.org/10.1093/bioinformatics/btl394
Abstract
MRNA polyadenylation is responsible for the 3′ end formation of most mRNAs in eukaryotic cells and is linked to termination of transcription. Prediction of mRNA polyadenylation sites [poly(A) sites] can help identify genes, define gene boundaries, and elucidate regulatory mechanisms. Current methods for poly(A) site prediction achieve moderate sensitivity and specificity. Here, we present a method using support vector machine for poly(A) site prediction. Using 15 cis-regulatory elements that are over-represented in various regions surrounding poly(A) sites, this method achieves higher sensitivity and similar specificity when compared with polyadq, a common tool for poly(A) site prediction. In addition, we found that while the polyadenylation signal AAUAAA and U-rich elements are primary determinants for poly(A) site prediction, other elements contribute to both sensitivity and specificity of the prediction, indicating a combinatorial mechanism involving multiple elements when choosing poly(A) sites in human cells. Contact:btian@umdnj.eduKeywords
This publication has 28 references indexed in Scilit:
- Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylationRNA, 2005
- Analysis of a noncanonical poly(A) site reveals a tripartite mechanism for vertebrate poly(A) site recognitionGenes & Development, 2005
- Connections between mRNA 3′ end processing and transcription terminationCurrent Opinion in Cell Biology, 2005
- Computational analysis of 3′-ends of ESTs shows four classes of alternative polyadenylation in human, mouse, and ratGenome Research, 2005
- New perspectives on connecting messenger RNA 3′ end formation to transcriptionCurrent Opinion in Cell Biology, 2004
- Sequence Information for the Splicing of Human Pre-mRNA Identified by Support Vector Machine ClassificationGenome Research, 2003
- Variations in yeast 3′-processing cis-elements correlate with transcript stabilityTrends in Genetics, 2003
- An mRNA Surveillance Mechanism That Eliminates Transcripts Lacking Termination CodonsScience, 2002
- A rare polyadenylation signal mutation of the FOXP3 gene (AAUAAA→AAUGAA) leads to the IPEX syndromeImmunogenetics, 2001
- Support-vector networksMachine Learning, 1995