Implementation of a classification-based prediction model for plant mRNA Poly(A) sites
- 1 September 2008
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The poly(A) site of a messenger RNA (mRNA) defines the end of a transcript during eukaryotic gene expression. Finding poly(A) sites in genome sequences can help to annotate the ends of genes and predict alternative polyadenylation. However, it is challenging to predict plant poly(A) sites using computational methods because of the weak signals that determine the poly(A) sites. Here we describe a classification based plant poly(A) site recognition model. First, several feature representation methods like factorial moments, M encoding, and weight of signal patterns are adopted to describe the makeup of nucleotide sequences of poly(A) signals. Then, a training model using different classification algorithms like Bayesian network is built as a testing model to predict plant mRNA poly(A) sites. Comparing to previous plant poly(A) sites prediction software PASS that we developed, the recognition model introduced here has better performance, flexibility and expansibility.Keywords
This publication has 7 references indexed in Scilit:
- Unique Features of Nuclear mRNA Poly(A) Signals and Alternative Polyadenylation in Chlamydomonas reinhardtiiGenetics, 2008
- Genome level analysis of rice mRNA 3′-end processing signals and alternative polyadenylationNucleic Acids Research, 2008
- Statistical and Dynamical Equivalence of Different Elementary CellsJournal of Computational and Theoretical Nanoscience, 2007
- Predictive modeling of plant messenger RNA polyadenylation sitesBMC Bioinformatics, 2007
- Sequence analysis of mRNA polyadenylation signals of rice genesChinese Science Bulletin, 2006
- Compilation of mRNA Polyadenylation Signals in Arabidopsis Revealed a New Signal Element and Potential Secondary StructuresPlant Physiology, 2005
- Bayesian Network ClassifiersMachine Learning, 1997