mGene.web: a web service for accurate computational gene finding
Open Access
- 3 June 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (Web Server) , W312-W316
- https://doi.org/10.1093/nar/gkp479
Abstract
We describe mGene.web, a web service for the genome-wide prediction of protein coding genes from eukaryotic DNA sequences. It offers pre-trained models for the recognition of gene structures including untranslated regions in an increasing number of organisms. With mGene.web, users have the additional possibility to train the system with their own data for other organisms on the push of a button, a functionality that will greatly accelerate the annotation of newly sequenced genomes. The system is built in a highly modular way, such that individual components of the framework, like the promoter prediction tool or the splice site predictor, can be used autonomously. The underlying gene finding system mGene is based on discriminative machine learning techniques and its high accuracy has been demonstrated in an international competition on nematode genomes. mGene.web is available at http://www.mgene.org/web, it is free of charge and can be used for eukaryotic genomes of small to moderate size (several hundred Mbp).Keywords
This publication has 17 references indexed in Scilit:
- nGASP – the nematode genome annotation assessment projectBMC Bioinformatics, 2008
- Support Vector Machines and Kernels for Computational BiologyPLoS Computational Biology, 2008
- Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised trainingGenome Research, 2008
- Steady progress and recent breakthroughs in the accuracy of automated genome annotationNature Reviews Genetics, 2008
- Accurate splice site prediction using support vector machinesBMC Bioinformatics, 2007
- Conrad: Gene prediction using conditional random fieldsGenome Research, 2007
- Global Discriminative Learning for Higher-Accuracy Computational Gene PredictionPLoS Computational Biology, 2007
- Improving the Caenorhabditis elegans Genome Annotation Using Machine LearningPLoS Computational Biology, 2007
- ARTS: accurate recognition of transcription starts in humanBioinformatics, 2006
- Using Multiple Alignments to Improve Gene PredictionJournal of Computational Biology, 2006