An integrated computational pipeline and database to support whole-genome sequence annotation
Open Access
- 23 December 2002
- journal article
- review article
- Published by Springer Nature in Genome Biology
Abstract
We describe here our experience in annotating the Drosophila melanogaster genome sequence, in the course of which we developed several new open-source software tools and a database schema to support large-scale genome annotation. We have developed these into an integrated and reusable software system for whole-genome annotation. The key contributions to overall annotation quality are the marshalling of high-quality sequences for alignments and the design of a system with an adaptable and expandable architecture.