MAGPIE/EGRET Annotation of the 2.9-Mb Drosophila melanogaster Adh Region
Open Access
- 1 April 2000
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 10 (4) , 502-510
- https://doi.org/10.1101/gr.10.4.502
Abstract
Our challenge in annotating the 2.91-Mb Adh region of theDrosophila melanogaster genome was to identify genetic and genomic features automatically, completely, and precisely within a 6-week period. To do so, we augmented the MAGPIE microbial genome annotation system to handle eukaryotic genomic sequence data. The new configuration required the integration of eukaryotic gene-finding tools and DNA repeat tools into the automatic data collection module. It also required us to define in MAGPIEnew strategies to combine data about eukaryotic exon predictions with functional data to refine the exon predictions. At the heart of the resulting new eukaryotic genome annotation system is a reverse comparison of public protein and complementary DNA sequences against the input genome to identify missing exons and to refine exon boundaries. The software modules that add eukaryotic genome annotation capability to MAGPIE are available as EGRET(Eukaryotic Genome RapidEvaluation Tool).Keywords
This publication has 18 references indexed in Scilit:
- Flexible Sequence Similarity Searching with the FASTA3 Program PackagePublished by Springer Nature ,1999
- Detecting Protein Function and Protein-Protein Interactions from Genome SequencesScience, 1999
- REPuter: fast computation of maximal repeats in complete genomes.Bioinformatics, 1999
- New features of the Blocks Database servers.Nucleic Acids Research, 1999
- PRINTS prepares for the new millenniumNucleic Acids Research, 1999
- Microbial Genescapes: Phyletic and Functional Patterns of ORF Distribution among ProkaryotesMicrobial & Comparative Genomics, 1998
- Microbial Genescapes: A Prokaryotic View of the Yeast GenomeMicrobial & Comparative Genomics, 1998
- Constructing Multigenome Views of Whole Microbial GenomesMicrobial & Comparative Genomics, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Fully automated genome analysis that reflects user needs and preferences. A detailed introduction to the MAGPIE system architectureBiochimie, 1996