Drosophila Genomic Sequence Annotation Using the BLOCKS+ Database
Open Access
- 1 April 2000
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 10 (4) , 543-546
- https://doi.org/10.1101/gr.10.4.543
Abstract
A simple and general homology-based method for gene finding was applied to the 2.9-Mb Drosophila melanogaster Adh region, the target sequence of the Genome Annotation Assessment Project (GASP). Each strand of the entire sequence was used as a query against the BLOCKS+ database of conserved regions of proteins. This led to functional assignments for more than one-third of the genes and two-thirds of the transposons. Considering the enormous size of the query, the fact that only two false-positive matches were reported emphasizes the high selectivity of protein family-based methods for gene finding. We used the search results to improve BLOCKS+ by identifying compositionally biased blocks. Our results confirm that protein family databases can be used effectively in automated sequence annotation efforts.
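The abstract does not spell out how a DNA strand is compared against protein blocks. One common approach, assumed here, is to translate each strand in three reading frames and search the resulting peptides. The sketch below illustrates only that translation step, using the standard genetic code; the function names (`reverse_complement`, `translate`, `six_frame`) are hypothetical and are not taken from the paper or from any BLOCKS+ software.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the opposite strand, read 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str, frame: int = 0) -> str:
    """Translate one reading frame (0, 1, or 2); unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(frame, len(seq) - 2, 3)
    )


def six_frame(seq: str) -> dict:
    """Map (strand, frame) to the conceptual translation for both strands."""
    strands = {"+": seq, "-": reverse_complement(seq)}
    return {
        (strand, frame): translate(s, frame)
        for strand, s in strands.items()
        for frame in range(3)
    }


if __name__ == "__main__":
    for key, peptide in six_frame("ATGGCCATTGTAATGGGCCGCTGA").items():
        print(key, peptide)
```

In a real pipeline, each translated frame would then be scored against the block database. For a megabase-scale query such as the Adh region, the sequence would likely need to be processed in overlapping windows, which this sketch does not attempt.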