Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome
Open Access
- 30 December 2002
- journal article
- research article
- Published by Springer Nature in Genome Biology
Abstract
It is widely accepted that comparative sequence data can aid the functional annotation of genome sequences; however, the most informative species and features of genome evolution for comparison remain to be determined. We analyzed conservation in eight genomic regions (apterous, even-skipped, fushi tarazu, twist, and Rhodopsins 1, 2, 3 and 4) from four Drosophila species (D. erecta, D. pseudoobscura, D. willistoni, and D. littoralis) covering more than 500 kb of the D. melanogaster genome. All D. melanogaster genes (and 78-82% of coding exons) identified in divergent species such as D. pseudoobscura show evidence of functional constraint. Addition of a third species can reveal functional constraint in otherwise non-significant pairwise exon comparisons. Microsynteny is largely conserved, with rearrangement breakpoints, novel transposable element insertions, and gene transpositions occurring in similar numbers. Rates of amino-acid substitution are higher in uncharacterized genes relative to genes that have previously been studied. Conserved non-coding sequences (CNCSs) tend to be spatially clustered with conserved spacing between CNCSs, and clusters of CNCSs can be used to predict enhancer sequences. Our results provide the basis for choosing species whose genome sequences would be most useful in aiding the functional annotation of coding and cis-regulatory sequences in Drosophila. Furthermore, this work shows how decoding the spatial organization of conserved sequences, such as the clustering of CNCSs, can complement efforts to annotate eukaryotic genomes on the basis of sequence conservation alone.