Rapid detection and curation of conserved DNA via enhanced-BLAT and EvoPrinterHD analysis
Open Access
- 28 February 2008
- journal article
- Published by Springer Nature in BMC Genomics
- Vol. 9 (1), 106
- https://doi.org/10.1186/1471-2164-9-106
Abstract
Multi-genome comparative analysis has yielded important insights into the molecular details of gene regulation. We have developed EvoPrinter, a web-accessed genomics tool that provides a single uninterrupted view of conserved sequences as they appear in a species of interest. An EvoPrint reveals with near base-pair resolution those sequences that are essential for gene function. We describe here EvoPrinterHD, a second-generation comparative genomics tool that automatically generates, from a single input sequence, an enhanced view of sequence conservation between evolutionarily distant species. Currently available for 5 nematode, 3 mosquito, 12 Drosophila, 20 vertebrate, 17 Staphylococcus and 20 enteric bacteria genomes, EvoPrinterHD employs a modified BLAT algorithm, enhanced-BLAT (eBLAT), which detects up to 75% more conserved bases than the BLAT alignments used in the earlier EvoPrinter program. The new program also identifies conserved sequences within rearranged DNA, highlights repetitive DNA, and detects sequencing gaps. EvoPrinterHD currently holds over 112 billion bp of indexed genomes in memory and allows a subset of genomes to be selected for analysis. An EvoDifferences profile is also generated to portray conserved sequences that are uniquely lost in any one of the orthologs. Finally, EvoPrinterHD incorporates options that allow for (1) re-initiation of the analysis using a different genome's aligning region as the reference DNA to detect species-specific changes in less-conserved regions, (2) rapid extraction and curation of conserved sequences, and (3) for bacteria, identification of unique or uniquely shared sequences present in subsets of genomes. EvoPrinterHD is a fast, high-resolution comparative genomics tool that automatically generates an uninterrupted species-centric view of sequence conservation and enables the discovery of conserved sequences within rearranged DNA.
When combined with cis-Decoder, a program that discovers sequence elements shared among tissue-specific enhancers, EvoPrinterHD facilitates the analysis of conserved sequences that are essential for coordinate gene regulation.
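The species-centric view and the EvoDifferences profile described in the abstract can be illustrated with a small sketch. This is not the eBLAT algorithm or EvoPrinterHD's implementation. It assumes, purely for illustration, that each pairwise alignment has already been reduced to a per-base boolean mask over the reference sequence (True where the reference base is conserved in that species). The function names `evoprint` and `evodifferences`, the species labels, and the uppercase/lowercase rendering are all hypothetical choices.

```python
from typing import Dict, List


def evoprint(reference: str, masks: Dict[str, List[bool]]) -> str:
    """Render the reference with bases conserved in every species in
    uppercase and all other bases in lowercase (illustrative convention)."""
    out = []
    for i, base in enumerate(reference):
        conserved_everywhere = all(mask[i] for mask in masks.values())
        out.append(base.upper() if conserved_everywhere else base.lower())
    return "".join(out)


def evodifferences(reference: str, masks: Dict[str, List[bool]]) -> Dict[str, List[int]]:
    """For each species, list reference positions that are conserved in
    all other species but not in that one (a 'uniquely lost' base)."""
    lost: Dict[str, List[int]] = {name: [] for name in masks}
    for i in range(len(reference)):
        missing = [name for name, mask in masks.items() if not mask[i]]
        if len(missing) == 1:
            lost[missing[0]].append(i)
    return lost


# Hypothetical toy data: an 8-base reference and three species masks.
reference = "ACGTACGT"
masks = {
    "species_a": [True, True, True, True, False, False, True, True],
    "species_b": [True, True, False, True, False, True, True, True],
    "species_c": [True, True, True, True, False, True, True, False],
}

print(evoprint(reference, masks))        # ACgTacGt
print(evodifferences(reference, masks))  # {'species_a': [5], 'species_b': [2], 'species_c': [7]}
```

Position 4 is unaligned in all three toy species, so it appears in lowercase in the composite view but is not reported as uniquely lost in any single species.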