Evaluation of phylogenetic footprint discovery for predicting bacterial cis-regulatory elements and revealing their evolution
Open Access
- 23 January 2008
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 9 (1) , 1-26
- https://doi.org/10.1186/1471-2105-9-37
Abstract
The detection of conserved motifs in promoters of orthologous genes (phylogenetic footprints) has become a common strategy to predict cis-acting regulatory elements. Several software tools are routinely used to raise hypotheses about regulation. However, these tools are generally used as black boxes, with default parameters. A systematic evaluation of optimal parameters for a footprint discovery strategy can bring a sizeable improvement to the predictions. We evaluate the performances of a footprint discovery approach based on the detection of over-represented spaced motifs. This method is particularly suitable for (but not restricted to) Bacteria, since such motifs are typically bound by factors containing a Helix-Turn-Helix domain. We evaluated footprint discovery in 368 Escherichia coli K12 genes with annotated sites, under 40 different combinations of parameters (taxonomical level, background model, organism-specific filtering, operon inference). Motifs are assessed both at the levels of correctness and significance. We further report a detailed analysis of 181 bacterial orthologs of the LexA repressor. Distinct motifs are detected at various taxonomical levels, including the 7 previously characterized taxon-specific motifs. In addition, we highlight a significantly stronger conservation of half-motifs in Actinobacteria, relative to Firmicutes, suggesting an intermediate state in specificity switching between the two Gram-positive phyla, and thereby revealing the on-going evolution of LexA auto-regulation. The footprint discovery method proposed here shows excellent results with E. coli and can readily be extended to predict cis-acting regulatory signals and propose testable hypotheses in bacterial genomes for which nothing is known about regulation.Keywords
This publication has 86 references indexed in Scilit:
- MicroFootPrinter: a tool for phylogenetic footprinting in prokaryotic genomesNucleic Acids Research, 2006
- PhyloGibbs: A Gibbs Sampling Motif Finder That Incorporates PhylogenyPLoS Computational Biology, 2005
- Assessing computational tools for the discovery of transcription factor binding sitesNature Biotechnology, 2005
- Conservation and Evolution of Cis-Regulatory Systems in Ascomycete FungiPLoS Biology, 2004
- WebLogo: A Sequence Logo Generator: Figure 1Genome Research, 2004
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003
- Database resources of the National Center for Biotechnology InformationNucleic Acids Research, 2000
- Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitationNature Biotechnology, 1998
- Identification of High Affinity Binding Sites for LexA which Define New DNA Damage-inducible Genes in Escherichia coliJournal of Molecular Biology, 1994
- Basic local alignment search toolJournal of Molecular Biology, 1990