Universal patterns of purifying selection at noncoding positions in bacteria
Open Access
- 21 November 2007
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 18 (1) , 148-160
- https://doi.org/10.1101/gr.6759507
Abstract
To investigate the dependence of the number of regulatory sites per intergenic region on genome size, we developed a new method for detecting purifying selection at noncoding positions in clades of related bacterial genomes. We comprehensively quantified evidence of purifying selection at noncoding positions across bacteria and found several striking universal patterns. Consistent with selection acting at transcriptional regulatory elements near the transcription start, we find a universal positional profile of selection with respect to gene starts and ends, showing most evidence of selection immediately upstream and least immediately downstream from genes. A further set of universal features indicates that selection for translation initiation efficiency is the major determinant of the sequence composition around translation start in all clades. In addition to a peak in selection at ribosomal binding sites, the region immediately around translation start shows a universal pattern of high adenine frequency, significant selection at silent positions, and avoidance of RNA secondary structure. Surprisingly, although the number of transcription factors (TF) increases quadratically with genome size, we present several lines of evidence that small and large genomes have the same average number of regulatory sites per intergenic region. By comparing the sequence diversity of the most and least conserved DNA words in intergenic regions across clades we provide evidence that the structure of transcription regulatory networks changes dramatically with genome size: Small genomes have a small number of TFs with a large number of target sites, whereas large genomes have a large number of TFs with a small number of target sites each.Keywords
This publication has 53 references indexed in Scilit:
- Ubiquitous selective constraints in the Drosophila genome revealed by a genome-wide interspecies comparisonGenome Research, 2006
- PhyloGibbs: A Gibbs Sampling Motif Finder That Incorporates PhylogenyPLoS Computational Biology, 2005
- Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomesGenome Research, 2005
- Analyzing and enhancing mRNA translational efficiency in an Escherichia coli in vitro expression systemBiochemical and Biophysical Research Communications, 2004
- Patterns of Evolutionary Constraints in Intronic and Intergenic DNA of DrosophilaGenome Research, 2004
- The Pfam protein families databaseNucleic Acids Research, 2004
- T-coffee: a novel method for fast and accurate multiple sequence alignment 1 1Edited by J. ThorntonJournal of Molecular Biology, 2000
- Selection of DNA binding sites by regulatory proteinsJournal of Molecular Biology, 1987
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978