The distinctive signatures of promoter regions and operon junctions across prokaryotes
Open Access
- 12 August 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (14) , 3980-3987
- https://doi.org/10.1093/nar/gkl563
Abstract
Here we show that regions upstream of first transcribed genes have oligonucleotide signatures that distinguish them from regions upstream of genes in the middle of operons. Databases of experimentally confirmed transcription units do not exist for most genomes. Thus, to expand the analyses into genomes with no experimentally confirmed data, we used genes conserved adjacent in evolutionarily distant genomes as representatives of genes inside operons. Likewise, we used divergently transcribed genes as representative examples of first transcribed genes. In model organisms, the trinucleotide signatures of regions upstream of these representative genes allow for operon predictions with accuracies close to those obtained with known operon data (0.8). Signature-based operon predictions have more similar phylogenetic profiles and higher proportions of genes in the same pathways than predicted transcription unit boundaries (TUBs). These results confirm that we are separating genes with related functions, as expected for operons, from genes not necessarily related, as expected for genes in different transcription units. We also test the quality of the predictions using microarray data in six genomes and show that the signature-predicted operons tend to have high correlations of expression. Oligonucleotide signatures should expand the number of tools available to identify operons even in poorly characterized genomes.Keywords
This publication has 47 references indexed in Scilit:
- Conservation of adjacency as evidence of paralogous operonsNucleic Acids Research, 2004
- Genome Update: promoter profilesMicrobiology, 2004
- Structure and evolution of transcriptional regulatory networksPublished by Elsevier ,2004
- Evolution of Protein Superfamilies and Bacterial Genome SizeJournal of Molecular Biology, 2004
- PREDICTING THE OPERON STRUCTURE OF BACILLUS SUBTILIS USING OPERON LENGTH, INTERGENE DISTANCE, AND GENE EXPRESSION INFORMATIONPacific Symposium on Biocomputing, 2003
- A probabilistic learning approach to whole-genome operon prediction.2000
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997
- Activation of Transcription at σ54-dependent Promoters on Linear Templates Requires Intrinsic or Induced Bending of the DNAJournal of Molecular Biology, 1996
- σs-dependent promoters inEscherichia coliare located in DNA regions with intrinsic curvatureNucleic Acids Research, 1993
- Sequence distributions associated with DNA curvature are found upstream of strongE. colipromotersNucleic Acids Research, 1987