Conservation of adjacency as evidence of paralogous operons
- 11 October 2004
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 32 (18) , 5392-5397
- https://doi.org/10.1093/nar/gkh882
Abstract
Most of the analyses on the conservation of gene order are limited to orthologous genes. However, the organization of genes into operons might also result in the conservation of gene order of paralogous genes. Thus, we sought computational evidence that conservation of gene order of paralogous genes represents another level of conservation of genes in operons. We found that pairs of genes within experimentally characterized operons of Escherichia coli K12 and Bacillus subtilis tend to have more adjacently conserved paralogs than pairs of genes at transcription unit boundaries. The fraction of same strand gene pairs corresponding to conserved paralogs averages 0.07 with a maximum of 0.22 in Borrelia burgdorferi. The use of evidence from the conservation of adjacency of paralogous genes can improve the prediction of operons in E.coli K12 by approximately 0.27 over predictions using conservation of adjacency of orthologous genes alone.Keywords
This publication has 51 references indexed in Scilit:
- The SUPERFAMILY database in 2004: additions and improvementsNucleic Acids Research, 2004
- GenProtEC: an updated and improved analysis of functions of Escherichia coli K-12 proteinsNucleic Acids Research, 2004
- Two Paralogous Families of a Two-Gene Subtilisin Operon Are Widely Distributed in Oral TreponemesJournal of Bacteriology, 2003
- Evolution of transcription factors and the gene regulatory network in Escherichia coliNucleic Acids Research, 2003
- Evolutionary history, structural features and biochemical diversity of the NlpC/P60 superfamily of enzymesGenome Biology, 2003
- Lateral gene transfer and ancient paralogy of operons containing redundant copies of tryptophan-pathway genes in Xylellaspecies and in heterocystous cyanobacteriaGenome Biology, 2003
- Orthology, paralogy and proposed classification for paralog subtypesPublished by Elsevier ,2002
- The complete genomic sequence of Mycoplasma penetrans, an intracellular bacterial pathogen in humansNucleic Acids Research, 2002
- The roles of the polytopic membrane proteins NarK, NarU and NirC in Escherichia coli K‐12: two nitrate and three nitrite transportersMolecular Microbiology, 2002
- Complete genomes in WWW Entrez: data representation and analysis.Bioinformatics, 1999