Similar compositional biases are caused by very different mutational effects
- 26 October 2006
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 16 (12) , 1537-1547
- https://doi.org/10.1101/gr.5525106
Abstract
Compositional replication strand bias, commonly referred to as GC skew, is present in many genomes of prokaryotes, eukaryotes, and viruses. Although cytosine deamination in ssDNA (resulting in C→T changes on the leading strand) is often invoked as its major cause, the precise contributions of this and other substitution types are currently unknown. It is also unclear if the underlying mutational asymmetries are the same among taxa, are stable over time, or how closely the observed biases are to mutational equilibrium. We analyzed nearly neutral sites of seven taxa each with between three and six complete bacterial genomes, and inferred the substitution spectra of fourfold degenerate positions in nonhighly expressed genes. Using a bootstrap procedure, we extracted compositional biases associated with replication and identified the significant asymmetries. Although all taxa showed an overrepresentation of G relative to C on the leading strand (and imbalances between A and T), widely variable substitution asymmetries are noted. Surprisingly, all substitution types show significant asymmetry in at least one taxon, but none were universally biased in all taxa. Notably, in the two most biased genomes, A→G, rather than C→T, shapes the compositional bias. Given the variability in these biases, we propose that the process is multifactorial. Finally, we also find that most genomes are not at compositional equilibrium, and suggest that mutational-based heterotachy is deeply imprinted in the history of biological macromolecules. This shows that similar compositional biases associated with the same essential well-conserved process, replication, do not reflect similar mutational processes in different genomes, and that caution is required in inferring the roles of specific mutational biases on the basis of contemporary patterns of sequence composition.Keywords
This publication has 67 references indexed in Scilit:
- Comparisons of dN/dS are time dependent for closely related bacterial genomesJournal of Theoretical Biology, 2005
- Comparative and Evolutionary Analysis of the Bacterial Homologous Recombination SystemsPLoS Genetics, 2005
- Re-evaluating prokaryotic speciesNature Reviews Microbiology, 2005
- An Appraisal of the Potential for Illegitimate Recombination in Bacterial Genomes and Its Consequences: From Duplications to Genome ReductionGenome Research, 2003
- Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2)Nature, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- Asymmetries Generated by Transcription-Coupled Repair in Enterobacterial GenesScience, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Patterns of nucleotide substitution in pseudogenes and functional genesJournal of Molecular Evolution, 1982
- Molecular basis of base substitution hotspots in Escherichia coliNature, 1978