Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions
Open Access
- 1 October 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 28 (19) , 3801-3810
- https://doi.org/10.1093/nar/28.19.3801
Abstract
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.Keywords
This publication has 38 references indexed in Scilit:
- Evidence for a High Frequency of Simultaneous Double-Nucleotide SubstitutionsScience, 2000
- Tendency for local repetitiveness in amino acid usages in modern proteinsJournal of Molecular Biology, 1999
- Biological Implications of the DNA Structures Associated with Disease-Causing Triplet RepeatsAmerican Journal of Human Genetics, 1999
- CTG repeats associated with human genetic disease are inherently flexibleJournal of Molecular Biology, 1998
- Impact of changes in GC content on the silent molecular clock in muridsGene, 1997
- Genetic control of microsatellite stabilityMutation Research/DNA Repair, 1997
- Network analysis of human Y microsatellite haplotypesHuman Molecular Genetics, 1996
- Distribution of Trinucleotide Microsatellites in Different Categories of Mammalian Genomic Sequence: Implications for Human Genetic DiseasesGenomics, 1994
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Rates of DNA Sequence Evolution Differ Between Taxonomic GroupsScience, 1986