Positive Selection Differs between Protein Secondary Structure Elements in Drosophila
Open Access
- 1 January 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Genome Biology and Evolution
- Vol. 2, 166-179
- https://doi.org/10.1093/gbe/evq008
Abstract
Different protein secondary structure elements have different physicochemical properties and roles in the protein, which may determine their evolutionary flexibility. However, it is not clear to what extent protein structure affects the way Darwinian selection acts at the amino acid level. Using phylogeny-based likelihood tests for positive selection, we have examined the relationship between protein secondary structure and selection across six species of Drosophila. We find that amino acids that form disordered regions, such as random coils, are far more likely to be under positive selection than expected from their proportion in the proteins, and residues in helices and β-structures are subject to less positive selection than predicted. In addition, it appears that sites undergoing positive selection are more likely than expected to occur close to one another in the protein sequence. Finally, on a genome-wide scale, we have determined that positively selected sites are found more frequently toward the gene ends. Our results demonstrate that protein structures with a greater degree of organization and strong hydrophobicity, represented here as helices and β-structures, are less tolerant to molecular adaptation than disordered, hydrophilic regions, across a diverse set of proteins.Keywords
This publication has 68 references indexed in Scilit:
- Estimates of the Effect of Natural Selection on Protein-Coding ContentMolecular Biology and Evolution, 2009
- The intrinsic conformational propensities of the 20 naturally occurring amino acids and reflection of these propensities in proteinsProceedings of the National Academy of Sciences, 2008
- Evolution of protein-coding genes in DrosophilaTrends in Genetics, 2008
- Roles of β‐turns in protein folding: From peptide models to protein engineeringBiopolymers, 2008
- Evolution of genes and genomes on the Drosophila phylogenyNature, 2007
- PAML 4: Phylogenetic Analysis by Maximum LikelihoodMolecular Biology and Evolution, 2007
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selectionNature, 1988
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983