Natural selection of protein structural and functional properties: a single nucleotide polymorphism perspective
Open Access
- 8 April 2008
- journal article
- Published by Springer Nature in Genome Biology
- Vol. 9 (4), R69
- https://doi.org/10.1186/gb-2008-9-4-r69
Abstract
Background: The rates of molecular evolution for protein-coding genes depend on the stringency of functional or structural constraints. The Ka/Ks ratio has been commonly used as an indicator of selective constraints and is typically calculated from interspecies alignments. The recent accumulation of single nucleotide polymorphism (SNP) data has enabled the derivation of Ka/Ks ratios for polymorphism (SNP A/S ratios).
Results: Using data from the dbSNP database, we conducted the first large-scale survey of SNP A/S ratios for different structural and functional properties. We confirmed that the SNP A/S ratio is largely correlated with Ka/Ks for divergence. We observed stronger selective constraints for proteins that have high mRNA expression levels or broad expression patterns, have no paralogs, arose earlier in evolution, have natively disordered regions, are located in the cytoplasm and nucleus, or are related to human diseases. At the residue level, we found higher degrees of variation for residues that are exposed to solvent, are in a loop conformation, are in natively disordered or low-complexity regions, or are in the signal peptides of secreted proteins. Our analysis also revealed that histones and protein kinases are among the protein families under the strongest selective constraints, whereas olfactory and taste receptors are among the most variable groups.
Conclusion: Our study suggests that the SNP A/S ratio is a robust measure of selective constraints. The correlations between SNP A/S ratios and other variables provide valuable insights into the natural selection of various structural or functional properties, particularly for human-specific genes and constraints within the human lineage.
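To make the idea of an SNP A/S ratio concrete, the following is a minimal Python sketch, not the authors' pipeline. It assumes that "A" denotes amino-acid-altering (nonsynonymous) SNPs and "S" denotes synonymous SNPs, and that the ratio is a raw count ratio; the abstract does not state whether counts are normalized by the number of nonsynonymous and synonymous sites, so a real analysis may differ. The function names `classify_snp` and `snp_as_ratio` and the example SNPs are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_snp(codon, position, alt_base):
    """Return 'S' if the substitution is synonymous, else 'A'.

    codon: reference codon (three letters from TCAG)
    position: 0-based index of the variant base within the codon
    alt_base: alternative allele at that position
    """
    mutated = codon[:position] + alt_base + codon[position + 1:]
    same_residue = CODON_TABLE[codon] == CODON_TABLE[mutated]
    return "S" if same_residue else "A"


def snp_as_ratio(snps):
    """Raw A/S count ratio for (codon, position, alt_base) tuples.

    Assumption: no per-site normalization is applied here.
    Returns None when there are no synonymous SNPs.
    """
    counts = {"A": 0, "S": 0}
    for codon, position, alt_base in snps:
        counts[classify_snp(codon, position, alt_base)] += 1
    if counts["S"] == 0:
        return None
    return counts["A"] / counts["S"]


# Made-up SNPs for illustration only (not data from the study).
example_snps = [
    ("GAA", 2, "G"),  # GAA (E) -> GAG (E): synonymous
    ("GAA", 0, "A"),  # GAA (E) -> AAA (K): amino-acid altering
    ("CTT", 2, "C"),  # CTT (L) -> CTC (L): synonymous
    ("ATG", 2, "A"),  # ATG (M) -> ATA (I): amino-acid altering
    ("GGT", 2, "A"),  # GGT (G) -> GGA (G): synonymous
]
ratio = snp_as_ratio(example_snps)
```

Under this simplified definition, a lower ratio for a group of proteins or residues would indicate that amino-acid-altering variants are relatively depleted, which is the sense in which the abstract speaks of stronger selective constraints.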