Abstract
Cornish-Bowden and Marson have recently suggested that the finite sampling component of Q, a measure of nonrandomness in the amino acid composition of proteins, may have been underestimated because it was calculated on the basis of the genetic code table frequencies rather than on the basis of the average natural abundance with which the twenty amino acids actually occur in proteins. This underestimate would lead to an overestimate of Qc a measure of selective effects above and beyond those imposed by the average natural abundance of the amino acids. In this paper the finite sampling component of Q is quantitatively estimated on the basis of these natural abundances and found to reduce Qc from its previous average value of 24.3 to the lower value of 9.7, with the standard deviation of the population of Qc values being 12.5. Individual Qc values are given for 81 protein families of mean composition per 61 codons of Ala5.3 Arg2.4 Asn3.0 Asp3.6 Cys1.5 Gln2.6 Glu3.5 Gly4.7 His1.3 Ile3.4 Leu4.5 Lys4.2 Met1.0 Phe2.3 Pro2.3 Ser4.2 Thr3.6 Trp0.8 Tyr2.6 Val4.2. The mean Qc value of 9.7 is notably small, and indicates that quantitatively minimal adjustments away from the average protein composition are necessary to maintain many different biological functions. This small value, however, is shown to differ significantly from the value of zero expected were the natural abundances of the amino acids the only selective constraint. These small deviations from the natural abundances are thus effectively selected for in the Darwinian sense.