Estimating the “Effective Number of Codons”: The Wright Way of Determining Codon Homozygosity Leads to Superior Estimates

Open Access

1 February 2006

journal article
research article
Published by Oxford University Press (OUP) in Genetics

Vol. 172 (2), 1301-1307
https://doi.org/10.1534/genetics.105.049643

Abstract

In 1990, Frank Wright introduced a method for measuring synonymous codon usage bias in a gene by estimation of the “effective number of codons,” N_c. Several attempts have been made recently to improve Wright's estimate of N_c, but the methods that work in cases where a gene encodes a protein not containing all amino acids with degenerate codons have not been tested against each other. In this article I derive five new estimators of N_c and test them together with the two published estimators, using resampling under rigorous testing conditions. Estimation of codon homozygosity, F, turns out to be a key to the estimation of N_c. F can be estimated in two closely related ways, corresponding to sampling with or without replacement, the latter being what Wright used. The N_c methods that are based on sampling without replacement showed much better accuracy at short gene lengths than those based on sampling with replacement, indicating that Wright's homozygosity method is superior. Surprisingly, the methods based on sampling with replacement displayed a superior correlation with mRNA levels in Escherichia coli.
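The two homozygosity estimators contrasted in the abstract can be illustrated with a minimal sketch for a single amino acid. The function names and the example counts are hypothetical and are not taken from the article. The with-replacement form is the plain sum of squared codon frequencies. The without-replacement form is written here as the probability that two codons drawn without replacement are identical, which is algebraically the same as the expression (n Σp² − 1)/(n − 1) in Wright (1990).

```python
def homozygosity_with_replacement(counts):
    """Estimate F as the sum of squared codon frequencies.

    `counts` holds the observed counts of each synonymous codon
    for one amino acid (hypothetical input format).
    """
    n = sum(counts)
    if n == 0:
        raise ValueError("no codons observed for this amino acid")
    return sum((c / n) ** 2 for c in counts)


def homozygosity_without_replacement(counts):
    """Estimate F as the chance that two codons drawn without
    replacement are the same; equals (n * sum(p^2) - 1) / (n - 1)."""
    n = sum(counts)
    if n < 2:
        raise ValueError("at least two codons are needed")
    return sum(c * (c - 1) for c in counts) / (n * (n - 1))


# Hypothetical example: a two-fold degenerate amino acid used 4 times,
# 3 times with one codon and once with the other.
example_counts = [3, 1]
f_with = homozygosity_with_replacement(example_counts)
f_without = homozygosity_without_replacement(example_counts)
print(f_with, f_without)
```

With only four observations the two estimates differ noticeably, which is consistent with the abstract's point that the choice of estimator matters most at short gene lengths. The difference shrinks as the number of observed codons grows.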

This publication has 13 references indexed in Scilit:

On the methodological weakness of ‘the effective number of codons’: a reply to Marashi and Najafabadi
Biochemical and Biophysical Research Communications, 2005
Correlation of codon bias measures with mRNA levels: analysis of transcriptome data from Escherichia coli
Biochemical and Biophysical Research Communications, 2005
How reliable re-adjustment is: correspondence regarding A. Fuglsang, “The ‘effective number of codons’ revisited”
Biochemical and Biophysical Research Communications, 2004
The ‘effective number of codons’ revisited
Biochemical and Biophysical Research Communications, 2004
An Evaluation of Measures of Synonymous Codon Usage Bias
Journal of Molecular Evolution, 1998
The ‘effective number of codons’ used in a gene
Gene, 1990
The codon adaptation index - a measure of directional synonymous codon usage bias, and its potential applications
Nucleic Acids Research, 1987
Codon usage and tRNA content in unicellular and multicellular organisms.
Molecular Biology and Evolution, 1985
Codon usage in bacteria: correlation with gene expressivity
Nucleic Acids Research, 1982
Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes
Journal of Molecular Biology, 1981