A problem in multivariate analysis of codon usage data and a possible solution
- 2 November 2005
- journal article
- Published by Wiley in FEBS Letters
- Vol. 579 (28), 6499-6504
- https://doi.org/10.1016/j.febslet.2005.10.032
Abstract
Multivariate analyses are often used to identify major trends of variation in synonymous codon usage among genes. These analyses need to be performed on properly normalized codon usage data to avoid biases that mask this synonymous variation, namely gene length, amino acid usage, and codon degeneracy; however, previous studies have failed to do so. In this paper, we demonstrate that the use of alternative normalized data (called 'relative adaptiveness' in the literature) can avoid all these biases and, furthermore, can identify more trends of variation among genes, including GC-ending codon usage, GT-ending codon usage, and gene expression level.
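To make the normalization concrete, the following is a minimal Python sketch of 'relative adaptiveness' as it is commonly defined in the codon adaptation index literature: each codon's count in a gene is divided by the count of the most frequently used synonymous codon for the same amino acid. It is not taken from the paper itself. The function names `count_codons` and `relative_adaptiveness` are hypothetical, the standard genetic code is assumed, and the decisions to skip single-codon amino acids (Met, Trp), stop codons, and amino acids absent from the gene are assumptions of this sketch; the paper may treat those cases differently.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# gene uses the standard code; other translation tables would differ).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def count_codons(cds):
    """Count codons in a coding sequence read in frame from position 0."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    # Codons with ambiguous bases (e.g. N) are silently skipped.
    return Counter(c for c in codons if c in CODON_TO_AA)


def relative_adaptiveness(counts):
    """Divide each codon count by the largest count in its synonymous family.

    Assumptions of this sketch: stop codons and single-codon amino acids
    are excluded, and amino acids absent from the gene yield no values.
    """
    families = defaultdict(list)
    for codon, aa in CODON_TO_AA.items():
        if aa != "*":
            families[aa].append(codon)

    w = {}
    for aa, codons in families.items():
        if len(codons) < 2:
            continue  # Met and Trp carry no synonymous variation
        top = max(counts.get(c, 0) for c in codons)
        if top == 0:
            continue  # amino acid not used in this gene
        for c in codons:
            w[c] = counts.get(c, 0) / top
    return w


if __name__ == "__main__":
    # Toy sequence: Leu (CTG x3, CTT x1) and Glu (GAA x1, GAG x2).
    gene = "CTGCTGCTGCTTGAAGAGGAG"
    values = relative_adaptiveness(count_codons(gene))
    for codon in ("CTG", "CTT", "GAA", "GAG"):
        print(codon, round(values[codon], 3))
```

Because each value is a ratio within one synonymous family, it does not grow with gene length or with how often the amino acid itself is used, and the most used codon of every family scores 1 regardless of how many synonymous codons that family has. A genes-by-codons matrix of such values is the kind of input on which a multivariate analysis could then be run.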