A problem in multivariate analysis of codon usage data and a possible solution

2 November 2005

journal article
Published by Wiley in FEBS Letters

Vol. 579 (28) , 6499-6504
https://doi.org/10.1016/j.febslet.2005.10.032

Abstract

Multivariate analyses are often used to identify major trends of variation in synonymous codon usage among genes. These analyses need to be performed on properly normalized codon usage data to avoid biases masking this synonymous variation, i.e., gene length, amino acid usage, and codon degeneracy; however, previous studies have failed to do so. In this paper, we demonstrate that the use of alternative normalized data (called 'relative adaptiveness' in the literature) can avoid all these biases and furthermore, can identify more trends of variation among genes, including GC-ending codon usage, GT-ending codon usage, and gene expression level.

Keywords

This publication has 20 references indexed in Scilit:

Variation in the strength of selected codon usage bias among bacteria
Nucleic Acids Research, 2005
GenBank
Nucleic Acids Research, 2004
Translational selection is operative for synonymous codon usage in Clostridium perfringens and Clostridium acetobutylicum
Microbiology, 2003
Use and misuse of correspondence analysis in codon usage studies
Nucleic Acids Research, 2002
Prokaryotic Genome Evolution as Assessed by Multivariate Analysis of Codon Usage Patterns
Microbial & Comparative Genomics, 1997
The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications
Nucleic Acids Research, 1987
Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes
Nucleic Acids Research, 1986
A simple method for displaying the hydropathic character of a protein
Journal of Molecular Biology, 1982
Codon catalog usage and the genome hypothesis
Nucleic Acids Research, 1980