Use and misuse of correspondence analysis in codon usage studies
- 15 October 2002
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 30 (20), 4548-4555
- https://doi.org/10.1093/nar/gkf565
Abstract
Correspondence analysis has frequently been used for codon usage studies, but this method is often misused. Because amino acid composition exerts constraints on codon usage, it is common to use tables containing relative codon frequencies (or ratios of frequencies) instead of simple codon counts to get rid of these amino acid biases. The problem is that some important properties of correspondence analysis, such as row weighting, are lost in the process. Moreover, the use of relative measures sometimes introduces other biases and often diminishes the quantity of information to analyse, occasionally resulting in interpretation errors. For instance, in the case of an organism such as Borrelia burgdorferi, the use of relative measures led to the conclusion that there was no translational selection, while analyses based on codon counts show that there is a possibility of a selective effect at that level. In this paper, we expose these problems and propose alternative strategies to correspondence analysis for studying codon usage biases when amino acid composition effects must be removed.
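The loss of row weighting mentioned in the abstract can be illustrated with a minimal sketch. It assumes a toy table of two genes and two amino acids; the gene names, the counts, and the helper functions `relative_frequencies` and `row_masses` are hypothetical and are not taken from the paper. In correspondence analysis, each row is weighted by its share of the grand total, so a table of codon counts gives long genes more weight, whereas a table of within-amino-acid relative frequencies gives every gene the same row sum and therefore the same weight.

```python
# Toy codon-count table (hypothetical numbers, not data from the paper).
# Rows are genes; columns are the synonymous codons of two amino acids.
SYNONYMS = {"Phe": ["TTT", "TTC"], "Lys": ["AAA", "AAG"]}

counts = {
    "long_gene":  {"TTT": 60, "TTC": 20, "AAA": 90, "AAG": 30},
    "short_gene": {"TTT": 1,  "TTC": 3,  "AAA": 2,  "AAG": 4},
}


def relative_frequencies(row):
    """Divide each codon count by the total count for its amino acid."""
    rel = {}
    for codons in SYNONYMS.values():
        total = sum(row[c] for c in codons)
        for c in codons:
            rel[c] = row[c] / total if total else 0.0
    return rel


def row_masses(table):
    """Row weights as used in correspondence analysis: row total / grand total."""
    totals = {gene: sum(row.values()) for gene, row in table.items()}
    grand_total = sum(totals.values())
    return {gene: t / grand_total for gene, t in totals.items()}


relative = {gene: relative_frequencies(row) for gene, row in counts.items()}

mass_counts = row_masses(counts)      # weights follow gene length
mass_relative = row_masses(relative)  # every gene gets the same weight

print("row masses from counts:", mass_counts)
print("row masses from relative frequencies:", mass_relative)
```

With counts, the long gene dominates the analysis in proportion to its 200 codons; after conversion to relative frequencies, the 10-codon gene, whose frequencies are estimated from very few observations, carries exactly as much weight as the long one.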