Multiple Genome Comparison within a Bacterial Species Reveals a Unit of Evolution Spanning Two Adjacent Genes in a Tandem Paralog Cluster
Open Access
- 21 August 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 25 (11), 2457-2473
- https://doi.org/10.1093/molbev/msn192
Abstract
It has been assumed that an open reading frame (ORF) represents a unit of gene evolution as well as a unit of gene expression and function. In the present work, we report a case in which a unit comprising the 3′ region of an ORF linked to a downstream intergenic region that is in turn linked to the 5′ region of a downstream ORF has been conserved, and has served as the unit of gene evolution. The genes are tandem paralogous genes from the bacterium Staphylococcus aureus, for which more than ten entire genomes have been sequenced. We compared these multiple genome sequences at a locus for the lpl (lipoprotein-like) cluster (encoding lipoprotein homologs presumably related to their host interaction) in the genomic island termed νSaα. A highly conserved nucleotide sequence found within every lpl ORF is likely to provide a site for homologous recombination. Comparison of phylogenies of the 5′-variable region and the 3′-variable region within the same ORF revealed significant incongruence. In contrast, pairs of the 3′-variable region of an ORF and the 5′-variable region of the next downstream ORF gave more congruent phylogenies, with distinct groups of conserved pairs. The intergenic region seemed to have coevolved with the flanking variable regions. Multiple recombination events at the central conserved region appear to have caused various types of rearrangements among strains, shuffling the two variable regions in one ORF, but maintaining a conserved unit comprising the 3′-variable region, the intergenic region, and the 5′-variable region spanning adjacent ORFs. This result has a strong impact on our understanding of gene evolution because most gene lineages underwent tandem duplication and then diversified. This work also illustrates the use of multiple genome sequences for high-resolution evolutionary analysis within the same species.