Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics
- 8 February 2010
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 107 (9), 4371-4376
- https://doi.org/10.1073/pnas.0911295107
Abstract
Understanding the fine-structure molecular architecture of bacterial epidemics has been a long-sought goal of infectious disease research. We used short-read-length DNA sequencing coupled with mass spectroscopy analysis of SNPs to study the molecular pathogenomics of three successive epidemics of invasive infections involving 344 serotype M3 group A Streptococcus in Ontario, Canada. Sequencing the genome of 95 strains from the three epidemics, coupled with analysis of 280 biallelic SNPs in all 344 strains, revealed an unexpectedly complex population structure composed of a dynamic mixture of distinct clonally related complexes. We discovered that each epidemic is dominated by micro- and macrobursts of multiple emergent clones, some with distinct strain genotype–patient phenotype relationships. On average, strains were differentiated from one another by only 49 SNPs and 11 insertion-deletion events (indels) in the core genome. Ten percent of SNPs are strain specific; that is, each strain has a unique genome sequence. We identified nonrandom temporal–spatial patterns of strain distribution within and between the epidemic peaks. The extensive full-genome data permitted us to identify genes with significantly increased rates of nonsynonymous (amino acid-altering) nucleotide polymorphisms, thereby providing clues about selective forces operative in the host. Comparative expression microarray analysis revealed that closely related strains differentiated by seemingly modest genetic changes can have significantly divergent transcriptomes. We conclude that enhanced understanding of bacterial epidemics requires a deep-sequencing, geographically centric, comparative pathogenomics strategy.Keywords
This publication has 28 references indexed in Scilit:
- Sensitive, specific polymorphism discovery in bacteria using massively parallel sequencingNature Methods, 2008
- Next-generation DNA sequencingNature Biotechnology, 2008
- High-throughput sequencing provides insights into genome variation and evolution in Salmonella TyphiNature Genetics, 2008
- A direct link between carbohydrate utilization and virulence in the major human pathogen group A StreptococcusProceedings of the National Academy of Sciences, 2008
- Dendroscope: An interactive viewer for large phylogenetic treesBMC Bioinformatics, 2007
- Clustal W and Clustal X version 2.0Bioinformatics, 2007
- Automated comparative sequence analysis by base-specific cleavage and mass spectrometry for nucleic acid-based microbial typingProceedings of the National Academy of Sciences, 2007
- Molecular genetic anatomy of inter- and intraserotype variation in the human bacterial pathogen group A StreptococcusProceedings of the National Academy of Sciences, 2006
- Application of Phylogenetic Networks in Evolutionary StudiesMolecular Biology and Evolution, 2005
- A role for Trigger Factor and an Rgg-like regulator in the transcription, secretion and processing of the cysteine proteinase ofStreptococcus pyogenesThe EMBO Journal, 1998