Efficient de novo assembly of single-cell bacterial genomes from short-read data sets
Open Access
- 18 September 2011
- journal article
- research article
- Published by Springer Nature in Nature Biotechnology
- Vol. 29 (10), 915-921
- https://doi.org/10.1038/nbt.1966
Abstract
DNA can be amplified and sequenced from a single cell, but unevenness of the sequence coverage complicates efforts to assemble a high-quality genome. Chitsaz et al. devise an algorithm to address this problem and apply it to assemble a genome draft of an uncultured single-cell marine organism from one lane of Illumina sequence data. Whole genome amplification by the multiple displacement amplification (MDA) method allows sequencing of DNA from single cells of bacteria that cannot be cultured. Assembling a genome is challenging, however, because MDA generates highly nonuniform coverage of the genome. Here we describe an algorithm tailored for short-read data from single cells that improves assembly through the use of a progressively increasing coverage cutoff. Assembly of reads from single Escherichia coli and Staphylococcus aureus cells captures >91% of genes within contigs, approaching the 95% captured from an assembly based on many E. coli cells. We apply this method to assemble a genome from a single cell of an uncultivated SAR324 clade of Deltaproteobacteria, a cosmopolitan bacterial lineage in the global ocean. Metabolic reconstruction suggests that SAR324 is aerobic, motile and chemotaxic. Our approach enables acquisition of genome assemblies for individual uncultivated bacteria using only short reads, providing cell-specific genetic information absent from metagenomic studies.Keywords
This publication has 48 references indexed in Scilit:
- DOOR: a database for prokaryotic operonsNucleic Acids Research, 2008
- Accurate whole human genome sequencing using reversible terminator chemistryNature, 2008
- Genomic sequencing of single microbial cells from environmental samplesCurrent Opinion in Microbiology, 2008
- Complete genome of the uncultured Termite Group 1 bacteria in a single host protist cellProceedings of the National Academy of Sciences, 2008
- Dissecting biological “dark matter” with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouthProceedings of the National Academy of Sciences, 2007
- KAAS: an automatic genome annotation and pathway reconstruction serverNucleic Acids Research, 2007
- Evolution of sensory complexity recorded in a myxobacterial genomeProceedings of the National Academy of Sciences, 2006
- MetaGene: prokaryotic gene finding from environmental genome shotgun sequencesNucleic Acids Research, 2006
- Sequencing genomes from single cells by polymerase cloningNature Biotechnology, 2006
- KEGG: Kyoto Encyclopedia of Genes and GenomesNucleic Acids Research, 2000