Individual genome assembly from complex community short-read metagenomic datasets
Open Access
- 27 October 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in The ISME Journal
- Vol. 6 (4), 898-901
- https://doi.org/10.1038/ismej.2011.147
Abstract
Assembling individual genomes from complex community metagenomic data remains a challenging issue for environmental studies. We evaluated the quality of genome assemblies from community short-read data (Illumina 100 bp paired-end sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least about 20× coverage. At lower coverage, however, the derived assemblies contained a substantial fraction of non-target sequences (chimeras), which explains, at least in part, the higher number of hypothetical genes recovered in metagenomic relative to genomic projects. We also provide examples of how to detect intrapopulation structure in metagenomic datasets and estimate the type and frequency of errors in assembled genes and contigs from datasets of varied species complexity.
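To make the roughly 20× coverage threshold concrete, the following minimal sketch applies the standard mean-coverage relation (total sequenced bases divided by genome length). It is not the authors' procedure. The function names, the 5 Mbp genome size, and the 1% read share are hypothetical values chosen only for illustration, and the sketch assumes that reads are sampled uniformly and that a population's share of reads is known.

```python
import math


def mean_coverage(num_reads: int, read_length: int, genome_size: int) -> float:
    """Expected mean fold-coverage: total sequenced bases / genome length."""
    if genome_size <= 0 or read_length <= 0:
        raise ValueError("genome_size and read_length must be positive")
    return num_reads * read_length / genome_size


def reads_needed(target_coverage: float, read_length: int, genome_size: int) -> int:
    """Smallest number of reads from one genome that reaches target_coverage."""
    if genome_size <= 0 or read_length <= 0:
        raise ValueError("genome_size and read_length must be positive")
    return math.ceil(target_coverage * genome_size / read_length)


def metagenome_reads_needed(target_coverage: float, read_length: int,
                            genome_size: int, read_fraction: float) -> int:
    """Total metagenome reads needed when the target population contributes
    only `read_fraction` of all reads (an assumed, known quantity here)."""
    if not 0 < read_fraction <= 1:
        raise ValueError("read_fraction must be in (0, 1]")
    per_genome = reads_needed(target_coverage, read_length, genome_size)
    return math.ceil(per_genome / read_fraction)


if __name__ == "__main__":
    # Hypothetical target: 5 Mbp genome, 100 bp reads, 20x coverage goal.
    print(mean_coverage(1_000_000, 100, 5_000_000))
    print(reads_needed(20, 100, 5_000_000))
    # If the target makes up 1% of the reads in the community dataset:
    print(metagenome_reads_needed(20, 100, 5_000_000, 0.01))
```

Under these assumptions, a 5 Mbp genome needs about one million 100 bp reads to reach 20× coverage, and roughly one hundred million reads in total if the population contributes only 1% of the dataset.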