Individual genome assembly from complex community short-read metagenomic datasets

Open Access

27 October 2011

journal article
research article
Published by Oxford University Press (OUP) in The ISME Journal

Vol. 6 (4), 898-901
https://doi.org/10.1038/ismej.2011.147

Abstract

Assembling individual genomes from complex community metagenomic data remains a challenging issue for environmental studies. We evaluated the quality of genome assemblies from community short read data (Illumina 100 bp pair-ended sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least about 20 × coverage. At lower coverage, however, the derived assemblies contained a substantial fraction of non-target sequences (chimeras), which explains, at least in part, the higher number of hypothetical genes recovered in metagenomic relative to genomic projects. We also provide examples of how to detect intrapopulation structure in metagenomic datasets and estimate the type and frequency of errors in assembled genes and contigs from datasets of varied species complexity.

Keywords

This publication has 13 references indexed in Scilit:

A human gut microbial gene catalogue established by metagenomic sequencing
Nature, 2010
Comparative Metagenomic Analysis of a Microbial Community Residing at a Depth of 4,000 Meters at Station ALOHA in the North Pacific Subtropical Gyre
Applied and Environmental Microbiology, 2009
Accurate determination of microbial diversity from 454 pyrosequencing data
Nature Methods, 2009
Systematic artifacts in metagenomes from complex microbial communities
The ISME Journal, 2009
Genomic patterns of recombination, clonal divergence and environment in marine microbial populations
The ISME Journal, 2008
Use of simulated data sets to evaluate the fidelity of metagenomic processing methods
Nature Methods, 2007
Community Genomics Among Stratified Microbial Assemblages in the Ocean's Interior
Science, 2006
Genome sequencing in microfabricated high-density picolitre reactors
Nature, 2005
Genomic insights that advance the species definition for prokaryotes
Proceedings of the National Academy of Sciences, 2005
Solexa Ltd
Pharmacogenomics, 2004

Cited by 105 articles