Metabolic Reconstruction for Metagenomic Data and Its Application to the Human Microbiome
Top Cited Papers
Open Access
- 31 May 2012
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 8 (6), e1002358
- https://doi.org/10.1371/journal.pcbi.1002358
Abstract
Microbial communities carry out the majority of the biochemical activity on the planet, and they play integral roles in processes including metabolism and immune homeostasis in the human microbiome. Shotgun sequencing of such communities' metagenomes provides information complementary to organismal abundances from taxonomic markers, but the resulting data typically comprise short reads from hundreds of different organisms and are at best challenging to assemble comparably to single-organism genomes. Here, we describe an alternative approach to infer the functional and metabolic potential of a microbial community metagenome. We determined the gene families and pathways present or absent within a community, as well as their relative abundances, directly from short sequence reads. We validated this methodology using a collection of synthetic metagenomes, recovering the presence and abundance both of large pathways and of small functional modules with high accuracy. We subsequently applied this method, HUMAnN, to the microbial communities of 649 metagenomes drawn from seven primary body sites on 102 individuals as part of the Human Microbiome Project (HMP). This provided a means to compare functional diversity and organismal ecology in the human microbiome, and we determined a core of 24 ubiquitously present modules. Core pathways were often implemented by different enzyme families within different body sites, and 168 functional modules and 196 metabolic pathways varied in metagenomic abundance specifically to one or more niches within the microbiome. These included glycosaminoglycan degradation in the gut, as well as phosphate and amino acid transport linked to host phenotype (vaginal pH) in the posterior fornix. An implementation of our methodology is available at http://huttenhower.sph.harvard.edu/humann. This provides a means to accurately and efficiently characterize microbial metabolic pathways and functional modules directly from high-throughput sequencing reads, enabling the determination of community roles in the HMP cohort and in future metagenomic studies.Keywords
This publication has 71 references indexed in Scilit:
- Toward molecular trait‐based ecology through integration of biogeochemical, geographical and metagenomic dataMolecular Systems Biology, 2011
- Homeostasis and Inflammation in the IntestineCell, 2010
- Natural Products Version 2.0: Connecting Genes to MoleculesJournal of the American Chemical Society, 2010
- The gut microbiota shapes intestinal immune responses during health and diseaseNature Reviews Immunology, 2009
- Common Genetic Variation and Human TraitsNew England Journal of Medicine, 2009
- Quantifying environmental adaptation of metabolic pathways in metagenomicsProceedings of the National Academy of Sciences, 2009
- A core gut microbiome in obese and lean twinsNature, 2008
- The properties of high-dimensional data spaces: implications for exploring gene and protein expression dataNature Reviews Cancer, 2008
- The NCBI dbGaP database of genotypes and phenotypesNature Genetics, 2007
- Use of simulated data sets to evaluate the fidelity of metagenomic processing methodsNature Methods, 2007