Quantifying environmental adaptation of metabolic pathways in metagenomics
- 3 February 2009
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 106 (5), 1374-1379
- https://doi.org/10.1073/pnas.0808022106
Abstract
Recently, approaches have been developed to sample the genetic content of heterogeneous environments (metagenomics). However, by what means these sequences link distinct environmental conditions with specific biological processes is not well understood. Thus, a major challenge is how the usage of particular pathways and subnetworks reflects the adaptation of microbial communities across environments and habitats—i.e., how network dynamics relates to environmental features. Previous research has treated environments as discrete, somewhat simplified classes (e.g., terrestrial vs. marine), and searched for obvious metabolic differences among them (i.e., treating the analysis as a typical classification problem). However, environmental differences result from combinations of many factors, which often vary only slightly. Therefore, we introduce an approach that employs correlation and regression to relate multiple, continuously varying factors defining an environment to the extent of particular microbial pathways present in a geographic site. Moreover, rather than looking only at individual correlations (one-to-one), we adapted canonical correlation analysis and related techniques to define an ensemble of weighted pathways that maximally covaries with a combination of environmental variables (many-to-many), which we term a metabolic footprint. Applied to available aquatic datasets, we identified footprints predictive of their environment that can potentially be used as biosensors. For example, we show a strong multivariate correlation between the energy-conversion strategies of a community and multiple environmental gradients (e.g., temperature). Moreover, we identified covariation in amino acid transport and cofactor synthesis, suggesting that limiting amounts of cofactor can (partially) explain increased import of amino acids in nutrient-limited conditions.Keywords
This publication has 26 references indexed in Scilit:
- Taxonomic distribution of large DNA viruses in the seaGenome Biology, 2008
- Rapid chemotactic response enables marine bacteria to exploit ephemeral microscale nutrient patchesProceedings of the National Academy of Sciences, 2008
- iPath: interactive exploration of biochemical pathways and networksTrends in Biochemical Sciences, 2008
- A statistical toolbox for metagenomics: assessing functional diversity in microbial communitiesBMC Bioinformatics, 2008
- Common variants in the GDF5-UQCC region are associated with variation in human heightNature Genetics, 2008
- Quantitative assessment of protein function prediction from metagenomics shotgun sequencesProceedings of the National Academy of Sciences, 2007
- The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical PacificPLoS Biology, 2007
- Quantitative Phylogenetic Assessment of Microbial Communities in Diverse EnvironmentsScience, 2007
- Community genomics in microbial ecology and evolutionNature Reviews Microbiology, 2005
- Adaptive eradication of methionine and cysteine from cyanobacterial light-harvesting proteinsNature, 1989