Refining microbial community metabolic models derived from metagenomics using reference-based taxonomic profiling.
Marwan E MajzoubLaurence D W LuuCraig HaiferSudarshan ParamsothyThomas J BorodyRupert W LeongTorsten ThomasNadeem O KaakoushPublished in: mSystems (2024)
Characterization of microbial community metabolic output is crucial to understanding their functions. Construction of genome-scale metabolic models from metagenome-assembled genomes (MAG) has enabled prediction of metabolite production by microbial communities, yet little is known about their accuracy. Here, we examined the performance of two approaches for metabolite prediction from metagenomes, one that is MAG-guided and another that is taxonomic reference-guided. We applied both on shotgun metagenomics data from human and environmental samples, and validated findings in the human samples using untargeted metabolomics. We found that in human samples, where taxonomic profiling is optimized and reference genomes are readily available, when number of input taxa was normalized, the reference-guided approach predicted more metabolites than the MAG-guided approach. The two approaches showed significant overlap but each identified metabolites not predicted in the other. Pathway enrichment analyses identified significant differences in inferences derived from data based on the approach, highlighting the need for caution in interpretation. In environmental samples, when the number of input taxa was normalized, the reference-guided approach predicted more metabolites than the MAG-guided approach for total metabolites in both sample types and non-redundant metabolites in seawater samples. Nonetheless, as was observed for the human samples, the approaches overlapped substantially but also predicted metabolites not observed in the other. Our findings report on utility of a complementary input to genome-scale metabolic model construction that is less computationally intensive forgoing MAG assembly and refinement, and that can be applied on shallow shotgun sequencing where MAGs cannot be generated.IMPORTANCELittle is known about the accuracy of genome-scale metabolic models (GEMs) of microbial communities despite their influence on inferring community metabolic outputs and culture conditions. The performance of GEMs for metabolite prediction from metagenomes was assessed by applying two approaches on shotgun metagenomics data from human and environmental samples, and validating findings in the human samples using untargeted metabolomics. The performance of the approach was found to be dependent on sample type, but collectively, the reference-guided approach predicted more metabolites than the MAG-guided approach. Despite the differences, the predictions from the approaches overlapped substantially but each identified metabolites not predicted in the other. We found significant differences in biological inferences based on the approach, with some examples of uniquely enriched pathways in one group being invalidated when using the alternative approach, highlighting the need for caution in interpretation of GEMs.