Human Microbiome Mixture Analysis Using Weighted Quantile Sum Regression.

Shoshannah Eggers, Moira Bixby, Stefano Renzetti, Paul Curtin, Chris Gennings
Published in: International Journal of Environmental Research and Public Health (2022)
Studies of the health effects of the microbiome often measure overall associations using diversity metrics and associations with individual taxa in separate analyses, but do not consider the correlated relationships between taxa in the microbiome. In this study, we applied random subset weighted quantile sum regression with repeated holdouts (WQS RSRH), a mixture method successfully applied to 'omic data to account for relationships between many predictors, to processed amplicon sequencing data from the Human Microbiome Project. We simulated a binary variable associated with 20 operational taxonomic units (OTUs). WQS RSRH was used to test for the association between the microbiome and the simulated variable, adjusted for sex, and sensitivity and specificity were calculated. The WQS RSRH method was also compared to other standard methods for microbiome analysis. The method was further illustrated using real data from the Growth and Obesity Cohort in Chile to assess the association between the gut microbiome and body mass index. In the analysis with simulated data, WQS RSRH predicted the correct directionality of association between the microbiome and the simulated variable, with an average sensitivity and specificity of 75% and 70%, respectively, in identifying the 20 associated OTUs. WQS RSRH performed better than all other comparison methods. In the illustration analysis of the gut microbiome and obesity, the WQS RSRH analysis identified an inverse association between body mass index and the gut microbe mixture, identifying Bacteroides, Clostridium, Prevotella, and Ruminococcus as important genera in the negative association. The application of WQS RSRH to the microbiome allows for analysis of the mixture effect of all the taxa in the microbiome, while simultaneously identifying the taxa most important to the mixture and allowing for covariate adjustment. WQS RSRH outperformed other methods when using simulated data, and in analysis with real data it found results consistent with other study findings.
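As a rough illustration of two ideas in the abstract, the sketch below shows how a weighted quantile sum index can be formed from taxa abundances and how sensitivity and specificity can be computed for a set of flagged taxa. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the choice of quartiles, and the rule of flagging taxa whose weight exceeds 1/(number of taxa) are assumptions made here for illustration. The weights are taken as given; estimating them with random subsets and repeated holdouts is not shown.

```python
import numpy as np


def quantile_scores(X, q=4):
    """Score each column of X into q quantile bins, coded 0 .. q-1.

    X is an (n_samples, n_taxa) array of abundances. q=4 (quartiles)
    is an assumption made for this sketch.
    """
    X = np.asarray(X, dtype=float)
    scores = np.empty(X.shape, dtype=int)
    inner_probs = np.linspace(0, 1, q + 1)[1:-1]  # interior cut points
    for j in range(X.shape[1]):
        cuts = np.quantile(X[:, j], inner_probs)
        scores[:, j] = np.searchsorted(cuts, X[:, j], side="right")
    return scores


def wqs_index(scores, weights):
    """Weighted quantile sum: one index value per sample.

    Weights are required to be non-negative and to sum to 1.
    """
    weights = np.asarray(weights, dtype=float)
    if np.any(weights < 0) or not np.isclose(weights.sum(), 1.0):
        raise ValueError("weights must be non-negative and sum to 1")
    return scores @ weights


def flag_important(weights):
    """Flag taxa whose weight exceeds 1 / n_taxa (assumed threshold)."""
    weights = np.asarray(weights, dtype=float)
    return weights > 1.0 / weights.size


def sensitivity_specificity(truly_associated, flagged):
    """Sensitivity and specificity of flagged taxa against the truth."""
    truth = np.asarray(truly_associated, dtype=bool)
    flagged = np.asarray(flagged, dtype=bool)
    tp = np.sum(truth & flagged)
    fn = np.sum(truth & ~flagged)
    tn = np.sum(~truth & ~flagged)
    fp = np.sum(~truth & flagged)
    return tp / (tp + fn), tn / (tn + fp)


# Toy inputs (hypothetical values, not data from the study).
toy_weights = [0.45, 0.35, 0.10, 0.05, 0.05]
toy_truth = [True, False, True, False, False]
toy_sens, toy_spec = sensitivity_specificity(toy_truth, flag_important(toy_weights))
```

In a simulation like the one described, the truth vector would mark the 20 OTUs used to generate the binary variable, and the flagged vector would come from the estimated weights.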