Development of a novel machine learning-based weighted modeling approach to incorporate Salmonella enterica heterogeneity on a genetic scale in a dose-response modeling framework.
Shraddha KaranthAbani K PradhanPublished in: Risk analysis : an official publication of the Society for Risk Analysis (2022)
Estimating microbial dose-response is an important aspect of a food safety risk assessment. In recent years, there has been considerable interest to advance these models with potential incorporation of gene expression data. The aim of this study was to develop a novel machine learning model that considers the weights of expression of Salmonella genes that could be associated with illness, given exposure, in hosts. Here, an elastic net-based weighted Poisson regression method was proposed to identify Salmonella enterica genes that could be significantly associated with the illness response, irrespective of serovar. The best-fit elastic net model was obtained by 10-fold cross-validation. The best-fit elastic net model identified 33 gene expression-dose interaction terms that added to the predictability of the model. Of these, nine genes associated with Salmonella metabolism and virulence were found to be significant by the best-fit Poisson regression model (p < 0.05). This method could improve or redefine dose-response relationships for illness from relative proportions of significant genes from a microbial genetic dataset, which would help in refining endpoint and risk estimations.