Incorporating metabolic activity, taxonomy and community structure to improve microbiome-based predictive models for host phenotype prediction.
Mahsa MonshizadehYuzhen YePublished in: Gut microbes (2024)
We developed MicroKPNN, a prior-knowledge guided interpretable neural network for microbiome-based human host phenotype prediction. The prior knowledge used in MicroKPNN includes the metabolic activities of different bacterial species, phylogenetic relationships, and bacterial community structure, all in a shallow neural network. Application of MicroKPNN to seven gut microbiome datasets (involving five different human diseases including inflammatory bowel disease, type 2 diabetes, liver cirrhosis, colorectal cancer, and obesity) shows that incorporation of the prior knowledge helped improve the microbiome-based host phenotype prediction. MicroKPNN outperformed fully connected neural network-based approaches in all seven cases, with the most improvement of accuracy in the prediction of type 2 diabetes. MicroKPNN outperformed a recently developed deep-learning based approach DeepMicro, which selects the best combination of autoencoder and machine learning approach to make predictions, in all of the seven cases. Importantly, we showed that MicroKPNN provides a way for interpretation of the predictive models. Using importance scores estimated for the hidden nodes, MicroKPNN could provide explanations for prior research findings by highlighting the roles of specific microbiome components in phenotype predictions. In addition, it may suggest potential future research directions for studying the impacts of microbiome on host health and diseases. MicroKPNN is publicly available at https://github.com/mgtools/MicroKPNN.
Keyphrases
- neural network
- healthcare
- type diabetes
- machine learning
- endothelial cells
- deep learning
- insulin resistance
- metabolic syndrome
- public health
- induced pluripotent stem cells
- mental health
- artificial intelligence
- pluripotent stem cells
- glycemic control
- climate change
- high fat diet induced
- early stage
- big data
- current status
- sentinel lymph node
- neoadjuvant chemotherapy
- ulcerative colitis