Login / Signup

Large-scale microbiome data integration enables robust biomarker identification.

Liwen XiaoFengyi ZhangFangqing Zhao
Published in: Nature computational science (2022)
The close association between gut microbiota dysbiosis and human diseases is being increasingly recognized. However, contradictory results are frequently reported, as confounding effects exist. The lack of unbiased data integration methods is also impeding the discovery of disease-associated microbial biomarkers from different cohorts. Here we propose an algorithm, NetMoss, for assessing shifts of microbial network modules to identify robust biomarkers associated with various diseases. Compared to previous approaches, the NetMoss method shows better performance in removing batch effects. Through comprehensive evaluations on both simulated and real datasets, we demonstrate that NetMoss has great advantages in the identification of disease-related biomarkers. Based on analysis of pandisease microbiota studies, there is a high prevalence of multidisease-related bacteria in global populations. We believe that large-scale data integration will help in understanding the role of the microbiome from a more comprehensive perspective and that accurate biomarker identification will greatly promote microbiome-based medical diagnosis.
Keyphrases
  • electronic health record
  • microbial community
  • endothelial cells
  • healthcare
  • bioinformatics analysis
  • machine learning
  • small molecule
  • induced pluripotent stem cells
  • artificial intelligence
  • network analysis