Direct interaction network inference for compositional data via codaloss.
Liang Chen, Shun He, Yuyao Zhai, Minghua Deng
Published in: Journal of Bioinformatics and Computational Biology (2020)
16S rRNA gene sequencing and whole microbiome sequencing make it possible to quantitatively and reliably analyze the composition of microbial communities and the relationships among microbial communities, microbes, and hosts. One essential step in the analysis of microbiome compositional data is inferring the direct interaction network among microbial species, which sheds light on the potential mechanisms that regulate interactions within these communities. However, standard statistical analysis may yield spurious results because of the compositional nature of microbiome data; therefore, network recovery for microbial communities remains challenging. Here, we propose a novel loss function called codaloss for estimating direct microbial interaction networks under a sparsity assumption. We develop an alternating direction optimization algorithm to obtain a sparse minimizer of codaloss, which serves as the estimator. Compared to other state-of-the-art methods, our model makes fewer assumptions about the microbial networks. Results on simulated and real microbiome data show that our method outperforms other methods in network inference. An implementation of codaloss is available from https://github.com/xuebaliang/Codaloss.
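The compositionality problem noted in the abstract can be illustrated with a small simulation. The sketch below is not the codaloss method and does not use the authors' code; it only shows, under assumed settings, how normalizing independent abundances to relative abundances induces negative correlations that do not exist in the underlying data. The function names (`closure`, `mean_offdiag_corr`), the log-normal abundance model, and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np


def closure(abundances):
    """Convert absolute abundances to compositions (each row sums to 1)."""
    return abundances / abundances.sum(axis=1, keepdims=True)


def mean_offdiag_corr(x):
    """Mean Pearson correlation over all distinct pairs of columns."""
    corr = np.corrcoef(x, rowvar=False)
    off_diagonal = ~np.eye(corr.shape[0], dtype=bool)
    return corr[off_diagonal].mean()


# Assumed toy setting: 4 taxa with independent log-normal abundances,
# so the true interaction network has no edges at all.
rng = np.random.default_rng(0)
n_samples, n_taxa = 2000, 4
absolute = rng.lognormal(mean=0.0, sigma=0.5, size=(n_samples, n_taxa))

# Sequencing typically yields only relative abundances (compositions).
relative = closure(absolute)

abs_corr = mean_offdiag_corr(absolute)
rel_corr = mean_offdiag_corr(relative)

print(f"mean pairwise correlation, absolute abundances: {abs_corr:.3f}")
print(f"mean pairwise correlation, relative abundances: {rel_corr:.3f}")
```

Because the components of each composition must sum to one, an increase in one taxon's share forces the others down. In this exchangeable toy setting the pairwise correlations of the relative abundances are pulled toward -1/(p-1) for p taxa, even though the absolute abundances are independent, which is one way naive correlation analysis can suggest interactions that are not there.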