Testing latent class of subjects with structural zeros in negative binomial models with applications to gut microbiome data.
Peng YeXinhui QiaoWan TangChunyi WangHua HePublished in: Statistical methods in medical research (2022)
Human microbiome research has become a hot-spot in health and medical research in the past decade due to the rapid development of modern high-throughput. Typical data in a microbiome study consisting of the operational taxonomic unit counts may have over-dispersion and/or structural zero issues. In such cases, negative binomial models can be applied to address the over-dispersion issue, while zero-inflated negative binomial models can be applied to address both issues. In practice, it is essential to know if there is zero-inflation in the data before applying negative binomial or zero-inflated negative binomial models because zero-inflated negative binomial models may be unnecessarily complex and difficult to interpret, or may even suffer from convergence issues if there is no zero-inflation in the data. On the other hand, negative binomial models may yield invalid inferences if the data does exhibit excessive zeros. In this paper, we develop a new test for detecting zero-inflation resulting from a latent class of subjects with structural zeros in a negative binomial regression model by directly comparing the amount of observed zeros with what would be expected under the negative binomial regression model. A closed form of the test statistic as well as its asymptotic properties are derived based on estimating equations. Intensive simulation studies are conducted to investigate the performance of the new test and compare it with the classical Wald, likelihood ratio, and score tests. The tests are also applied to human gut microbiome data to test latent class in microbial genera.