Login / Signup

Testing generalized linear models with high-dimensional nuisance parameter.

Jinsong ChenQuefeng LiHua Yun Chen
Published in: Biometrika (2022)
Generalized linear models often have a high-dimensional nuisance parameters, as seen in applications such as testing gene-environment interactions or gene-gene interactions. In these scenarios, it is essential to test the significance of a high-dimensional sub-vector of the model's coefficients. Although some existing methods can tackle this problem, they often rely on the bootstrap to approximate the asymptotic distribution of the test statistic, and thus are computationally expensive. Here, we propose a computationally efficient test with a closed-form limiting distribution, which allows the parameter being tested to be either sparse or dense. We show that under certain regularity conditions, the type I error of the proposed method is asymptotically correct, and we establish its power under high-dimensional alternatives. Extensive simulations demonstrate the good performance of the proposed test and its robustness when certain sparsity assumptions are violated. We also apply the proposed method to Chinese famine sample data in order to show its performance when testing the significance of gene-environment interactions.
Keyphrases
  • copy number
  • genome wide
  • genome wide identification
  • climate change
  • gene expression
  • machine learning
  • electronic health record
  • data analysis
  • artificial intelligence