Login / Signup

Methods for large-scale single mediator hypothesis testing: Possible choices and comparisons.

Jiacong DuXiang ZhouDylan Clark-BoucherWei HaoYongmei LiuJennifer A SmithBhramar Mukherjee
Published in: Genetic epidemiology (2022)
Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, H 0 : α β = 0 ( α : effect of the exposure on the mediator after adjusting for confounders; β : effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for large-scale one at a time mediation hypothesis testing. These methods are commonly used for continuous outcomes and continuous mediators assuming there is no exposure-mediator interaction so that the product α β has a causal interpretation as the indirect effect. The first class of methods ignores the impact of different structures under the composite null hypothesis, namely, 1) α = 0 , β ≠ 0 ; 2) α ≠ 0 , β = 0 ; and 3) α = β = 0 . The second class of methods weights the reference distribution under each case of the null to form a mixture reference distribution. The third class constructs a composite test statistic using the three p-values obtained under each case of the null so that the reference distribution of the composite statistic is approximately U ( 0 , 1 ) . In addition to these existing methods, we developed the Sobel-comp method belonging to the second class, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods belonging to these three classes in terms of the false positive rates under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the second class of methods which uses a mixture reference distribution could best maintain the false positive rates at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi-Ethnic Study of Atherosclerosis (MESA). We provide guidelines for choosing the optimal mediation hypothesis testing method in practice and develop an R package medScan available at https://github.com/umich-cphds/medScan for implementing all the six methods. This article is protected by copyright. All rights reserved.
Keyphrases
  • dna methylation
  • healthcare
  • cardiovascular disease
  • type diabetes
  • metabolic syndrome
  • adipose tissue
  • social support
  • high resolution
  • genome wide
  • clinical practice