Controlling false discovery rate for mediator selection in high-dimensional data.
Ran Dai, Ruiyang Li, Seonjoo Lee, Ying Liu
Published in: Biometrics (2024)
The need to select mediators from a high-dimensional data source, such as neuroimaging or genetic data, arises in much scientific research. In this work, we formulate a multiple-hypothesis testing framework for mediator selection from a high-dimensional candidate set and propose a method that extends recent developments in false discovery rate (FDR)-controlled variable selection with knockoffs to select mediators with FDR control. We show that the proposed method and algorithm achieve finite-sample FDR control. We present extensive simulation results to demonstrate the power and finite-sample performance of the method compared with the existing method. Lastly, we apply the method to the Adolescent Brain Cognitive Development (ABCD) study, in which it selects several resting-state functional magnetic resonance imaging connectivity markers as mediators of the relationship between adverse childhood events and the crystallized composite score in the NIH Toolbox.
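For readers unfamiliar with knockoff-based FDR control, the selection step of the generic knockoff+ filter can be sketched as follows. This is only an illustration of the standard thresholding rule from the knockoff literature, not the mediator-selection procedure of the paper, which the abstract does not spell out. The function names and the input `w` (one signed importance statistic per candidate, where large positive values favor the real variable over its knockoff copy) are assumptions made for this sketch.

```python
def knockoff_plus_threshold(w, q):
    """Return the knockoff+ threshold for statistics w at target FDR q.

    The threshold is the smallest t among the nonzero |w_j| such that
    (1 + #{j: w_j <= -t}) / max(1, #{j: w_j >= t}) <= q.
    Returns infinity when no such t exists (nothing is selected).
    """
    candidates = sorted({abs(x) for x in w if x != 0})
    for t in candidates:
        negatives = sum(1 for x in w if x <= -t)  # proxy count of false discoveries
        positives = sum(1 for x in w if x >= t)   # candidates that would be selected
        if (1 + negatives) / max(positives, 1) <= q:
            return t
    return float("inf")


def knockoff_select(w, q):
    """Return indices of candidates whose statistic meets the threshold."""
    t = knockoff_plus_threshold(w, q)
    return [j for j, x in enumerate(w) if x >= t]
```

How the statistics `w` would be built for a mediation setting, where a candidate must relate to both the exposure and the outcome, is the part specific to the paper and is deliberately left out here.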
Keyphrases
- resting state
- functional connectivity
- electronic health record
- magnetic resonance imaging
- big data
- white matter
- computed tomography
- mental health
- young adults
- emergency department
- machine learning
- data analysis
- high throughput
- genome wide
- deep learning
- magnetic resonance
- dna methylation
- single cell
- adverse drug
- childhood cancer