Data Imputation for Clinical Trial Emulation: A Case Study on Impact of Intracranial Pressure Monitoring for Traumatic Brain Injury.
Zhizhen ZhaoRuoqi LiuJonathan I GronerHenry XiangPing ZhangPublished in: medRxiv : the preprint server for health sciences (2023)
Randomized clinical trial emulation using real-world data is significant for treatment effect evaluation. Missing values are common in the observational data. Handling missing data improperly would cause biased estimations and invalid conclusions. However, discussions on how to address this issue in causal analysis using observational data are still limited. Multiple imputation by chained equations (MICE) is a popular approach to fill in missing data. In this study, we combined multiple imputation with propensity score weighted model to estimate the average treatment effect (ATE). We compared various multiple imputation (MI) strategies and a complete data analysis on two benchmark datasets. The experiments showed that data imputations had better performances than completely ignoring the missing data, and using different imputation models for different covariates gave a high precision of estimation. Furthermore, we applied the optimal strategy on a medical records data to evaluate the impact of ICP monitoring on inpatient mortality of traumatic brain injury (TBI). The experiment details and code are available at https://github.com/Zhizhen-Zhao/IPTW-TBI .
Keyphrases
- traumatic brain injury
- electronic health record
- data analysis
- big data
- clinical trial
- magnetic resonance imaging
- type diabetes
- metabolic syndrome
- mental health
- skeletal muscle
- computed tomography
- adipose tissue
- cross sectional
- severe traumatic brain injury
- deep learning
- high fat diet induced
- study protocol
- combination therapy
- wild type