A multi-step approach to managing missing data in time and patient variant electronic health records.
Nina CesareLawrence P O WerePublished in: BMC research notes (2022)
For this study we use a subset of data from AMPATH representing information from 530,812 clinic visits from 16,316 Human Immunodeficiency Virus (HIV) positive women across Western Kenya who have given birth. We apply this process to a set of 84 clinical, social and economic variables and are able to impute values for 84.6% of variables with missing data with an average reduction in missing data of approximately 35.6%. We validate the use of this imputed dataset by predicting National Hospital Insurance Fund (NHIF) enrollment with 94.8% accuracy.
Keyphrases
- electronic health record
- human immunodeficiency virus
- hiv positive
- antiretroviral therapy
- big data
- clinical decision support
- adverse drug
- south africa
- healthcare
- health insurance
- hepatitis c virus
- men who have sex with men
- hiv infected
- primary care
- data analysis
- mental health
- pregnant women
- case report
- metabolic syndrome
- deep learning
- drug induced