Login / Signup

Performance of Multiple Imputation Using Modern Machine Learning Methods in Electronic Health Records Data.

Kylie GetzRebecca A HubbardKristin A Linn
Published in: Epidemiology (Cambridge, Mass.) (2022)
We found no advantage of denoising autoencoders for multiple imputation in the setting of an epidemiologic study conducted using EHR data. Results suggested that denoising autoencoders may overfit the data leading to poor confounder control. Use of more flexible imputation approaches does not mitigate bias induced by missingness not at random and can produce estimates with spurious precision.
Keyphrases
  • electronic health record
  • clinical decision support
  • machine learning
  • adverse drug
  • big data
  • convolutional neural network
  • deep learning