Estimating Gaussian Copulas with Missing Data with and without Expert Knowledge.

Maximilian Kertel, Markus Pauly
Published in: Entropy (Basel, Switzerland) (2022)
In this work, we present a rigorous application of the Expectation Maximization algorithm to determine the marginal distributions and the dependence structure in a Gaussian copula model with missing data. We also show how semiparametric modeling circumvents a priori assumptions on the marginals, and we outline how expert knowledge on the marginals and the dependency structure can be included. A simulation study shows that the distribution learned through this algorithm is closer to the true distribution than that obtained with existing methods and that the incorporation of domain knowledge provides benefits.
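To make the general idea concrete, the following is a minimal sketch of an EM iteration for the dependence parameter of a bivariate Gaussian copula when some entries are missing. It is not the algorithm of the paper: it assumes the marginals are known (here, unit-rate exponential, chosen only for illustration), that values are missing completely at random, and that the M-step may update an unrestricted zero-mean covariance that is rescaled to a correlation at the end. The names `to_latent`, `em_correlation`, `simulate`, and `exp_cdf` are hypothetical helpers introduced for this example.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, used for Phi and Phi^{-1}


def exp_cdf(x):
    """CDF of a unit-rate exponential (an assumed, known marginal)."""
    return 1.0 - math.exp(-x)


def to_latent(x, cdf):
    """Map an observation to the latent Gaussian scale: z = Phi^{-1}(F(x))."""
    if x is None:
        return None
    u = min(max(cdf(x), 1e-12), 1.0 - 1e-12)  # keep u strictly inside (0, 1)
    return STD_NORMAL.inv_cdf(u)


def em_correlation(pairs, n_iter=200):
    """EM for a zero-mean bivariate normal with missing entries (None).

    Simplification: the covariance is left unrestricted in the M-step and
    only rescaled to a correlation when returning.
    """
    rows = [(a, b) for a, b in pairs if a is not None or b is not None]
    n = len(rows)
    s11, s12, s22 = 1.0, 0.0, 1.0  # start from independence
    for _ in range(n_iter):
        t11 = t12 = t22 = 0.0
        for z1, z2 in rows:
            if z1 is not None and z2 is not None:
                e11, e12, e22 = z1 * z1, z1 * z2, z2 * z2
            elif z2 is None:
                m = s12 / s11 * z1           # E[z2 | z1]
                v = s22 - s12 * s12 / s11    # Var[z2 | z1]
                e11, e12, e22 = z1 * z1, z1 * m, m * m + v
            else:
                m = s12 / s22 * z2           # E[z1 | z2]
                v = s11 - s12 * s12 / s22    # Var[z1 | z2]
                e11, e12, e22 = m * m + v, m * z2, z2 * z2
            t11 += e11
            t12 += e12
            t22 += e22
        s11, s12, s22 = t11 / n, t12 / n, t22 / n  # M-step
    return s12 / math.sqrt(s11 * s22)


def simulate(n, rho, p_missing, seed=0):
    """Draw copula data with exponential marginals, delete some second
    coordinates at random, and return the pairs on the latent scale."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rho * z1 + math.sqrt(1.0 - rho * rho) * rng.gauss(0.0, 1.0)
        x1 = -math.log1p(-STD_NORMAL.cdf(z1))  # exponential quantile of Phi(z1)
        x2 = -math.log1p(-STD_NORMAL.cdf(z2))
        if rng.random() < p_missing:
            x2 = None
        pairs.append((to_latent(x1, exp_cdf), to_latent(x2, exp_cdf)))
    return pairs


rho_hat = em_correlation(simulate(4000, 0.6, 0.3, seed=1))
print(f"estimated latent correlation: {rho_hat:.3f}")
```

With known marginals the latent variables are standard normal, which is why the sketch fixes the mean at zero. When the marginals are unknown, the transformation in `to_latent` itself depends on quantities that must be estimated, which is the harder setting the abstract addresses.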
Keyphrases
  • healthcare
  • machine learning
  • electronic health record
  • deep learning
  • big data
  • clinical practice
  • neural network