A framework for employing longitudinally collected multicenter electronic health records to stratify heterogeneous patient populations on disease history.
Marc P MauritsIlya KorsunskySoumya RaychaudhuriShawn N MurphyJordan W SmollerScott T WeissThomas W J HuizingaMarcel J T ReindersElizabeth W KarlsonErik B van den AkkerRachel KnevelPublished in: Journal of the American Medical Informatics Association : JAMIA (2022)
We establish a generalizable pipeline for the identification and replication of clinically meaningful (sub)phenotypes from widely available high-dimensional billing codes. This approach overcomes datatype problems and produces comprehensive visualizations of validation-ready phenotypes.