Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model.

Jeffrey G Klann, Matthew A H Joss, Kevin Embree, Shawn N Murphy
Published in: PLOS ONE (2019)
Here, we build on our investment in high-performance i2b2 transformations to support the All of Us (AOU) OMOP data pipeline. Because the ARCH ontology has gained widespread national interest (through the Accrual to Clinical Trials network, other PCORnet networks, and the Nebraska Lexicon), we also built on sites' existing investments in this standard ontology. We developed an i2b2-to-OMOP transformation, driven by the ARCH-OMOP ontology and the OMOP concept mapping dictionary. We demonstrated and validated our approach in the AOU New England HPO (NEHPO). First, we transformed a fake patient dataset in i2b2 into OMOP and verified with AOU tools that the data were structurally compliant with OMOP. We then transformed a subset of data in the Partners Healthcare data warehouse into OMOP. Using OMOP's visual Achilles data quality tool, we developed a checklist of assessments to ensure the transformed data had self-integrity (e.g., the distributions have an expected shape and required fields are populated). This i2b2-to-OMOP transformation is being used to send NEHPO production data to AOU. It is open-source and ready for use by other research projects.
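To make the idea of a mapping-driven transformation concrete, here is a minimal Python sketch. It is not the authors' implementation (the abstract does not describe the code). It assumes i2b2 facts arrive as dictionaries with the standard `observation_fact` columns `patient_num`, `encounter_num`, `concept_cd`, and `start_date`. The lookup table `CONCEPT_MAP`, the helper names, and the concept IDs are hypothetical placeholders. Routing unmapped codes to the `observation` table is also an assumption of this sketch. Using concept ID 0 for an unmapped code follows the OMOP convention.

```python
from datetime import date

# Hypothetical stand-in for an ontology-driven mapping:
# i2b2 concept_cd -> (OMOP domain, OMOP concept_id).
# The concept_ids are illustrative placeholders, not real vocabulary lookups.
CONCEPT_MAP = {
    "ICD10CM:E11.9": ("Condition", 1001),
    "LOINC:4548-4": ("Measurement", 2001),
}

# OMOP domain -> (target table, column prefix, date column).
DOMAIN_TABLES = {
    "Condition": ("condition_occurrence", "condition", "condition_start_date"),
    "Measurement": ("measurement", "measurement", "measurement_date"),
    "Observation": ("observation", "observation", "observation_date"),
}

def transform_fact(fact):
    """Route one i2b2 fact to an OMOP table and build the target row."""
    # Assumption: unmapped codes go to the observation table with concept_id 0.
    domain, concept_id = CONCEPT_MAP.get(fact["concept_cd"], ("Observation", 0))
    table, prefix, date_col = DOMAIN_TABLES[domain]
    row = {
        "person_id": fact["patient_num"],
        "visit_occurrence_id": fact["encounter_num"],
        f"{prefix}_concept_id": concept_id,
        f"{prefix}_source_value": fact["concept_cd"],  # keep the source code
        date_col: fact["start_date"],
    }
    return table, row

def missing_required(row, required):
    """Return the required columns that are absent or null in a row."""
    return [col for col in required if row.get(col) is None]

facts = [
    {"patient_num": 1, "encounter_num": 10,
     "concept_cd": "ICD10CM:E11.9", "start_date": date(2018, 3, 1)},
    {"patient_num": 1, "encounter_num": 10,
     "concept_cd": "LOCAL:XYZ", "start_date": None},
]

results = [transform_fact(f) for f in facts]
for table, row in results:
    _, _, date_col = next(v for v in DOMAIN_TABLES.values() if v[0] == table)
    print(table, missing_required(row, ["person_id", date_col]))
```

The second fact illustrates the two kinds of problem a self-integrity checklist might flag: a code with no mapping (concept ID 0) and an unpopulated required field (the date).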
Keyphrases
  • electronic health record
  • big data
  • healthcare
  • clinical trial
  • randomized controlled trial
  • mass spectrometry
  • data analysis
  • human immunodeficiency virus
  • health insurance
  • hepatitis c virus