Issues With Variability in Electronic Health Record Data About Race and Ethnicity: Descriptive Analysis of the National COVID Cohort Collaborative Data Enclave.
Lily A CookJuan C EspinozaNicole G WeiskopfNisha MathewsDavid Andrew DorrKelly L GonzalesAdam B WilcoxCharisse R Madlock-Brownnull nullPublished in: JMIR medical informatics (2022)
Differences in how race and ethnicity data are conceptualized and encoded by health care institutions can affect the quality of the data in aggregated clinical databases. The impact of data quality issues in the N3C Data Enclave was not equal across all races and ethnicities, which has the potential to introduce bias in analyses and conclusions drawn from these data. Transparency about how data have been transformed can help users make accurate analyses and inferences and eventually better guide clinical care and public policy.