Login / Signup

Utilising identifier error variation in linkage of large administrative data sources.

Katie L HarronGareth Hagger-JohnsonRuth GilbertHarvey Goldstein
Published in: BMC medical research methodology (2017)
We provide empirical evidence on variation in rates of identifier error in a widely-used administrative data source and propose a new method for deriving match weights that incorporates additional data attributes. Our results demonstrate that incorporating information on variation by individual-level characteristics can help to reduce bias due to linkage error.
Keyphrases
  • electronic health record
  • big data
  • genome wide
  • healthcare
  • data analysis
  • hiv testing
  • artificial intelligence
  • social media