
The Data-Adaptive Fellegi-Sunter Model for Probabilistic Record Linkage: Algorithm Development and Validation for Incorporating Missing Data and Field Selection.

Xiaochun Li, Huiping Xu, Shaun J Grannis
Published in: Journal of Medical Internet Research (2022)
Missing at random (MAR) is a reasonable assumption in real-world record linkage applications: it maintains or improves F1-scores whether the matching fields are expert-specified or data-driven. Data-driven field selection coupled with MAR achieves the best overall performance, which can be especially useful in privacy-preserving record linkage.
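To illustrate what the MAR assumption means for scoring a record pair, here is a minimal sketch of a Fellegi-Sunter match weight in which a field with a missing value is skipped rather than counted as a disagreement. It assumes conditionally independent fields. The field names, the m- and u-probabilities, and the `match_weight` helper are hypothetical values chosen for illustration; they are not taken from the paper, and the sketch does not reproduce the authors' estimation or field-selection procedure.

```python
import math

# Hypothetical parameters (illustrative only, not from the paper):
# m = P(field agrees | records are a true match)
# u = P(field agrees | records are a non-match)
M_PROBS = {"last_name": 0.95, "birth_year": 0.98, "zip": 0.90}
U_PROBS = {"last_name": 0.01, "birth_year": 0.02, "zip": 0.05}


def match_weight(agreement, m_probs=M_PROBS, u_probs=U_PROBS):
    """Sum log2 likelihood ratios over the observed fields of one record pair.

    `agreement` maps field name -> True (agree), False (disagree),
    or None (value missing in at least one of the two records).
    Under MAR with conditionally independent fields, a missing
    comparison drops out of the likelihood ratio, so it adds 0 here.
    """
    weight = 0.0
    for field, agrees in agreement.items():
        if agrees is None:
            continue  # missing comparison: no evidence either way
        m, u = m_probs[field], u_probs[field]
        if agrees:
            weight += math.log2(m / u)
        else:
            weight += math.log2((1 - m) / (1 - u))
    return weight


if __name__ == "__main__":
    pair_missing_zip = {"last_name": True, "birth_year": True, "zip": None}
    pair_zip_disagrees = {"last_name": True, "birth_year": True, "zip": False}
    print(match_weight(pair_missing_zip))
    print(match_weight(pair_zip_disagrees))
```

With these illustrative parameters, the pair with a missing `zip` scores higher than the pair whose `zip` is recorded as a disagreement, which is the practical difference between ignoring missing values under MAR and treating them as mismatches.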
Keyphrases
  • big data
  • genome wide
  • hiv testing
  • electronic health record
  • machine learning
  • men who have sex with men
  • deep learning
  • artificial intelligence
  • health information
  • data analysis
  • gene expression