Login / Signup

Linking Biomedical Data Warehouse Records With the National Mortality Database in France: Large-scale Matching Algorithm.

Vianney GuardiolleAdrien BazogeEmmanuel MorinBéatrice DailleDelphine ToublantGuillaume BouzilléYouenn MerelMorgane Pierre-JeanAlexandre FiliotMarc CuggiaMatthieu WargnyAntoine LamerPierre-Antoine Gourraud
Published in: JMIR medical informatics (2022)
Overall, sensitivity/recall was 11% higher using the DLD-based algorithm than that using the direct algorithm. This shows the importance of advanced data cleaning and knowledge of a naming system through DLD use. Statistically significant differences in sensitivity between groups could be found and must be considered when performing an analysis to avoid differential biases. Our algorithm, originally conceived for linking a BDW with the FNMD, can be used to match any large-scale databases. While matching operations using names are considered sensitive computational operations, the Inseehop package released here is easy to run on premises, thereby facilitating compliance with cybersecurity local framework. The use of an advanced deterministic matching algorithm such as the DLD-based algorithm is an insightful example of combining open-source external data to improve the usage value of BDWs.
Keyphrases