OpenDeID Pipeline for Unstructured Electronic Health Record Text Notes Based on Rules and Transformers: Deidentification Algorithm Development and Validation Study.
Jiaxing LiuShalini GuptaAipeng ChenChen-Kai WangPratik MishraHong-Jie DaiZoie Shui-Yee WongJitendra JonnagaddalaPublished in: Journal of medical Internet research (2023)
The OpenDeID pipeline is a hybrid deidentification pipeline to deidentify SHI entities in unstructured EHR text notes. The pipeline has been evaluated on a large multicenter corpus. External validation will be undertaken as part of our future work to evaluate the effectiveness of the OpenDeID pipeline.