Evaluation of Automated Public De-Identification Tools on a Corpus of Radiology Reports.
Jackson M SteinkampTaylor PomeranzJason AdlebergCharles E KahnTessa S CookPublished in: Radiology. Artificial intelligence (2020)
PHI appeared infrequently within the corpus of reports studied, which created difficulties for training machine learning systems. Out-of-the-box de-identification tools achieved limited performance on the corpus of radiology reports, suggesting the need for further advancements in public datasets and trained models.Supplemental material is available for this article.See also the commentary by Tenenholtz and Wood in this issue.© RSNA, 2020.