De-identifying Spanish medical texts - named entity recognition applied to radiology reports.
Irene Pérez-Díez, Raúl Pérez-Moraga, Adolfo López-Cerdán, Jose-Maria Salinas-Serrano, María de la Iglesia-Vayá
Published in: Journal of Biomedical Semantics (2021)
The proposed strategy, which combines named entity recognition with randomization of the detected entities, is suitable for Spanish radiology reports. It does not require a large training corpus, so it could easily be extended to other languages and to other medical texts, such as electronic health records.
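
To make the randomization step concrete, here is a minimal Python sketch of one plausible reading of it: each span flagged by an entity recognizer is replaced with a randomly chosen surrogate of the same type. The function name `randomize_entities`, the `SURROGATES` pools, the label names, and the `(start, end, label)` span format are all assumptions made for this illustration and are not taken from the paper. The recognition step is assumed to have already produced non-overlapping character offsets.

```python
import random

# Hypothetical surrogate pools, one per entity label (illustrative only).
SURROGATES = {
    "NAME": ["María García", "José Martínez", "Ana López"],
    "LOCATION": ["Hospital A", "Hospital B"],
    "DATE": ["01/01/2000", "15/06/2010"],
}


def randomize_entities(text, entities, seed=None):
    """Replace each (start, end, label) span with a random surrogate.

    Assumes the spans do not overlap. Labels without a surrogate pool
    fall back to a bracketed placeholder such as "[ID]".
    """
    rng = random.Random(seed)
    out = text
    # Work from right to left so that earlier offsets remain valid
    # after each replacement changes the string length.
    for start, end, label in sorted(entities, reverse=True):
        pool = SURROGATES.get(label)
        replacement = rng.choice(pool) if pool else f"[{label}]"
        out = out[:start] + replacement + out[end:]
    return out


if __name__ == "__main__":
    report = "Paciente Juan Pérez, visto en Valencia."
    name_start = report.index("Juan Pérez")
    place_start = report.index("Valencia")
    spans = [
        (name_start, name_start + len("Juan Pérez"), "NAME"),
        (place_start, place_start + len("Valencia"), "LOCATION"),
    ]
    print(randomize_entities(report, spans, seed=0))
```

Replacing entities with realistic surrogates, rather than masking them, keeps the text readable. The trade-off is that the surrogate pools must be large enough that the replacements themselves do not become identifying.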