An Efficient Method for Deidentifying Protected Health Information in Chinese Electronic Health Records: Algorithm Development and Validation.
Peng WangYong LiLiang YangSimin LiLinfeng LiZehan ZhaoShaopei LongFei WangHongqian WangYing LiChengliang WangPublished in: JMIR medical informatics (2022)
Compared to baseline methods, the efficiency advantage of TinyBERT on our proposed augmented data set was kept while the performance improved for the task of Chinese protected health information deidentification.