Evaluating Identity Disclosure Risk in Fully Synthetic Health Data: Model Development and Validation.
Khaled El EmamLucy MosqueraJason BassPublished in: Journal of medical Internet research (2020)
We have presented a comprehensive identity disclosure risk model for fully synthetic data. The results for this synthesis method on 2 datasets demonstrate that synthesis can reduce meaningful identity disclosure risks considerably. The risk model can be applied in the future to evaluate the privacy of fully synthetic data.