Predicting Measles Outbreaks in the United States: Evaluation of Machine Learning Approaches.
Boshu RuStephanie A KujawskiNelson Lee AfanadorRichard BaumgartnerManjiri PawaskarAmarendra K DasPublished in: JMIR formative research (2023)
XGBoost provided more accurate predictions of measles cases at the county level compared with logistic regression. The threshold of prediction in this model can be adjusted to align with each county's resources, priorities, and risk for measles. While clustering pattern data from unsupervised machine learning approaches improved some aspects of model performance in this imbalanced data set, the optimal approach for the integration of such approaches with supervised machine learning models requires further investigation.