Modeling Global Monkeypox Infection Spread Data: A Comparative Study of Time Series Regression and Machine Learning Models.
Vishwajeet SinghSaif Ali KhanSubhash Kumar YadavYusuf AkhterPublished in: Current microbiology (2023)
The global impact of COVID-19 has heightened concerns about emerging viral infections, among which monkeypox (MPOX) has become a significant public health threat. To address this, our study employs a comprehensive approach using three statistical techniques: Distribution fitting, ARIMA modeling, and Random Forest machine learning to analyze and predict the spread of MPOX in the top ten countries with high infection rates. We aim to provide a detailed understanding of the disease dynamics and model theoretical distributions using country-specific datasets to accurately assess and forecast the disease's transmission. The data from the considered countries are fitted into ARIMA models to determine the best time series regression model. Additionally, we employ the random forest machine learning approach to predict the future behavior of the disease. Evaluating the Root Mean Square Errors (RMSE) for both models, we find that the random forest outperforms ARIMA in six countries, while ARIMA performs better in the remaining four countries. Based on these findings, robust policy-making should consider the best fitted model for each country to effectively manage and respond to the ongoing public health threat posed by monkeypox. The integration of multiple modeling techniques enhances our understanding of the disease dynamics and aids in devising more informed strategies for containment and control.