Explainable statistical learning in public health for policy development: the case of real-world suicide data.
Paul van SchaikYonghong PengAdedokun OjelabiJonathan LingPublished in: BMC medical research methodology (2019)
We demonstrate the effectiveness of regression techniques in the analysis of online public health data. Regression analysis for prediction and explanation can both be appropriate for public health data analysis for a better understanding of public health outcomes. It is therefore essential to clarify the aim of the analysis (prediction accuracy or theory development) as a basis for choosing the most appropriate model. We apply these techniques to the analysis of suicide data; however, we argue that the analysis presented in this study should be applied to datasets across public health in order to improve the quality of health policy recommendations.