Covid19-Mexican-Patients' Dataset (Covid19MPD) Classification and Prediction Using Feature Importance.
Khaled Mohamad AlmustafaPublished in: Concurrency and computation : practice & experience (2021)
Coronavirus disease, Covid19, pandemic has a great effect on human heath worldwide since it was first detected in late 2019. A clear understanding of the structure of the available Covid19 datasets might give the healthcare provider a better understanding of identifying some of the cases at an early stage. In this article, we will be looking into a Covid19 Mexican Patients' Dataset (Covid109MPD), and we will apply number of machine learning algorithms on the dataset to select the best possible classification algorithm for the death and survived cases in Mexico, then we will study the performance of the enhancement of the specified classifiers in term of their features selection in order to be able to predict sever, and or death, cases from the available dataset. Results show that J48 classifier gives the best classification accuracy with 94.41% and RMSE = 0.2028 and ROC = 0.919, compared to other classifiers, and when using feature selection method, J48 classifier can predict a surviving Covid19MPD case within 94.88% accuracy, and by using only 10 out of the total 19 features.
Keyphrases
- coronavirus disease
- machine learning
- sars cov
- deep learning
- end stage renal disease
- early stage
- healthcare
- respiratory syndrome coronavirus
- ejection fraction
- artificial intelligence
- newly diagnosed
- big data
- endothelial cells
- chronic kidney disease
- prognostic factors
- primary care
- patient reported
- health insurance
- health information
- lymph node