Prognostic Model of COVID-19 Severity and Survival among Hospitalized Patients Using Machine Learning Techniques.
Ivano Lodato, Aditya Varna Iyer, Isaac Zachary To, Zhong-Yuan Lai, Helen Shuk-Ying Chan, Winnie Suk-Wai Leung, Tommy Hing-Cheung Tang, Victor Kai-Lam Cheung, Tak-Chiu Wu, George Wing-Yiu Ng
Published in: Diagnostics (Basel, Switzerland) (2022)
We conducted a statistical study and developed machine learning models to triage patients hospitalized with COVID-19 during the height of the pandemic in Hong Kong, based on the medical records and test results (features) collected during their hospitalization. As a preliminary step, we study the correlation between these features and both discharge status and disease severity, in order to identify the features with the most pronounced effect on patient outcome. The identified features then serve as inputs to four machine learning models (Decision Tree, Random Forest, Gradient Boosting, and RUSBoost), which predict both the Mortality and the Severity associated with the disease. We test the accuracy of the models as the number of input features is varied and find them to be stable: the models are already highly predictive when run over a core set of six features. We show that the Random Forest and Gradient Boosting classifiers are highly accurate both in predicting patients' Mortality (average accuracy ∼99%) and in categorizing patients into four distinct risk classes according to the Severity of COVID-19 infection (average accuracy ∼91%). Our methodical and broad approach combines statistical insights with various machine learning models, paving the way for AI-assisted triage and prognosis of COVID-19 cases, and is potentially generalizable to seasonal influenza.
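The preliminary correlation step described in the abstract can be illustrated with a minimal sketch. It assumes that features are ranked by the absolute Pearson correlation with a binary outcome and that the top k are kept as the core set; the abstract does not specify the exact statistic or selection rule, so this is only one plausible reading. The function names (`pearson`, `rank_features`, `select_core`), the feature names, and all values below are hypothetical toy placeholders, not data from the study.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def rank_features(features, outcome):
    """Return (name, |correlation|) pairs sorted from strongest to weakest."""
    scores = {name: abs(pearson(values, outcome))
              for name, values in features.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def select_core(features, outcome, k):
    """Keep the names of the k features most correlated with the outcome."""
    return [name for name, _ in rank_features(features, outcome)[:k]]


# Hypothetical toy data: three made-up features for six made-up patients.
toy_features = {
    "feature_a": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "feature_b": [5.0, 3.0, 4.0, 4.0, 3.0, 5.0],
    "feature_c": [6.0, 5.0, 4.0, 3.0, 2.0, 2.0],
}
toy_outcome = [0, 0, 0, 1, 1, 1]  # hypothetical binary outcome label

core = select_core(toy_features, toy_outcome, k=2)
print(core)
```

In a pipeline of the kind the abstract outlines, the selected columns would then be passed to the classifiers, and the accuracy would be re-evaluated as k is varied to check how stable the predictions are.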
Keyphrases
- machine learning
- end stage renal disease
- sars cov
- newly diagnosed
- emergency department
- ejection fraction
- chronic kidney disease
- coronavirus disease
- artificial intelligence
- peritoneal dialysis
- prognostic factors
- cardiovascular disease
- type diabetes
- healthcare
- physical activity
- cardiovascular events
- risk factors
- deep learning
- big data
- case report
- high resolution
- coronary artery disease
- patient reported