Predictive Modeling of COVID-19 Readmissions: Insights from Machine Learning and Deep Learning Approaches.
Wei Kit Loo, Wingates Voon, Anwar Suhaimi, Cindy Shuan Ju Teh, Yee Kai Tee, Yan Chai Hum, Khairunnisa Hasikin, Kareen Teo, Hang Cheng Ong, Khin Wee Lai
Published in: Diagnostics (Basel, Switzerland) (2024)
This study applies artificial intelligence, specifically machine learning and deep learning, to assess COVID-19 readmission risk in Malaysia, with the aim of providing tools that reduce strain on healthcare resources and improve patient outcomes. It outlines a methodology for classifying COVID-19 readmissions, beginning with a description of the dataset and its pre-processing. The data were balanced with three methods: Random Oversampling (ROS), Borderline SMOTE (BSMOTE), and Adaptive Synthetic Sampling (ADASYN). Nine machine learning and ten deep learning techniques were applied and evaluated with stratified five-fold cross-validation. Optuna was used for hyperparameter selection, while training hyperparameters were kept consistent. Evaluation metrics comprised accuracy, AUC, and training and inference times, and results are reported for each data-balancing method. CatBoost consistently achieved the best accuracy and AUC across all result tables. With ROS, it reached the highest accuracy (0.9882 ± 0.0020) with an AUC of 1.0000 ± 0.0000, and it remained the top performer with BSMOTE and ADASYN. Deep learning approaches also performed well, with SAINT leading under ROS and TabNet leading under BSMOTE and ADASYN. Decision-tree ensembles such as Random Forest and XGBoost consistently showed strong performance.
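As a rough illustration of how Random Oversampling (ROS) can be combined with stratified five-fold cross-validation, the sketch below uses only the Python standard library. It is not the authors' pipeline: the toy labels, the helper names `random_oversample` and `stratified_folds`, and the choice to oversample only the training portion of each fold are all assumptions made for illustration. The abstract does not say at which stage balancing was applied.

```python
import random
from collections import Counter, defaultdict


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of smaller classes until every
    class has as many rows as the largest class (hypothetical helper)."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        extra = [rng.choice(members) for _ in range(target - len(members))]
        for row in members + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


def stratified_folds(labels, k=5, seed=0):
    """Split indices into k folds that each keep roughly the original
    class proportions (hypothetical helper)."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each class's indices round-robin across the folds.
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)
    return folds


# Toy imbalanced data (assumed): 90 "not readmitted" (0), 10 "readmitted" (1).
labels = [0] * 90 + [1] * 10
rows = [[float(i)] for i in range(100)]

folds = stratified_folds(labels, k=5)
for fold_id, test_idx in enumerate(folds):
    held_out = set(test_idx)
    train_idx = [i for i in range(len(labels)) if i not in held_out]
    # Assumption: balance only the training split so that duplicated
    # rows never appear in the held-out fold.
    train_rows, train_labels = random_oversample(
        [rows[i] for i in train_idx],
        [labels[i] for i in train_idx],
        seed=fold_id,
    )
    test_counts = Counter(labels[i] for i in test_idx)
    print(fold_id, Counter(train_labels), test_counts)
```

In this sketch the held-out fold keeps the original class ratio while the training split is balanced. Balancing the full dataset before splitting would be a different design, one in which duplicated rows can appear on both sides of a split.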
Keyphrases
- deep learning
- artificial intelligence
- machine learning
- big data
- coronavirus disease
- sars cov
- convolutional neural network
- healthcare
- electronic health record
- cell death
- dna damage
- reactive oxygen species
- climate change
- virtual reality
- respiratory syndrome coronavirus
- computed tomography
- social media
- health insurance
- decision making