A tailored machine learning approach for mortality prediction in severe COVID-19 treated with glucocorticoids.
F SaltonM RispoliP ConfalonieriA De NesE SpagnolA SalottiB RuaroS HarariA RoccaL ManzoniM ConfalonieriPublished in: The international journal of tuberculosis and lung disease : the official journal of the International Union against Tuberculosis and Lung Disease (2024)
<sec><title>BACKGROUND</title>The impact of severe COVID-19 pneumonia on healthcare systems highlighted the need for accurate predictions to improve patient outcomes. Despite the established efficacy of glucocorticoids (GCs), variable patient responses are observed, and the existing clinical scores are limited in predicting non-responders. We propose a machine learning (ML) based approach to predict mortality in COVID-19 pneumonia treated with GCs.</sec><sec><title>METHODS</title>This is an ML-driven retrospective analysis involving 825 patients. We leveraged XGBoost to select the most appropriate features from the initial 52, including clinical and laboratory data. Six different ML techniques were compared. Shapley additive explanation (SHAP) values were used to describe the influence of each feature on classification. Internal validation was performed.</sec><sec><title>RESULTS</title>Nine key predictors of death were identified: increasing C-reactive protein (CRP), decreasing arterial partial pressure of oxygen to fraction of inspired oxygen ratio (PaO₂/FiO₂), age, coronary artery disease, invasive mechanical ventilation, acute renal failure, chronic heart failure, PaO₂/FiO₂ earliest value, and body mass index. Random forest achieved the highest test area under the receiver operating characteristic curve at 0.938 (95% CI 0.903-0.969). SHAP values highlighted age and PaO₂/FiO₂ improvement as the most influential features; the latter showed a higher impact than CRP reduction over time.</sec><sec><title>CONCLUSION</title>The proposed ML algorithm effectively predicts the risk of hospital death in COVID-19 pneumonia patients undergoing GCs. This approach can be adapted to datasets measuring similar clinical variables.</sec>.
Keyphrases
- machine learning
- coronavirus disease
- sars cov
- respiratory failure
- healthcare
- mechanical ventilation
- body mass index
- deep learning
- coronary artery disease
- patients undergoing
- ejection fraction
- big data
- newly diagnosed
- cardiovascular events
- artificial intelligence
- end stage renal disease
- climate change
- risk factors
- early onset
- electronic health record
- high resolution
- type diabetes
- respiratory syndrome coronavirus
- case report
- heart failure
- peritoneal dialysis
- acute coronary syndrome
- smoking cessation
- percutaneous coronary intervention
- acute care