From Admission to Discharge: Predicting National Institutes of Health Stroke Scale Progression in Stroke Patients Using Biomarkers and Explainable Machine Learning.
Aimilios GkantziosChristos KokkotisDimitrios TsiptsiosSerafeim MoustakidisElena GkartzonikaTheodoros AvramidisGrigorios TrypsanisIoannis IliopoulosNikolaos AggelousisKonstantinos VadikoliasPublished in: Journal of personalized medicine (2023)
As a result of social progress and improved living conditions, which have contributed to a prolonged life expectancy, the prevalence of strokes has increased and has become a significant phenomenon. Despite the available stroke treatment options, patients frequently suffer from significant disability after a stroke. Initial stroke severity is a significant predictor of functional dependence and mortality following an acute stroke. The current study aims to collect and analyze data from the hyperacute and acute phases of stroke, as well as from the medical history of the patients, in order to develop an explainable machine learning model for predicting stroke-related neurological deficits at discharge, as measured by the National Institutes of Health Stroke Scale (NIHSS). More specifically, we approached the data as a binary task problem: improvement of NIHSS progression vs. worsening of NIHSS progression at discharge, using baseline data within the first 72 h. For feature selection, a genetic algorithm was applied. Using various classifiers, we found that the best scores were achieved from the Random Forest (RF) classifier at the 15 most informative biomarkers and parameters for the binary task of the prediction of NIHSS score progression. RF achieved 91.13% accuracy, 91.13% recall, 90.89% precision, 91.00% f1-score, 8.87% FN rate and 4.59% FP rate . Those biomarkers are: age, gender, NIHSS upon admission, intubation, history of hypertension and smoking, the initial diagnosis of hypertension, diabetes, dyslipidemia and atrial fibrillation, high-density lipoprotein (HDL) levels, stroke localization, systolic blood pressure levels, as well as erythrocyte sedimentation rate (ESR) levels upon admission and the onset of respiratory infection. The SHapley Additive exPlanations (SHAP) model interpreted the impact of the selected features on the model output. Our findings suggest that the aforementioned variables may play a significant role in determining stroke patients' NIHSS progression from the time of admission until their discharge.
Keyphrases
- atrial fibrillation
- blood pressure
- machine learning
- healthcare
- emergency department
- mental health
- heart failure
- type diabetes
- public health
- end stage renal disease
- big data
- electronic health record
- ejection fraction
- high density
- cerebral ischemia
- left atrial appendage
- catheter ablation
- left atrial
- direct oral anticoagulants
- metabolic syndrome
- climate change
- left ventricular
- coronary artery disease
- prognostic factors
- quality improvement
- adipose tissue
- percutaneous coronary intervention
- dna methylation
- intensive care unit
- risk assessment
- hepatitis b virus
- extracorporeal membrane oxygenation
- brain injury
- heart rate
- cardiac arrest
- hypertensive patients
- subarachnoid hemorrhage
- human health
- aortic dissection