Machine Learning Approach for Cardiovascular Death Prediction among Nonalcoholic Steatohepatitis (NASH) Liver Transplant Recipients.
Yasin FatemiMohsen NikfarAmir OladazimiJingyi ZhengHaley HoyHaneen AliPublished in: Healthcare (Basel, Switzerland) (2024)
Cardiovascular disease is the leading cause of mortality among nonalcoholic steatohepatitis (NASH) patients who undergo liver transplants. In the present study, machine learning algorithms were used to identify important risk factors for cardiovascular death and to develop a prediction model. The Standard Transplant Analysis and Research data were gathered from the Organ Procurement and Transplantation Network. After cleaning and preprocessing, the dataset comprised 10,871 patients and 92 features. Recursive feature elimination (RFE) and select from model (SFM) were applied to select relevant features from the dataset and avoid overfitting. Multiple machine learning algorithms, including logistic regression, random forest, decision tree, and XGBoost, were used with RFE and SFM. Additionally, prediction models were developed using a support vector machine, Gaussian naïve Bayes, K-nearest neighbors, random forest, and XGBoost algorithms. Finally, SHapley Additive exPlanations (SHAP) were used to increase interpretability. The findings showed that the best feature selection method was RFE with a random forest estimator, and the most critical features were recipient and donor blood type, body mass index, recipient and donor state of residence, serum creatinine, and year of transplantation. Furthermore, among all the outcomes, the XGBoost model had the highest performance, with an accuracy value of 0.6909 and an area under the curve value of 0.86. The findings also revealed a predictive relationship between features and cardiovascular death after liver transplant among NASH patients. These insights may assist clinical decision-makers in devising strategies to prevent cardiovascular complications in post-liver transplant NASH patients.
Keyphrases
- machine learning
- end stage renal disease
- cardiovascular disease
- deep learning
- ejection fraction
- body mass index
- newly diagnosed
- chronic kidney disease
- big data
- prognostic factors
- climate change
- artificial intelligence
- type diabetes
- adipose tissue
- peritoneal dialysis
- stem cells
- electronic health record
- insulin resistance
- patient reported outcomes
- weight loss