Login / Signup

Enhancing Preoperative Outcome Prediction: A Comparative Retrospective Case-Control Study on Machine Learning versus the International Esodata Study Group Risk Model for Predicting 90-Day Mortality in Oncologic Esophagectomy.

Axel WinterRobin P van de WaterBjarne PfitznerMarius Jonathan IbachChristoph RiepeRobert AhlbornLara FarajFelix KrenzienEva M DobrindtJonas RaakowIgor Maximillian SauerBert ArnrichKatharina BeyerChristian DeneckeJohann PratschkeMax M Maurer
Published in: Cancers (2024)
Risk prediction prior to oncologic esophagectomy is crucial for assisting surgeons and patients in their joint informed decision making. Recently, a new risk prediction model for 90-day mortality after esophagectomy using the International Esodata Study Group (IESG) database was proposed, allowing for the preoperative assignment of patients into different risk categories. However, given the non-linear dependencies between patient- and tumor-related risk factors contributing to cumulative surgical risk, machine learning (ML) may evolve as a novel and more integrated approach for mortality prediction. We evaluated the IESG risk model and compared its performance to ML models. Multiple classifiers were trained and validated on 552 patients from two independent centers undergoing oncologic esophagectomies. The discrimination performance of each model was assessed utilizing the area under the receiver operating characteristics curve (AUROC), the area under the precision-recall curve (AUPRC), and the Matthews correlation coefficient (MCC). The 90-day mortality rate was 5.8%. We found that IESG categorization allowed for adequate group-based risk prediction. However, ML models provided better discrimination performance, reaching superior AUROCs (0.64 [0.63-0.65] vs. 0.44 [0.32-0.56]), AUPRCs (0.25 [0.24-0.27] vs. 0.11 [0.05-0.21]), and MCCs (0.27 ([0.25-0.28] vs. 0.15 [0.03-0.27]). Conclusively, ML shows promising potential to identify patients at risk prior to surgery, surpassing conventional statistics. Still, larger datasets are needed to achieve higher discrimination performances for large-scale clinical implementation in the future.
Keyphrases