Complex Machine-Learning Algorithms and Multivariable Logistic Regression on Par in the Prediction of Insufficient Clinical Response to Methotrexate in Rheumatoid Arthritis.
Helen R Gosselt, Maxime M A Verhoeven, Maja Bulatović-Ćalasan, Paco M Welsing, Maurits C F J de Rotte, Johanna M W Hazes, Floris P J G Lafeber, Mark Hoogendoorn, Robert de Jonge
Published in: Journal of Personalized Medicine (2021)
The goals of this study were, first, to examine whether machine-learning algorithms outperform multivariable logistic regression in the prediction of insufficient response to methotrexate (MTX); second, to examine which features are essential for correct prediction; and finally, to investigate whether the best performing model specifically identifies insufficient responders to MTX (combination) therapy. The prediction of insufficient response (3-month Disease Activity Score 28-erythrocyte sedimentation rate (DAS28-ESR) > 3.2) was assessed using logistic regression, least absolute shrinkage and selection operator (LASSO), random forest, and extreme gradient boosting (XGBoost). The baseline features of 355 rheumatoid arthritis (RA) patients from the "treatment in the Rotterdam Early Arthritis CoHort" (tREACH) and the U-Act-Early trial were combined for analyses. Model performances were compared using the area under the curve (AUC) of receiver operating characteristic (ROC) curves with 95% confidence intervals (95% CI), and sensitivity and specificity. Finally, the best performing model following feature selection was tested on 101 RA patients starting tocilizumab (TCZ) monotherapy. Logistic regression (AUC = 0.77, 95% CI: 0.68-0.86) performed as well as LASSO (AUC = 0.76, 95% CI: 0.67-0.85), random forest (AUC = 0.71, 95% CI: 0.61-0.81), and XGBoost (AUC = 0.70, 95% CI: 0.61-0.81), yet logistic regression reached the highest sensitivity (81%). The most important features were baseline DAS28 (components). For all algorithms, models with six features performed similarly to those with 16. When applied to the TCZ-monotherapy group, logistic regression's sensitivity dropped significantly from 83% to 69% (p = 0.03). In the current dataset, logistic regression performed as well as machine-learning algorithms in the prediction of insufficient response to MTX. Models could be reduced to six features, which makes them more conducive to clinical implementation. Interestingly, the prediction model was specific to MTX (combination) therapy response.
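The comparison metrics named in the abstract (AUC with a 95% CI, sensitivity, and specificity) can be illustrated with a minimal, standard-library-only sketch. Several things here are assumptions and not taken from the study: the percentile bootstrap is just one common way to obtain a confidence interval (the abstract does not say how the intervals were computed), the 0.5 decision threshold is arbitrary, the function names are hypothetical, and the toy labels and scores are invented for illustration and are not patient data.

```python
import random


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(y_true, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the AUC (an assumed method, see lead-in)."""
    rng = random.Random(seed)
    n = len(y_true)
    aucs = []
    while len(aucs) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [y_true[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # resample lacks one class, so AUC is undefined
        aucs.append(roc_auc(ys, [scores[i] for i in idx]))
    aucs.sort()
    lower = aucs[int((alpha / 2) * n_boot)]
    upper = aucs[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


def sensitivity_specificity(y_true, scores, threshold=0.5):
    """Sensitivity and specificity when scores >= threshold are called positive."""
    pairs = list(zip(y_true, scores))
    tp = sum(1 for y, s in pairs if y == 1 and s >= threshold)
    fn = sum(1 for y, s in pairs if y == 1 and s < threshold)
    tn = sum(1 for y, s in pairs if y == 0 and s < threshold)
    fp = sum(1 for y, s in pairs if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Toy example (invented values): 1 = insufficient responder, 0 = responder.
y_toy = [1, 1, 1, 0, 0, 0, 1, 0]
s_toy = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.7, 0.1]

auc_toy = roc_auc(y_toy, s_toy)
ci_low, ci_high = bootstrap_auc_ci(y_toy, s_toy)
sens_toy, spec_toy = sensitivity_specificity(y_toy, s_toy)
print(f"AUC = {auc_toy:.2f}, 95% CI: {ci_low:.2f}-{ci_high:.2f}")
print(f"sensitivity = {sens_toy:.2f}, specificity = {spec_toy:.2f}")
```

With predicted probabilities from two models on the same held-out patients, calling these functions once per model would give the kind of side-by-side figures reported above; overlapping intervals are only a rough indication that the models perform comparably and are not a formal test.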
Keyphrases
- machine learning
- rheumatoid arthritis
- disease activity
- combination therapy
- rheumatoid arthritis patients
- systemic lupus erythematosus
- end stage renal disease
- ankylosing spondylitis
- artificial intelligence
- deep learning
- big data
- ejection fraction
- chronic kidney disease
- juvenile idiopathic arthritis
- newly diagnosed
- gene expression
- climate change
- interstitial lung disease
- primary care
- peritoneal dialysis
- prognostic factors
- high dose
- systemic sclerosis
- randomized controlled trial
- idiopathic pulmonary fibrosis
- dna methylation
- patient reported outcomes
- low dose