Towards Predicting Length of Stay and Identification of Cohort Risk Factors Using Self-Attention-Based Transformers and Association Mining: COVID-19 as a Phenotype.
Fakhare AlamObieda AnanbehKhalid Mahmood MalikAbdulrahman Al OdayaniIbrahim Bin HussainNaoufel KaabiaAmal Al AidaroosAbdul Khader Jilani SaudagarPublished in: Diagnostics (Basel, Switzerland) (2023)
Predicting length of stay (LoS) and understanding its underlying factors is essential to minimizing the risk of hospital-acquired conditions, improving financial, operational, and clinical outcomes, and better managing future pandemics. The purpose of this study was to forecast patients' LoS using a deep learning model and to analyze cohorts of risk factors reducing or prolonging LoS. We employed various preprocessing techniques, SMOTE-N to balance data, and a TabTransformer model to forecast LoS. Finally, the Apriori algorithm was applied to analyze cohorts of risk factors influencing hospital LoS. The TabTransformer outperformed the base machine learning models in terms of F1 score (0.92), precision (0.83), recall (0.93), and accuracy (0.73) for the discharged dataset and F1 score (0.84), precision (0.75), recall (0.98), and accuracy (0.77) for the deceased dataset. The association mining algorithm was able to identify significant risk factors/indicators belonging to laboratory, X-ray, and clinical data, such as elevated LDH and D-dimer levels, lymphocyte count, and comorbidities such as hypertension and diabetes. It also reveals what treatments have reduced the symptoms of COVID-19 patients, leading to a reduction in LoS, particularly when no vaccines or medication, such as Paxlovid, were available.
Keyphrases
- risk factors
- machine learning
- deep learning
- sars cov
- big data
- healthcare
- coronavirus disease
- blood pressure
- type diabetes
- artificial intelligence
- electronic health record
- ejection fraction
- cardiovascular disease
- adverse drug
- emergency department
- working memory
- computed tomography
- mass spectrometry
- metabolic syndrome
- adipose tissue
- magnetic resonance imaging
- physical activity
- convolutional neural network
- high resolution
- patient reported outcomes
- depressive symptoms
- sleep quality
- respiratory syndrome coronavirus