Delineating COVID-19 subgroups using routine clinical data identifies distinct in-hospital outcomes.
Bojidar A RangelovAlexandra L YoungWatjana LilaonitkulShahab AslaniPaul TaylorEyjólfur GuðmundssonQianye YangYipeng HuJohn R HurstDavid J HawkesJoseph Jacobnull nullPublished in: Scientific reports (2023)
The COVID-19 pandemic has been a great challenge to healthcare systems worldwide. It highlighted the need for robust predictive models which can be readily deployed to uncover heterogeneities in disease course, aid decision-making and prioritise treatment. We adapted an unsupervised data-driven model-SuStaIn, to be utilised for short-term infectious disease like COVID-19, based on 11 commonly recorded clinical measures. We used 1344 patients from the National COVID-19 Chest Imaging Database (NCCID), hospitalised for RT-PCR confirmed COVID-19 disease, splitting them equally into a training and an independent validation cohort. We discovered three COVID-19 subtypes (General Haemodynamic, Renal and Immunological) and introduced disease severity stages, both of which were predictive of distinct risks of in-hospital mortality or escalation of treatment, when analysed using Cox Proportional Hazards models. A low-risk Normal-appearing subtype was also discovered. The model and our full pipeline are available online and can be adapted for future outbreaks of COVID-19 or other infectious disease.
Keyphrases
- coronavirus disease
- sars cov
- infectious diseases
- healthcare
- decision making
- respiratory syndrome coronavirus
- machine learning
- open label
- randomized controlled trial
- end stage renal disease
- gene expression
- quality improvement
- big data
- insulin resistance
- clinical trial
- health information
- genome wide
- climate change
- smoking cessation
- peritoneal dialysis
- adverse drug
- patient reported outcomes