Classification of COVID-19 Patients into Clinically Relevant Subsets by a Novel Machine Learning Pipeline Using Transcriptomic Features.
Andrea R DaamenPrathyusha S BachaliAmrie C GrammerPeter E LipskyPublished in: International journal of molecular sciences (2023)
The persistent impact of the COVID-19 pandemic and heterogeneity in disease manifestations point to a need for innovative approaches to identify drivers of immune pathology and predict whether infected patients will present with mild/moderate or severe disease. We have developed a novel iterative machine learning pipeline that utilizes gene enrichment profiles from blood transcriptome data to stratify COVID-19 patients based on disease severity and differentiate severe COVID cases from other patients with acute hypoxic respiratory failure. The pattern of gene module enrichment in COVID-19 patients overall reflected broad cellular expansion and metabolic dysfunction, whereas increased neutrophils, activated B cells, T-cell lymphopenia, and proinflammatory cytokine production were specific to severe COVID patients. Using this pipeline, we also identified small blood gene signatures indicative of COVID-19 diagnosis and severity that could be used as biomarker panels in the clinical setting.
Keyphrases
- sars cov
- machine learning
- genome wide
- respiratory failure
- coronavirus disease
- copy number
- single cell
- respiratory syndrome coronavirus
- early onset
- big data
- artificial intelligence
- deep learning
- genome wide identification
- rna seq
- dna methylation
- gene expression
- extracorporeal membrane oxygenation
- oxidative stress
- computed tomography
- drug induced
- high intensity
- magnetic resonance
- electronic health record
- magnetic resonance imaging
- mechanical ventilation
- data analysis