Login / Signup

Generalizable Long COVID Subtypes: Findings from the NIH N3C and RECOVER Programs.

Justin T ReeseHannah BlauTimothy BergquistJohanna J LoombaTiffany J CallahanBryan LarawayCorneliu AntonescuElena CasiraghiBen ColemanMichael GarganoKenneth WilkinsLuca CappellettiTommaso FontanaNariman AmmarBlessy AntonyT M MuraliGuy KarlebachJulie A McMurryAndrew WilliamsRichard MoffittJineta BanerjeeAnthony E SolomonidesHannah DavisKristin KostkaGiorgio ValentiniDavid SahnerChristopher G ChuteCharisse Madlock-BrownMelissa A HaendelPeter Nick Robinsonnull null
Published in: medRxiv : the preprint server for health sciences (2022)
Accurate stratification of patients with Post-acute sequelae of SARS-CoV-2 infection (PASC, or long COVID) would allow precision clinical management strategies and could enable more focussed investigation of the molecular pathogenetic mechanisms of this disease. However, the natural history of long COVID is incompletely understood and characterized by an extremely wide range of manifestations that are difficult to analyze computationally. In addition, the generalizability of machine learning classification of COVID-19 clinical outcomes has rarely been tested. We present a method for computationally modeling long COVID phenotype data based on electronic healthcare records (EHRs) and for assessing pairwise phenotypic similarity between patients using semantic similarity. Using unsupervised machine learning (k-means clustering), we found six distinct clusters of long COVID patients, each with distinct profiles of phenotypic abnormalities with enrichments in pulmonary, cardiovascular, neuropsychiatric, and constitutional symptoms such as fatigue and fever. There was a highly significant association of cluster membership with a range of pre-existing conditions and with measures of severity during acute COVID-19. We show that the clusters we identified in one hospital system were generalizable across different hospital systems. Semantic phenotypic clustering can provide a foundation for assigning patients to stratified subgroups for natural history or therapy studies on long COVID.
Keyphrases