Login / Signup

Text mining method to unravel long COVID's clinical condition in hospitalized patients.

Pilar Tavares Veras FlorentinoVinícius de Oliveira AraújoHenrique ZattiCaio Vinícius LuisCélia Regina Santos CavalcantiMatheus Henrique Citibaldi de OliveiraAnderson Henrique França Figueredo LeãoJuracy Bertoldo JuniorGeorge G Caique BarbosaErnesto RaveraAlberto CebukinRenata Bernardes DavidDanilo Batista Vieira de MeloTales Mota MachadoNancy C J BelleiViviane BoaventuraManoel Barral NettoSoraya Soubhi Smaili
Published in: Cell death & disease (2024)
Long COVID is characterized by persistent that extends symptoms beyond established timeframes. Its varied presentation across different populations and healthcare systems poses significant challenges in understanding its clinical manifestations and implications. In this study, we present a novel application of text mining technique to automatically extract unstructured data from a long COVID survey conducted at a prominent university hospital in São Paulo, Brazil. Our phonetic text clustering (PTC) method enables the exploration of unstructured Electronic Healthcare Records (EHR) data to unify different written forms of similar terms into a single phonemic representation. We used n-gram text analysis to detect compound words and negated terms in Portuguese-BR, focusing on medical conditions and symptoms related to long COVID. By leveraging text mining, we aim to contribute to a deeper understanding of this chronic condition and its implications for healthcare systems globally. The model developed in this study has the potential for scalability and applicability in other healthcare settings, thereby supporting broader research efforts and informing clinical decision-making for long COVID patients.
Keyphrases