Multimodal fusion learning for long QT syndrome pathogenic genotypes in a racially diverse population.
Joy J JiangHa My Thi VyAlexander W CharneyPatricia KovatchVivek ReddyPushkala JayaramanRon DoRohan KheraSumeet ChughDeepak L BhattAkhil VaidJoshua LampertGirish Nitin NadkarniPublished in: NPJ digital medicine (2024)
Congenital long QT syndrome (LQTS) diagnosis is complicated by limited genetic testing at scale, low prevalence, and normal QT corrected interval in patients with high-risk genotypes. We developed a deep learning approach combining electrocardiogram (ECG) waveform and electronic health record data to assess whether patients had pathogenic variants causing LQTS. We defined patients with high-risk genotypes as having ≥1 pathogenic variant in one of the LQTS-susceptibility genes. We trained the model using data from United Kingdom Biobank (UKBB) and then fine-tuned in a racially/ethnically diverse cohort using Mount Sinai BioMe Biobank. Following group-stratified 5-fold splitting, the fine-tuned model achieved area under the precision-recall curve of 0.29 (95% confidence interval [CI] 0.28-0.29) and area under the receiver operating curve of 0.83 (0.82-0.83) on independent testing data from BioMe. Multimodal fusion learning has promise to identify individuals with pathogenic genetic mutations to enable patient prioritization for further work up.
Keyphrases
- electronic health record
- clinical decision support
- deep learning
- big data
- case report
- end stage renal disease
- drug induced
- chronic kidney disease
- genome wide
- adverse drug
- air pollution
- ejection fraction
- pain management
- newly diagnosed
- copy number
- machine learning
- blood pressure
- heart rate
- gene expression
- dna methylation
- transcription factor
- chronic pain