Identification of mycoplasma pneumonia in children based on fusion of multi-modal clinical free-text description and structured test data.
Jingna XieYingshuo WangQiuyang ShengXiaoqing LiuJing LiFenglei SunYuqi WangShuxian LiYiming LiYizhou YuGang YuPublished in: Health informatics journal (2024)
Mycoplasma pneumonia may lead to hospitalizations and pose life-threatening risks in children. The automated identification of mycoplasma pneumonia from electronic medical records holds significant potential for improving the efficiency of hospital resource allocation. In this study, we proposed a novel method for identifying mycoplasma pneumonia by integrating multi-modal features derived from both free-text descriptions and structured test data in electronic medical records. Our approach begins with the extraction of free-text and structured data from clinical records through a systematic preprocessing pipeline. Subsequently, we employ a pre-trained transformer language model to extract features from the free-text, while multiple additive regression trees are used to transform features from the structured data. An attention-based fusion mechanism is then applied to integrate these multi-modal features for effective classification. We validated our method using clinic records of 7157 patients, retrospectively collected for training and testing purposes. The experimental results demonstrate that our proposed multi-modal fusion approach achieves significant improvements over other methods across four key performance metrics.
Keyphrases
- electronic health record
- big data
- smoking cessation
- machine learning
- end stage renal disease
- young adults
- respiratory tract
- deep learning
- chronic kidney disease
- primary care
- ejection fraction
- newly diagnosed
- respiratory failure
- working memory
- autism spectrum disorder
- emergency department
- human health
- intensive care unit
- high throughput
- body composition
- acute care