Contextual embedding bootstrapped neural network for medical information extraction of coronary artery disease records.
Xingxing CenJunyi YuanChangqing PanQinhua TangQunsheng MaPublished in: Medical & biological engineering & computing (2021)
Coronary artery disease (CAD) is the major cause of human death worldwide. The development of new CAD early diagnosis methods based on medical big data has a great potential to reduce the risk of CAD death. In this process, neural network (NN), as a powerful tool for electronic medical record (EMR) processing, enables extract structured data accurately to unlock medical information and to further improve CAD diagnosis. However, the excessive time and labor caused by dataset's annotation is the main limitation of its application, especially on the CAD records situation with large natural language text and biomedical professional content. In this study, we present an annotation cost saving NN approach for CAD records, which is bootstrapped by deep language model with contextual embedding pre-trained on large unannotated CAD corpus. To demonstrate the feasibility and to further evaluate the performance of our approach, we performed pre-training experiment and term classification experiment, by using the unannotated and annotated CAD records, respectively. The results showed that our contextual embedding bootstrapped NN for CAD records has better performance under the condition of annotations reduction.
Keyphrases
- coronary artery disease
- neural network
- big data
- percutaneous coronary intervention
- cardiovascular events
- coronary artery bypass grafting
- healthcare
- machine learning
- aortic stenosis
- preterm infants
- rna seq
- deep learning
- transcatheter aortic valve replacement
- acute coronary syndrome
- single cell
- health information
- climate change
- virtual reality
- left ventricular