Demonstration of the potential of white-box machine learning approaches to gain insights from cardiovascular disease electrocardiograms.
Thilo RiegJanek FrickHermann BaumgartlRicardo BuettnerPublished in: PloS one (2020)
We present the results from a white-box machine learning approach to detect cardiac arrhythmias using electrocardiographic data. A C5.0 is trained to recognize four classes using common features. The four classes are (i) atrial fibrillation and atrial flutter, (ii) tachycardias (iii), sinus bradycardia and (iv) sinus rhythm. Data from 10,646 subjects, 83% of whom have at least one arrhythmia and 17% of whom exhibit a normal sinus rhythm, are used. The C5.0 is trained using 10-fold cross-validation and is able to achieve a balanced accuracy of 95.35%. By using the white-box machine learning approach, a clear and comprehensible tree structure can be revealed, which has selected the 5 most important features from a total of 24 features. These 5 features are ventricular rate, RR-Interval variation, atrial rate, age and difference between longest and shortest RR-Interval. The combination of ventricular rate, RR-Interval variation and atrial rate is especially relevant to achieve classification accuracy, which can be disclosed through the tree. The tree assigns unique values to distinguish the classes. These findings could be applied in medicine in the future. It can be shown that a white-box machine learning approach can reveal granular structures, thus confirming known linear relationships and also revealing nonlinear relationships. To highlight the strength of the C5.0 with respect to this structural revelation, the results of further white-box machine learning and black-box machine learning algorithms are presented.
Keyphrases
- machine learning
- atrial fibrillation
- catheter ablation
- big data
- left atrial
- transcription factor
- artificial intelligence
- binding protein
- left ventricular
- cardiovascular disease
- heart failure
- deep learning
- left atrial appendage
- oral anticoagulants
- direct oral anticoagulants
- percutaneous coronary intervention
- type diabetes
- heart rate
- climate change
- high resolution
- venous thromboembolism