Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy.
Jui-En LoEugene Yu-Chuan KangYun-Nung ChenYi-Ting HsiehNan-Kai WangTa-Ching ChenKuan-Jen ChenWei-Chi WuYih-Shiou HwangFu-Sung LoChi-Chun LaiPublished in: Journal of diabetes research (2021)
This study is aimed at evaluating a deep transfer learning-based model for identifying diabetic retinopathy (DR) that was trained using a dataset with high variability and predominant type 2 diabetes (T2D) and comparing model performance with that in patients with type 1 diabetes (T1D). The Kaggle dataset, which is a publicly available dataset, was divided into training and testing Kaggle datasets. In the comparison dataset, we collected retinal fundus images of T1D patients at Chang Gung Memorial Hospital in Taiwan from 2013 to 2020, and the images were divided into training and testing T1D datasets. The model was developed using 4 different convolutional neural networks (Inception-V3, DenseNet-121, VGG1, and Xception). The model performance in predicting DR was evaluated using testing images from each dataset, and area under the curve (AUC), sensitivity, and specificity were calculated. The model trained using the Kaggle dataset had an average (range) AUC of 0.74 (0.03) and 0.87 (0.01) in the testing Kaggle and T1D datasets, respectively. The model trained using the T1D dataset had an AUC of 0.88 (0.03), which decreased to 0.57 (0.02) in the testing Kaggle dataset. Heatmaps showed that the model focused on retinal hemorrhage, vessels, and exudation to predict DR. In wrong prediction images, artifacts and low-image quality affected model performance. The model developed with the high variability and T2D predominant dataset could be applied to T1D patients. Dataset homogeneity could affect the performance, trainability, and generalization of the model.
Keyphrases
- diabetic retinopathy
- deep learning
- convolutional neural network
- optical coherence tomography
- type diabetes
- healthcare
- image quality
- emergency department
- magnetic resonance imaging
- cardiovascular disease
- skeletal muscle
- chronic kidney disease
- adipose tissue
- rna seq
- insulin resistance
- metabolic syndrome
- functional connectivity
- electronic health record
- resting state
- contrast enhanced