Toward understanding deep learning classification of anatomic sites: lessons from the development of a CBCT projection classifier.
Juan P Cruz-BastidaErik PearsonHania A Al-HallaqPublished in: Journal of medical imaging (Bellingham, Wash.) (2022)
Purpose: Deep learning (DL) applications strongly depend on the training dataset and convolutional neural network architecture; however, it is unclear how to objectively select such parameters. We investigate the classification performance of different DL models and training schemes for the anatomic classification of cone-beam computed tomography (CBCT) projections. Approach: CBCT scans from 1055 patients were collected and manually classified into five anatomic classes and used to develop DL models to predict the anatomic class from single x-ray projections. VGG-16, Xception, and Inception v3 architectures were trained with 75% of the data, and the remaining 25% was used for testing and evaluation. To study the dependence of the classification performance on dataset size, training data was downsampled to various dataset sizes. Gradient-weighted class activation maps (grad-CAM) were generated using the model with highest classification performance, to identify regions with strong influence on CNN decisions. Results: The highest precision and recall values were achieved with VGG-16. One of the best performing combinations was the VGG-16 trained with 90 deg projections (mean class precision = 0.87). The training dataset size could be reduced to ∼ 50 % of its initial size, without compromising the classification performance. For correctly classified cases, Grad-CAM were more heavily weighted for anatomically relevant regions. Conclusions: It was possible to determine those dependencies with a higher influence on the classification performance of DL models for the studied task. Grad-CAM enabled the identification of possible sources of class confusion.
Keyphrases
- deep learning
- convolutional neural network
- machine learning
- cone beam computed tomography
- artificial intelligence
- big data
- image quality
- end stage renal disease
- chronic kidney disease
- computed tomography
- magnetic resonance
- high resolution
- newly diagnosed
- contrast enhanced
- physical activity
- electronic health record
- prognostic factors
- magnetic resonance imaging