Leveraging global binary masks for structure segmentation in medical images.
Mahdieh KazemimoghadamZi YangMingli ChenLin MaWeiguo LuXuejun GuPublished in: Physics in medicine and biology (2023)
Deep learning (DL) models for medical image segmentation are highly influenced by intensity variations of input images and lack generalization due to primarily utilizing pixels' intensity information for inference. Acquiring sufficient training data is another challenge limiting models' applications. Here, we proposed to leverage the consistency of organs' anatomical position and shape information in medical images. We introduced a framework leveraging recurring anatomical patterns through global binary masks for organ segmentation. Two scenarios were studied: (1) Global binary masks were the only input for the U-Net based model, forcing exclusively encoding organs' position and shape information for rough segmentation or localization. (2) Global binary masks were incorporated as an additional channel providing position/shape clues to mitigate training data scarcity. Two datasets of the brain and heart CT images with their ground-truth were split into (26:10:10) and (12:3:5) for training, validation, and test respectively. The two scenarios were evaluated using full training split as well as reduced subsets of training data. In scenario (1), training exclusively on global binary masks led to Dice scores of 0.77±0.06 and 0.85±0.04 for the brain and heart structures respectively. Average Euclidian distance of 3.12±1.43mm and 2.5±0.93mm were obtained relative to the center of mass of the ground truth for the brain and heart structures respectively. The outcomes indicated encoding a surprising degree of position and shape information through global binary masks. In scenario (2), incorporating global binary masks led to significantly higher accuracy relative to the model trained on only CT images in small subsets of training data; the performance improved by 4.3-125.3% and 1.3-48.1% for 1-8 training cases of the brain and heart datasets respectively. The findings imply the advantages of utilizing global binary masks for building models that are robust to image intensity variations as well as an effective approach to boost performance when access to labeled training data is highly limited.
Keyphrases
- deep learning
- convolutional neural network
- virtual reality
- artificial intelligence
- ionic liquid
- healthcare
- electronic health record
- heart failure
- resting state
- white matter
- optical coherence tomography
- machine learning
- computed tomography
- climate change
- magnetic resonance
- insulin resistance
- magnetic resonance imaging
- metabolic syndrome
- multiple sclerosis
- type diabetes
- skeletal muscle
- functional connectivity
- blood brain barrier
- subarachnoid hemorrhage
- weight loss
- resistance training