VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation.

Tiange LiuQingze BaiDrew A TorigianYubing TongJayaram K Udupa

Published in: Medical image analysis (2024)

The proposed hybrid Transformer-based backbone network for 3D medical image segmentation can tightly integrate self-attention and convolution to exploit the advantages of these two paradigms. The experimental results demonstrate our method's superiority compared to other state-of-the-art methods. The hybrid paradigm seems to be most appropriate to the medical image segmentation field. The ablation experiments also demonstrate that the proposed hybrid mechanism can effectively balance large receptive fields with local inductive biases, resulting in highly accurate segmentation results, especially in capturing details. Our code is available at https://github.com/qingze-bai/VSmTrans.

Keyphrases

deep learning
convolutional neural network
healthcare
working memory
machine learning
high resolution
mass spectrometry
network analysis