Venue
International Conference on Medical Image Computing and Computer-Assisted Intervention
Domain
Medical image analysis
The Transformer architecture has demonstrated remarkable results in 3D medical image segmentation due to its capability of modeling global relationships.However, it poses a significant computational burden when processing high-dimensional medical images.Mamba, as a State Space Model (SSM), has recently emerged as a notable approach for modeling long-range dependencies in sequential data, and has excelled in the field of natural language processing with its remarkable memory efficiency and computational speed.Inspired by this, we devise SegMamba, a novel 3D medical image Segmentation Mamba model, to effectively capture long-range dependencies within whole-volume features at every scale.Our SegMamba outperforms Transformer-based methods in wholevolume feature modeling, maintaining high efficiency even at a resolution of 64 × 64 × 64, where the sequential length is approximately 260k.Moreover, we collect and annotate a novel large-scale dataset (named CRC-500) to facilitate benchmarking evaluation in 3D colorectal cancer (CRC) segmentation.Experimental results on our CRC-500 and two public benchmark datasets further demonstrate the effectiveness and universality of our method.The code for SegMamba is publicly available at: https://github.com/ge-xing/SegMamba.
This paper presents SegMamba, a novel approach for 3D medical image segmentation that efficiently captures long-range dependencies using a State Space Model (SSM) called Mamba. The method combines a tri-orientated Mamba (ToM) module with a gated spatial convolution (GSC) module and a feature-level uncertainty estimation (FUE) module. Extensive experiments demonstrate its superior performance on the novel CRC-500 dataset, which contains 500 annotated 3D colorectal cancer scans, as well as on two public benchmark datasets: BraTS2023 and AIIB2023. Results show that SegMamba outperforms existing methods in segmentation accuracy while maintaining high efficiency.
This paper employs the following methods:
- Mamba
- Tri-orientated Mamba (ToM)
- Gated Spatial Convolution (GSC)
- Feature-level Uncertainty Estimation (FUE)
- SegMamba
- U-Mamba
- UNETR
- SwinUNETR
- SwinUNETR-V2
- SegresNet
- UX-Net
- MedNeXt
The following datasets were used in this research:
- CRC-500
- BraTS2023
- AIIB2023
- Dice
- Hausdorff Distance (HD95)
- Intersection over Union (IoU)
- Detected Length Ratio (DLR)
- Detected Branch Ratio (DBR)
- SegMamba outperforms Transformer-based methods in whole-volume feature modeling
- Achieved highest Dice scores of 93.61%, 92.65%, and 87.71% on the BraTS2023 dataset
- Obtained best Dice and HD95 scores of 48.46% and 28.52 on the CRC-500 dataset
- Achieved highest IoU, DLR, and DBR scores on AIIB2023 dataset
- Number of GPUs: 4
- GPU Type: NVIDIA A100
SegMamba
long-range dependencies
3D medical image segmentation
state space model