TY - GEN
T1 - Mamba-SaCIF Net
T2 - 7th International Conference on Image, Video Processing, and Artificial Intelligence, IVPAI 2025
AU - Khalid, Muhammad
AU - Li, Shuai
AU - Zada, Bakht
AU - Guo, Yuting
AU - Rehman, Khawar
AU - Amjad, Muhammad Asfand Yar
N1 - Publisher Copyright:
© COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.
PY - 2025/8/29
Y1 - 2025/8/29
N2 - In this study, we propose a hybrid architecture, Mamba SaCIF Net, for improving the segmentation of cardiac structures using MRI data. The architecture leverages a U-shaped encoder-decoder framework that integrates Mamba-based modules, Global Attention Module (GAM) and Channel Attention Modules (CAM), for enhanced channel and spatial information fusion. The input image is partitioned into non-overlapping patches and projected into a high-dimensional feature space using a linear embedding layer. The encoder employs hierarchical stages with CAM and GAM to capture both local and global dependencies, while the decoder utilizes skip connections and bilinear upsampling for precise reconstruction. The proposed Spatial and Channel Information Fusion (SaCIF) mechanism merges outputs from CAM and GAM using bidirectional aggregation and shared weights, ensuring computational efficiency and robust feature extraction. Experimental evaluations on benchmark cardiac MRI datasets demonstrate that our proposed model consistently outperforms state-of-the-art methods in segmentation accuracy, parameter efficiency, and computational complexity, making it a promising solution for clinical applications.
AB - In this study, we propose a hybrid architecture, Mamba SaCIF Net, for improving the segmentation of cardiac structures using MRI data. The architecture leverages a U-shaped encoder-decoder framework that integrates Mamba-based modules, Global Attention Module (GAM) and Channel Attention Modules (CAM), for enhanced channel and spatial information fusion. The input image is partitioned into non-overlapping patches and projected into a high-dimensional feature space using a linear embedding layer. The encoder employs hierarchical stages with CAM and GAM to capture both local and global dependencies, while the decoder utilizes skip connections and bilinear upsampling for precise reconstruction. The proposed Spatial and Channel Information Fusion (SaCIF) mechanism merges outputs from CAM and GAM using bidirectional aggregation and shared weights, ensuring computational efficiency and robust feature extraction. Experimental evaluations on benchmark cardiac MRI datasets demonstrate that our proposed model consistently outperforms state-of-the-art methods in segmentation accuracy, parameter efficiency, and computational complexity, making it a promising solution for clinical applications.
KW - Cardiac MRI segmentation
KW - Hybrid architecture
KW - Mamba-SaCIF Net
KW - Medical image analysis
KW - Spatial-channel fusion
UR - https://www.scopus.com/pages/publications/105026951015
U2 - 10.1117/12.3075010
DO - 10.1117/12.3075010
M3 - 会议稿件
AN - SCOPUS:105026951015
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Seventh International Conference on Image, Video Processing, and Artificial Intelligence, IVPAI 2025
A2 - Su, Ruidan
PB - SPIE
Y2 - 18 May 2025 through 20 May 2025
ER -