跳到主要导航 跳到搜索 跳到主要内容

VMSIS: A Pre-trained Vision Transformer with Mamba Decoder for Surgical Instrument Segmentation

  • Yuechen Tao*
  • , Xiaobo Zhu
  • , Shiwei Wu
  • , He Sun
  • , Jiangang Liu
  • , Yu An
  • , Jie Tian
  • , Zhenyu Liu
  • *此作品的通讯作者
  • CAS - Institute of Automation
  • University of Chinese Academy of Sciences
  • Nanjing University of Science and Technology
  • Beihang University
  • National Key Laboratory of Kidney Diseases

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Accurate surgical instrument segmentation plays a vital role in robot assisted surgery. We present VMSIS, a hybrid architecture that combines the visual representation capabilities of self-supervised DINOv2 with the efficient sequence modeling of Mamba for surgical instrument segmentation. Our approach trained DINOv2 backbone with over 900,000 frames of RGB surgical videos and introduces a Mamba-based decoder that effectively captures temporal dependencies in surgical video sequences with backbone frozen. By processing 10 consecutive frames, our model achieves accurate instrument segmentation while maintaining temporal consistency. Experiments on 4 reorganized public datasets demonstrate the effectiveness of our approach, achieving competitive results with fewer trainable parameters compared to traditional methods.

源语言英语
主期刊名2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2025 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331586188
DOI
出版状态已出版 - 2025
活动47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2025 - Copenhagen, 丹麦
期限: 14 7月 202518 7月 2025

出版系列

姓名Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
ISSN(印刷版)1557-170X

会议

会议47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2025
国家/地区丹麦
Copenhagen
时期14/07/2518/07/25

指纹

探究 'VMSIS: A Pre-trained Vision Transformer with Mamba Decoder for Surgical Instrument Segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此