TY - GEN
T1 - Swin Transformer Encoder and Local Attention Network Via Transfer Learning Strategy for Retinal Vessel Segmentation
AU - Yun, Jiacheng
AU - Li, Yang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The effective segmentation of retinal blood vessels is an important means for diagnosis of fundus diseases. However, recent segmentation methods often ignore the global dependence of retinal blood vessels and have poor segmentation capability to small blood vessels. In addition, these methods are trained on limited datasets, resulting in a weak generalization ability. To solve these problems, we propose a new method based on Swin Transformer encoder and local attention network. First, Swin Transformer is used as an encoder to extract the global dependence of retinal blood vessels which can improve the continuity of blood vessels in segmentation results. Then, a local attention network based on edge detection module and coordinate attention module is developed to enhance the segmentation ability of the method to small blood vessels. Finally, a transfer learning strategy is also utilized in this paper. The encoder of our proposed method is pre-trained on ADE20K to enhance the generalization ability. Our proposed method is evaluated on three retinal image datasets: DRIVE, STARE and CHASE-DB1. The results demonstrate that the method proposed in this paper has better performance than others.
AB - The effective segmentation of retinal blood vessels is an important means for diagnosis of fundus diseases. However, recent segmentation methods often ignore the global dependence of retinal blood vessels and have poor segmentation capability to small blood vessels. In addition, these methods are trained on limited datasets, resulting in a weak generalization ability. To solve these problems, we propose a new method based on Swin Transformer encoder and local attention network. First, Swin Transformer is used as an encoder to extract the global dependence of retinal blood vessels which can improve the continuity of blood vessels in segmentation results. Then, a local attention network based on edge detection module and coordinate attention module is developed to enhance the segmentation ability of the method to small blood vessels. Finally, a transfer learning strategy is also utilized in this paper. The encoder of our proposed method is pre-trained on ADE20K to enhance the generalization ability. Our proposed method is evaluated on three retinal image datasets: DRIVE, STARE and CHASE-DB1. The results demonstrate that the method proposed in this paper has better performance than others.
KW - Swin Transformer
KW - attention network
KW - retinal vessel segmentation
KW - transfer learning
UR - https://www.scopus.com/pages/publications/105003121905
U2 - 10.1109/IARCE64300.2024.00080
DO - 10.1109/IARCE64300.2024.00080
M3 - 会议稿件
AN - SCOPUS:105003121905
T3 - Proceedings - 2024 4th International Conference on Industrial Automation, Robotics and Control Engineering, IARCE 2024
SP - 399
EP - 404
BT - Proceedings - 2024 4th International Conference on Industrial Automation, Robotics and Control Engineering, IARCE 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Industrial Automation, Robotics and Control Engineering, IARCE 2024
Y2 - 15 November 2024 through 17 November 2024
ER -