TY - GEN
T1 - ACCURATE INSTANCE SEGMENTATION VIA COLLABORATIVE LEARNING
AU - Chen, Tianyou
AU - Hu, Xiaoguang
AU - Xiao, Jin
AU - Zhang, Guofeng
AU - Wang, Shaojie
N1 - Publisher Copyright:
© 2022 IEEE
PY - 2022
Y1 - 2022
N2 - We propose an instance segmentation model, named CoMask, that effectively alleviates the scale variation issue and addresses the precise localization. Specifically, we develop a multi-scale feature extraction module (MSFEM) to exploit multi-scale spatial cues. Besides, the channel attention mechanism is also adopted to further enhance the discriminating ability. Equipped with MSFEMs, multi-scale and multi-level features can be extracted to better characterize objects of various sizes and provide affluent high-level semantic information. For precise localization, we propose a collaborative learning framework to compute coarse masks and regresses position-sensitive dense offsets. The foreground confidence of each position is then assigned as the weight of the corresponding bounding box to calculate a weighted average. Thus, we can mitigate interference of background regions. After obtaining the final regressed bounding boxes, finer foreground masks can be calculated. We conduct experiments on MS COCO dataset. Experimental results validate that CoMask is competitive compared with state-of-the-art models.
AB - We propose an instance segmentation model, named CoMask, that effectively alleviates the scale variation issue and addresses the precise localization. Specifically, we develop a multi-scale feature extraction module (MSFEM) to exploit multi-scale spatial cues. Besides, the channel attention mechanism is also adopted to further enhance the discriminating ability. Equipped with MSFEMs, multi-scale and multi-level features can be extracted to better characterize objects of various sizes and provide affluent high-level semantic information. For precise localization, we propose a collaborative learning framework to compute coarse masks and regresses position-sensitive dense offsets. The foreground confidence of each position is then assigned as the weight of the corresponding bounding box to calculate a weighted average. Thus, we can mitigate interference of background regions. After obtaining the final regressed bounding boxes, finer foreground masks can be calculated. We conduct experiments on MS COCO dataset. Experimental results validate that CoMask is competitive compared with state-of-the-art models.
KW - Collaborative learning
KW - deep learning
KW - instance segmentation
KW - multi-scale feature extraction
UR - https://www.scopus.com/pages/publications/85131255600
U2 - 10.1109/ICASSP43922.2022.9747601
DO - 10.1109/ICASSP43922.2022.9747601
M3 - 会议稿件
AN - SCOPUS:85131255600
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 1880
EP - 1884
BT - 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022
Y2 - 22 May 2022 through 27 May 2022
ER -