TY - GEN
T1 - Degraded Image Semantic Segmentation Using Intra-image and Inter-image Contrastive Learning
AU - Dong, Lusen
AU - Li, Sichen
AU - Zheng, Jin
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Image degradation increases the difficulty of semantic segmentation, leading to the decreased accuracy. The gap of feature distribution between degraded images and clear images becomes a burden for accurate semantic segmentation. To solve this problem, we propose a contrastive learning method for degraded image semantic segmentation which is parallel to the semantic decoder of semantic segmentation network to utilize both intra-image and inter-image contrastive learning for feature representation. The former forces pixel embeddings and region embeddings belonging to the same semantic class to be more similar than the embeddings belonging to the different classes, while the latter utilizes memory banks to enhance the diversity of positive and negative samples. As a result, the intra-image and inter-image contrastive learning is able to effectively utilize multiple pixels, multiple regions, even multiple images to augment the degraded features and learn a more general representation of degraded features. Experiments demonstrate that our method achieves competitive performance on various benchmarks, including Cityscapes, ADE20K, FoggyDriving and our synthetic datasets. The code and synthetic datasets are open-sourced on Github: https://github.com/cocolord/degraded-image-seg.
AB - Image degradation increases the difficulty of semantic segmentation, leading to the decreased accuracy. The gap of feature distribution between degraded images and clear images becomes a burden for accurate semantic segmentation. To solve this problem, we propose a contrastive learning method for degraded image semantic segmentation which is parallel to the semantic decoder of semantic segmentation network to utilize both intra-image and inter-image contrastive learning for feature representation. The former forces pixel embeddings and region embeddings belonging to the same semantic class to be more similar than the embeddings belonging to the different classes, while the latter utilizes memory banks to enhance the diversity of positive and negative samples. As a result, the intra-image and inter-image contrastive learning is able to effectively utilize multiple pixels, multiple regions, even multiple images to augment the degraded features and learn a more general representation of degraded features. Experiments demonstrate that our method achieves competitive performance on various benchmarks, including Cityscapes, ADE20K, FoggyDriving and our synthetic datasets. The code and synthetic datasets are open-sourced on Github: https://github.com/cocolord/degraded-image-seg.
KW - contrastive learning
KW - degraded image
KW - semantic segmentation
UR - https://www.scopus.com/pages/publications/85189368269
U2 - 10.1109/CAC59555.2023.10452132
DO - 10.1109/CAC59555.2023.10452132
M3 - 会议稿件
AN - SCOPUS:85189368269
T3 - Proceedings - 2023 China Automation Congress, CAC 2023
SP - 8958
EP - 8964
BT - Proceedings - 2023 China Automation Congress, CAC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 China Automation Congress, CAC 2023
Y2 - 17 November 2023 through 19 November 2023
ER -