TY - GEN
T1 - A Two-Stage Shake-Shake Network for Long-Tailed Recognition of SAR Aerial View Objects
AU - Li, Gongzhe
AU - Pan, Linpeng
AU - Qiu, Linwei
AU - Tan, Zhiwen
AU - Xie, Fengying
AU - Zhang, Haopeng
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Synthetic Aperture Radar (SAR) has received more attention due to its complementary superiority on capturing significant information in the remote sensing area. However, for an Aerial View Object Classification (AVOC) task, SAR images still suffer from the long-tailed distribution of the aerial view objects. This disparity limit the performance of classification methods, especially for the data-sensitive deep learning models. In this paper, we propose a two-stage shake-shake network to tackle the long-tailed learning problem. Specifically, it decouples the learning procedure into the representation learning stage and the classification learning stage. Moreover, we apply the test time augmentation (TTA) and the classification with alternating normalization (CAN) to improve the accuracy. In the PBVS1 2022 Multi-modal Aerial View Object Classification Challenge Track 1, our method achieves 21.82% and 27.97% accuracy in the development phase and testing phase respectively, which wins the top-tier among all the participants.
AB - Synthetic Aperture Radar (SAR) has received more attention due to its complementary superiority on capturing significant information in the remote sensing area. However, for an Aerial View Object Classification (AVOC) task, SAR images still suffer from the long-tailed distribution of the aerial view objects. This disparity limit the performance of classification methods, especially for the data-sensitive deep learning models. In this paper, we propose a two-stage shake-shake network to tackle the long-tailed learning problem. Specifically, it decouples the learning procedure into the representation learning stage and the classification learning stage. Moreover, we apply the test time augmentation (TTA) and the classification with alternating normalization (CAN) to improve the accuracy. In the PBVS1 2022 Multi-modal Aerial View Object Classification Challenge Track 1, our method achieves 21.82% and 27.97% accuracy in the development phase and testing phase respectively, which wins the top-tier among all the participants.
UR - https://www.scopus.com/pages/publications/85137792142
U2 - 10.1109/CVPRW56347.2022.00039
DO - 10.1109/CVPRW56347.2022.00039
M3 - 会议稿件
AN - SCOPUS:85137792142
T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
SP - 248
EP - 255
BT - Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2022
PB - IEEE Computer Society
T2 - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2022
Y2 - 19 June 2022 through 24 June 2022
ER -