TY - GEN
T1 - Hint Pyramid Learning Via Salient Semantic Mining
AU - Wang, Jianyuan
AU - Liu, Xinyue
AU - Cao, Qianggang
AU - Leng, Biao
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/7/18
Y1 - 2021/7/18
N2 - Feature pyramid network (FPN) has dominated the task of object detection for many years with its congenital capability of semantic information interaction between different feature levels. Satisfactory performance can be easily achieved via the heavy backbones or tricky designs of anchors. However, the heavy structure brings about a huge increase of parameters and calculation. To address this challenge, this paper proposes a novel Hint Pyramid Learning(HPL) strategy to optimize a lightweight FPN-based detector such as one based on ResNet18. In HPL, the hint feature representation is learnt via salient semantic mining based on the teacher-student learning mechanism and self learning mechanism. Besides minimizing of the discrepancy of features belonging to the same feature pyramid layer, HPL also focuses on the relationships between feature pyramid layers. Unlike the popular mimicking algorithms such as hint learning and knowledge distillation which achieve comparable performance on classification tasks but failed to generalize to the detection tasks with FPN, HPL can be easily plugged into different FPN-based pipelines such as one-stage RetinaNet or two-stage Faster R-CNN and improves the performance straightforwardly. Without any meticulous designing, when the backbone of students is ResNet18 and the backbone of teachers is ResNet50, HPL effectively improves the performance by 2.1 AP for RetinaNet and 1.0 AP for Faster R-CNN on COCO benchmark.
AB - Feature pyramid network (FPN) has dominated the task of object detection for many years with its congenital capability of semantic information interaction between different feature levels. Satisfactory performance can be easily achieved via the heavy backbones or tricky designs of anchors. However, the heavy structure brings about a huge increase of parameters and calculation. To address this challenge, this paper proposes a novel Hint Pyramid Learning(HPL) strategy to optimize a lightweight FPN-based detector such as one based on ResNet18. In HPL, the hint feature representation is learnt via salient semantic mining based on the teacher-student learning mechanism and self learning mechanism. Besides minimizing of the discrepancy of features belonging to the same feature pyramid layer, HPL also focuses on the relationships between feature pyramid layers. Unlike the popular mimicking algorithms such as hint learning and knowledge distillation which achieve comparable performance on classification tasks but failed to generalize to the detection tasks with FPN, HPL can be easily plugged into different FPN-based pipelines such as one-stage RetinaNet or two-stage Faster R-CNN and improves the performance straightforwardly. Without any meticulous designing, when the backbone of students is ResNet18 and the backbone of teachers is ResNet50, HPL effectively improves the performance by 2.1 AP for RetinaNet and 1.0 AP for Faster R-CNN on COCO benchmark.
KW - Deep Learning
KW - Feature Pyramid Network
KW - Hint Learning
KW - Knowledge Distillation
KW - Object Detection
UR - https://www.scopus.com/pages/publications/85116405064
U2 - 10.1109/IJCNN52387.2021.9533835
DO - 10.1109/IJCNN52387.2021.9533835
M3 - 会议稿件
AN - SCOPUS:85116405064
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - IJCNN 2021 - International Joint Conference on Neural Networks, Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 International Joint Conference on Neural Networks, IJCNN 2021
Y2 - 18 July 2021 through 22 July 2021
ER -