跳到主要导航 跳到搜索 跳到主要内容

HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors

  • Luting Wang
  • , Xiaojie Li
  • , Yue Liao*
  • , Zeren Jiang
  • , Jianlong Wu
  • , Fei Wang
  • , Chen Qian
  • , Si Liu
  • *此作品的通讯作者
  • Beihang University
  • SenseTime Group Limited
  • Swiss Federal Institute of Technology Zurich
  • Shandong University
  • University of Science and Technology of China

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Conventional knowledge distillation (KD) methods for object detection mainly concentrate on homogeneous teacher-student detectors. However, the design of a lightweight detector for deployment is often significantly different from a high-capacity detector. Thus, we investigate KD among heterogeneous teacher-student pairs for a wide application. We observe that the core difficulty for heterogeneous KD (hetero-KD) is the significant semantic gap between the backbone features of heterogeneous detectors due to the different optimization manners. Conventional homogeneous KD (homo-KD) methods suffer from such a gap and are hard to directly obtain satisfactory performance for hetero-KD. In this paper, we propose the HEtero-Assists Distillation (HEAD) framework, leveraging heterogeneous detection heads as assistants to guide the optimization of the student detector to reduce this gap. In HEAD, the assistant is an additional detection head with the architecture homogeneous to the teacher head attached to the student backbone. Thus, a hetero-KD is transformed into a homo-KD, allowing efficient knowledge transfer from the teacher to the student. Moreover, we extend HEAD into a Teacher-Free HEAD (TF-HEAD) framework when a well-trained teacher detector is unavailable. Our method has achieved significant improvement compared to current detection KD methods. For example, on the MS-COCO dataset, TF-HEAD helps R18 RetinaNet achieve 33.9 mAP (+ 2.2 ), while HEAD further pushes the limit to 36.2 mAP (+ 4.5 ).

源语言英语
主期刊名Computer Vision – ECCV 2022 - 17th European Conference, Proceedings
编辑Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
出版商Springer Science and Business Media Deutschland GmbH
314-331
页数18
ISBN(印刷版)9783031200762
DOI
出版状态已出版 - 2022
活动17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, 以色列
期限: 23 10月 202227 10月 2022

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13669 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议17th European Conference on Computer Vision, ECCV 2022
国家/地区以色列
Tel Aviv
时期23/10/2227/10/22

指纹

探究 'HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors' 的科研主题。它们共同构成独一无二的指纹。

引用此