Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection

  • Luting Wang
  • Yi Liu
  • Penghui Du
  • Zihan Ding
  • Yue Liao*
  • Qiaosong Qi
  • Biaolong Chen
  • Si Liu
  • *Corresponding author for this work
  • Beihang University
  • Alibaba Group Holding Ltd.

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

Open-vocabulary object detection aims to provide object detectors trained on a fixed set of object categories with the generalizability to detect objects described by arbitrary text queries. Previous methods adopt knowledge distillation to extract knowledge from Pretrained Vision-and-Language Models (PVLMs) and transfer it to detectors. However, due to the non-adaptive proposal cropping and single-level feature mimicking processes, they suffer from information destruction during knowledge extraction and inefficient knowledge transfer. To remedy these limitations, we propose an Object-Aware Distillation Pyramid (OADP) framework, including an Object-Aware Knowledge Extraction (OAKE) module and a Distillation Pyramid (DP) mechanism. When extracting object knowledge from PVLMs, the former adaptively transforms object proposals and adopts object-aware mask attention to obtain precise and complete knowledge of objects. The latter introduces global and block distillation for more comprehensive knowledge transfer to compensate for the missing relation information in object distillation. Extensive experiments show that our method achieves significant improvement compared to current methods. Especially on the MS-COCO dataset, our OADP framework reaches 35.6 $\text{mAP}^{\text{N}}_{50}$, surpassing the current state-of-the-art method by 3.3 $\text{mAP}^{\text{N}}_{50}$. Code is released at https://github.com/LutingWang/OADP.
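
The multi-level transfer described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation (see the linked repository for that). It assumes that each level (object, block, global) contributes a feature-mimicking loss between detector ("student") embeddings and PVLM ("teacher") embeddings, and that the per-level losses are combined as a weighted sum. The L1 distance, the weights, and the names `l1_distance`, `level_loss`, and `pyramid_loss` are hypothetical choices made for illustration; the abstract does not specify them.

```python
from typing import Dict, List, Sequence

Vector = Sequence[float]


def l1_distance(student: Vector, teacher: Vector) -> float:
    """Mean absolute difference between two equal-length embeddings."""
    if len(student) != len(teacher):
        raise ValueError("embeddings must have the same length")
    return sum(abs(s - t) for s, t in zip(student, teacher)) / len(student)


def level_loss(students: List[Vector], teachers: List[Vector]) -> float:
    """Average L1 distance over all embedding pairs at one level."""
    if len(students) != len(teachers):
        raise ValueError("each student embedding needs one teacher embedding")
    if not students:
        return 0.0
    total = sum(l1_distance(s, t) for s, t in zip(students, teachers))
    return total / len(students)


def pyramid_loss(
    student: Dict[str, List[Vector]],
    teacher: Dict[str, List[Vector]],
    weights: Dict[str, float],
) -> float:
    """Weighted sum of per-level mimicking losses (assumed combination rule)."""
    return sum(
        weight * level_loss(student[level], teacher[level])
        for level, weight in weights.items()
    )


# Toy embeddings: one object proposal, two image blocks, one global feature.
student_feats = {
    "object": [[1.0, 2.0]],
    "block": [[0.0, 0.0], [1.0, 1.0]],
    "global": [[0.5, 0.5]],
}
teacher_feats = {
    "object": [[1.0, 4.0]],
    "block": [[0.0, 2.0], [1.0, 1.0]],
    "global": [[0.5, 0.5]],
}
level_weights = {"object": 1.0, "block": 1.0, "global": 1.0}

loss = pyramid_loss(student_feats, teacher_feats, level_weights)
```

In this toy setup the object level and one block contribute a nonzero distance, while the global level matches exactly. Single-level feature mimicking corresponds to keeping only the `"object"` entry in `level_weights`.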

Original language: English
Title of host publication: Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
Publisher: IEEE Computer Society
Pages: 11186-11196
Number of pages: 11
ISBN (electronic): 9798350301298
DOI
Publication status: Published - 2023
Event: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Vancouver, Canada
Duration: 18 Jun 2023 to 22 Jun 2023

Publication series

Name: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume: 2023-June
ISSN (print): 1063-6919

Conference

Conference: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
Country/Territory: Canada
City: Vancouver
Period: 18/06/23 to 22/06/23
