跳到主要导航 跳到搜索 跳到主要内容

Make Baseline Model Stronger: Embedded Knowledge Distillation in Weight-Sharing Based Ensemble Network

  • Beihang University

科研成果: 会议稿件论文同行评审

摘要

Recently, many notable convolutional neural networks have powerful performance with compact and efficient structure. To further pursue performance improvement, previous methods either introduce more computation or design complex modules. In this paper, we propose an elegant weight-sharing based ensemble network embedded knowledge distillation (EKD-FWSNet) to enhance the generalization ability of baseline models with no increase of computation and complex modules. Specifically, we first design an auxiliary branch alongside with baseline model, then set branch points and shortcut connections between two branches to construct different forward paths. In this way, we form a weight-sharing ensemble network with multiple output predictions. Furthermore, we integrate the information from diverse posterior probabilities and intermediate feature maps, which are then transferred to baseline model through knowledge distillation strategy. Extensive image classification experiments on CIFAR-10/100 and tiny-ImageNet datasets demonstrate that our proposed EKD-FWSNet can help numerous baseline models improve the accuracy by large margin (sometimes more than 4%). We also conduct extended experiments on remote sensing datasets (AID, NWPU-RESISC45, UC-Merced) and achieve state-of-the-art results.

源语言英语
出版状态已出版 - 2021
活动32nd British Machine Vision Conference, BMVC 2021 - Virtual, Online
期限: 22 11月 202125 11月 2021

会议

会议32nd British Machine Vision Conference, BMVC 2021
Virtual, Online
时期22/11/2125/11/21

指纹

探究 'Make Baseline Model Stronger: Embedded Knowledge Distillation in Weight-Sharing Based Ensemble Network' 的科研主题。它们共同构成独一无二的指纹。

引用此