跳到主要导航 跳到搜索 跳到主要内容

Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification

  • Yang Liu
  • , Lei Zhou
  • , Pengcheng Zhang
  • , Xiao Bai*
  • , Lin Gu
  • , Xiaohan Yu
  • , Jun Zhou
  • , Edwin R. Hancock
  • *此作品的通讯作者
  • Beihang University
  • RIKEN
  • The University of Tokyo
  • Griffith University Queensland
  • University of York

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Object categories are often grouped into a multi-granularity taxonomic hierarchy. Classifying objects at coarser-grained hierarchy requires global and common characteristics, while finer-grained hierarchy classification relies on local and discriminative features. Therefore, humans should also subconsciously focus on different object regions when classifying different hierarchies. This granularity-wise attention is confirmed by our collected human real-time gaze data on different hierarchy classifications. To leverage this mechanism, we propose a Cross-Hierarchical Region Feature (CHRF) learning framework. Specifically, we first design a region feature mining module that imitates humans to learn different granularity-wise attention regions with multi-grained classification tasks. To explore how human attention shifts from one hierarchy to another, we further present a cross-hierarchical orthogonal fusion module to enhance the region feature representation by blending the original feature and an orthogonal component extracted from adjacent hierarchies. Experiments on five hierarchical fine-grained datasets demonstrate the effectiveness of CHRF compared with the state-of-the-art methods. Ablation study and visualization results also consistently verify the advantages of our human attention-oriented modules. The code and dataset are available at https://github.com/visiondom/CHRF.

源语言英语
主期刊名Computer Vision – ECCV 2022 - 17th European Conference, Proceedings
编辑Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
出版商Springer Science and Business Media Deutschland GmbH
57-73
页数17
ISBN(印刷版)9783031200526
DOI
出版状态已出版 - 2022
活动17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, 以色列
期限: 23 10月 202227 10月 2022

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13684 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议17th European Conference on Computer Vision, ECCV 2022
国家/地区以色列
Tel Aviv
时期23/10/2227/10/22

指纹

探究 'Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification' 的科研主题。它们共同构成独一无二的指纹。

引用此