Skip to main navigation Skip to search Skip to main content

Diving into Continual Ultra-fine-grained Visual Categorization

  • Pengcheng Zhang
  • , Xiaohan Yu*
  • , Xiao Bai
  • , Jin Zheng
  • , Xiaoyu Wu
  • , Yongsheng Gao
  • *Corresponding author for this work
  • Beihang University
  • Griffith University Queensland

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recent advance in ultra-fine-grained visual categorization (ultra-FGVC) has significantly boosted the capability of deep neural networks for ultra-FGVC tasks. However, building models for continually learning to recognize increasing ultra-fine-grained categories is still under-explored. This limits the application of ultra-FGVC techniques in real-world production. To this, we take the first attempt for continual ultra-FGVC. By evaluating existing continual learning methods on the constructed continual ultra-FGVC benchmark, we observe that the main bottleneck lies in the limited model plasticity for incrementally adapting to new tasks. This can be caused by excessive anti-forgetting constraints as the difficult ultra-FGVC task requires substantial update of parameters, and over-fitting on early tasks given that the ultra-fine-grained categories are with very few training samples. To tackle these problems, we propose a joint self-supervised learning and prompting model. The prompt-based continual learning framework offers proper anti-forgetting operation by fixed pretrained vision transformer and adaptive prompt selection. By jointly optimizing the learnable prompts with an adversarial self-supervised loss, the over-fitting on each continual learning task is mitigated. Extensive experiments demonstrate that the proposed method outperforms existing continual learning methods on the challenging continual ultra-FGVC problem.

Original languageEnglish
Title of host publication2023 International Conference on Digital Image Computing
Subtitle of host publicationTechniques and Applications, DICTA 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages113-120
Number of pages8
ISBN (Electronic)9798350382204
DOIs
StatePublished - 2023
Event2023 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2023 - Port Macquarie, Australia
Duration: 28 Nov 20231 Dec 2023

Publication series

Name2023 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2023

Conference

Conference2023 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2023
Country/TerritoryAustralia
CityPort Macquarie
Period28/11/231/12/23

Keywords

  • continual learning
  • prompt learning
  • ultra-fine-grained visual classification

Fingerprint

Dive into the research topics of 'Diving into Continual Ultra-fine-grained Visual Categorization'. Together they form a unique fingerprint.

Cite this