跳到主要导航 跳到搜索 跳到主要内容

DecomCAM: Advancing beyond saliency maps through decomposition and integration

  • Yuguang Yang
  • , Runtang Guo
  • , Sheng Wu
  • , Yimi Wang
  • , Linlin Yang
  • , Bo Fan*
  • , Jilong Zhong
  • , Juan Zhang
  • , Baochang Zhang
  • *此作品的通讯作者
  • Beihang University
  • Communication University of China
  • Academy of Military Medical Science China
  • Zhongguancun Laboratory

科研成果: 期刊稿件文章同行评审

摘要

Interpreting complex deep networks, notably pre-trained vision-language models (VLMs), is a formidable challenge. Current Class Activation Map (CAM) methods highlight regions revealing the model's decision-making basis but lack clear saliency maps and detailed interpretability. To bridge this gap, we propose DecomCAM, a novel decomposition-and-integration method that distills shared patterns from channel activation maps. Utilizing singular value decomposition, DecomCAM decomposes class-discriminative activation maps into orthogonal sub-saliency maps (OSSMs), which are then integrated together based on their contribution to the target concept. Extensive experiments on six benchmarks reveal that DecomCAM not only excels in locating accuracy but also achieves an optimizing balance between interpretability and computational efficiency. Further analysis unveils that OSSMs correlate with discernible object components, facilitating a granular understanding of the model's reasoning. This positions DecomCAM as a potential tool for fine-grained interpretation of advanced deep learning models. The code is available at https://github.com/CapricornGuang/DecomCAM.

源语言英语
文章编号127826
期刊Neurocomputing
610
DOI
出版状态已出版 - 28 12月 2024

指纹

探究 'DecomCAM: Advancing beyond saliency maps through decomposition and integration' 的科研主题。它们共同构成独一无二的指纹。

引用此