TY - JOUR
T1 - Activation Map-Based Visual Explanation for Remote Sensing Image Super-Resolution
AU - Yao, Xudong
AU - Zhang, Haopeng
AU - Xie, Fengying
AU - Cui, Yufu
AU - Shi, Zhenwei
AU - Jiang, Zhiguo
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2026
Y1 - 2026
N2 - Deep learning has enhanced image super-resolution (SR) performance, yet SR networks are often treated as closed boxes with limited explainability. Notably, remote sensing (RS) images possess explicit physical semantics, imposing stringent demands on the explainability of their SR methods. To address this, we propose AMVE-SR, a framework that systematically transfers activation map-based visual explanation (AMVE) from classification networks to SR networks. Distinct from classification-oriented methods (e.g., Grad-CAM) that struggle with shallow-layer visualization or computationally intensive perturbation-based approaches, AMVE-SR introduces a layer-wise recursive propagation mechanism. By exploiting interlayer correlations, it traces activation maps backward to the input, ensuring high-fidelity explanations even in shallow layers where conventional methods fail to transfer effectively. Furthermore, we compute gradient responses via textural discrepancy rather than raw pixel intensities, capturing high-frequency details (e.g., physical boundaries) essential for physical semantics. To address the lack of quantitative metrics, we propose average texture variation (ATV) by adapting the average drop concept in classification tasks. Validated by visual consistency, ATV serves as a novel fidelity metric for SR explanations. Experiments on the AID dataset using the EDSR model demonstrate that our method produces visually reliable explanation maps while achieving superior performance on the proposed ATV metric with high running efficiency. In general, this study extends AMVE from image classification to SR, thereby establishing a new paradigm to explain SR networks.
AB - Deep learning has enhanced image super-resolution (SR) performance, yet SR networks are often treated as closed boxes with limited explainability. Notably, remote sensing (RS) images possess explicit physical semantics, imposing stringent demands on the explainability of their SR methods. To address this, we propose AMVE-SR, a framework that systematically transfers activation map-based visual explanation (AMVE) from classification networks to SR networks. Distinct from classification-oriented methods (e.g., Grad-CAM) that struggle with shallow-layer visualization or computationally intensive perturbation-based approaches, AMVE-SR introduces a layer-wise recursive propagation mechanism. By exploiting interlayer correlations, it traces activation maps backward to the input, ensuring high-fidelity explanations even in shallow layers where conventional methods fail to transfer effectively. Furthermore, we compute gradient responses via textural discrepancy rather than raw pixel intensities, capturing high-frequency details (e.g., physical boundaries) essential for physical semantics. To address the lack of quantitative metrics, we propose average texture variation (ATV) by adapting the average drop concept in classification tasks. Validated by visual consistency, ATV serves as a novel fidelity metric for SR explanations. Experiments on the AID dataset using the EDSR model demonstrate that our method produces visually reliable explanation maps while achieving superior performance on the proposed ATV metric with high running efficiency. In general, this study extends AMVE from image classification to SR, thereby establishing a new paradigm to explain SR networks.
KW - Activation map-based visual explanation (AMVE)
KW - class activation maps (CAMs)
KW - remote sensing (RS) images
KW - super-resolution (SR)
UR - https://www.scopus.com/pages/publications/105028623830
U2 - 10.1109/LGRS.2026.3657185
DO - 10.1109/LGRS.2026.3657185
M3 - 文章
AN - SCOPUS:105028623830
SN - 1545-598X
VL - 23
JO - IEEE Geoscience and Remote Sensing Letters
JF - IEEE Geoscience and Remote Sensing Letters
M1 - 5000805
ER -