跳到主要导航 跳到搜索 跳到主要内容

Imperceptible Adversarial Attack with Multigranular Spatiotemporal Attention for Video Action Recognition

  • Guoming Wu
  • , Yangfan Xu
  • , Jun Li*
  • , Zhiping Shi*
  • , Xianglong Liu
  • *此作品的通讯作者
  • Capital Normal University

科研成果: 期刊稿件文章同行评审

摘要

In recent years, the application of video Internet of Things (IoT) in various cities and public places has brought unprecedented opportunities to the security field and achieved great success. However, the latest research shows that video recognition models are also vulnerable to adversarial examples, but adversarial examples based on physical attacks are easily detected by humans, making it difficult to pass human review. To address this problem, in this article, we propose to introduce a novel multigranular spatiotemporal attention network (MSANet), which can attack the video action recognition models imperceptibly. Specifically, to exploit video motion information more effectively and to reduce the detectability of attack perturbations, we design a multiplexed spatiotemporal attention module to select and enhance spatial regions and temporal frames at coarse-grained and fine-grained levels, respectively, thus maintaining a certain degree of smoothness while reducing the perturbation size and avoiding attacking overfitting. In addition, our proposed MSANet achieves imperceptible perturbations to video sequences through alternate iterative optimization combined with the PGD attack mechanism. extended experimental results on two different models (e.g., TDN and TSM) and two widely used data sets [HMDB-51 (Kuehne et al., 2011) and UCF-101 (Soomro et al., 2012)], compared to the state-of-the-art model, demonstrate the effectiveness of our devised video action recognition attack approach.

源语言英语
页(从-至)17785-17796
页数12
期刊IEEE Internet of Things Journal
10
20
DOI
出版状态已出版 - 15 10月 2023

指纹

探究 'Imperceptible Adversarial Attack with Multigranular Spatiotemporal Attention for Video Action Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此