跳到主要导航 跳到搜索 跳到主要内容

A Method for Generating Adversarial Examples Based on Interpretable Information

  • Yuntian Gao
  • , Jinlun Li
  • , Chang Liu
  • , Ce Yang
  • , Yuxuan Run
  • , Changdi Zhao
  • , Yangyang Sun
  • , Xiaobin Li*
  • , Dezhen Yang
  • *此作品的通讯作者
  • Beihang University
  • Beijing Institute of Control and Electronic Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In recent years, machine learning (ML) technology, particularly deep neural networks (DNNs), has experienced rapid development and widespread application across various fields due to their superior performance. However, systems based on deep learning are vulnerable to adversarial attacks, where images with added adversarial perturbations can cause deep learning models to produce incorrect output predictions. This undermines the stability of neural network systems and achieves the goal of illegal attacks. Adversarial examples are a crucial means of evaluating the robustness of deep neural networks and revealing their potential security vulnerabilities. This paper addresses the issue of poor interpretability in adversarial example generation methods by proposing a method for adversarial attacks based on interpretable information. The method generates feature heatmaps by extracting interpretable information from training samples, visually representing the importance of different regions of the target. It constructs a heatmap-guided mechanism to generate adversarial patches, which are then directed to attack critical positions on the target to enhance attack precision, resulting in the final adversarial examples. Experimental results demonstrate that the proposed method generates adversarial examples with better attack performance compared to mainstream methods, outperforming existing methods in terms of both attack effectiveness and robustness.

源语言英语
主期刊名Proceedings - 2024 15th International Conference on Reliability, Maintenance and Safety, ICRMS 2024
出版商Institute of Electrical and Electronics Engineers Inc.
1074-1080
页数7
ISBN(电子版)9798331529116
DOI
出版状态已出版 - 2024
活动15th International Conference on Reliability, Maintenance and Safety, ICRMS 2024 - Gulin, 中国
期限: 31 7月 20242 8月 2024

出版系列

姓名Proceedings - 2024 15th International Conference on Reliability, Maintenance and Safety, ICRMS 2024

会议

会议15th International Conference on Reliability, Maintenance and Safety, ICRMS 2024
国家/地区中国
Gulin
时期31/07/242/08/24

指纹

探究 'A Method for Generating Adversarial Examples Based on Interpretable Information' 的科研主题。它们共同构成独一无二的指纹。

引用此