跳到主要导航 跳到搜索 跳到主要内容

LDP: Latent Diffusion-based Adversarial Purification towards Transformer-based Visual Encoders

  • Mengfan Li
  • , Xinxin Fan
  • , Quanliang Jing
  • , Shaoye Luo
  • , Yunfeng Lu
  • , Jingping Bi*
  • *此作品的通讯作者
  • CAS - Institute of Computing Technology
  • University of Chinese Academy of Sciences

科研成果: 期刊稿件会议文章同行评审

摘要

Adversarial purification has emerged as a critical defense mechanism against various adversarial attacks on deep neural networks, however, such category of diffusion-based purification in the pixel space confronts two challenges while applying into today's large vision transformers: i) the distributional divergence between adversarial and clean examples in high-dimensional image manifold; and ii) the prohibitive computational cost while processing high-resolution images. To address the predicament, we in this paper focus on the transformer-based visual encoders commonly employed in large Vision-Language Models (VLMs), and propose a novel Latent Diffusion-based Purification (LDP) mechanism through leveraging the latent space to pave the gap between adversarial distribution and clean distribution. Resorting to projecting adversarial input into a low-dimensional latent representation, our proposed LDP not only suppresses the off-manifold perturbations to achieve accelerated denoising, but also preserves critical semantic features by aligning input with the priori high-quality visual representations. Multi-facet experiments over both proactive robustness enhancement and post-attack purification demonstrate that our LDP has a superior performance in terms of effectiveness and efficiency.

指纹

探究 'LDP: Latent Diffusion-based Adversarial Purification towards Transformer-based Visual Encoders' 的科研主题。它们共同构成独一无二的指纹。

引用此