跳到主要导航 跳到搜索 跳到主要内容

Vector Quantization Based Query-Efficient Attack via Direct Preference Optimization

  • Beihang University

科研成果: 期刊稿件文章同行评审

摘要

This work studies black-box adversarial attacks against deep neural networks, where the attacker only has access to the query feedback from the target model. The current state-of-the-art (SOTA) query-efficient attacks usually combine transfer-based and query-based methods by utilizing the gradient or initializations of surrogate models. However, these strategies typically incur significant computational costs and require a large number of queries during the attack process. In this paper, we propose a novel query-efficient method for generating black-box adversarial perturbations, named Vector Quantization based Query-efficient Adversarial Perturbation generation (VQQAP). Specifically, we propose a Nucleus Sampling based Discretization Module (NSDM) to create diverse adversarial examples in the discrete latent space. To directly optimize the latent vector, we formulate the optimization problem as a direct preference optimization (DPO) problem, and iteratively solve this problem based on the target model feedback. Experimental evaluations demonstrate the effectiveness and efficiency of our method.

源语言英语
页(从-至)1550-1554
页数5
期刊IEEE Signal Processing Letters
32
DOI
出版状态已出版 - 2025

指纹

探究 'Vector Quantization Based Query-Efficient Attack via Direct Preference Optimization' 的科研主题。它们共同构成独一无二的指纹。

引用此