TY - JOUR
T1 - Vector Quantization Based Query-Efficient Attack via Direct Preference Optimization
AU - Yang, Ruijie
AU - Guo, Yuanfang
AU - Zhou, Chao
AU - Li, Guohao
AU - Wang, Yunhong
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - This work studies black-box adversarial attacks against deep neural networks, where the attacker only has access to the query feedback from the target model. The current state-of-the-art (SOTA) query-efficient attacks usually combine transfer-based and query-based methods by utilizing the gradient or initializations of surrogate models. However, these strategies typically incur significant computational costs and require a large number of queries during the attack process. In this paper, we propose a novel query-efficient method for generating black-box adversarial perturbations, named Vector Quantization based Query-efficient Adversarial Perturbation generation (VQQAP). Specifically, we propose a Nucleus Sampling based Discretization Module (NSDM) to create diverse adversarial examples in the discrete latent space. To directly optimize the latent vector, we formulate the optimization problem as a direct preference optimization (DPO) problem, and iteratively solve this problem based on the target model feedback. Experimental evaluations demonstrate the effectiveness and efficiency of our method.
AB - This work studies black-box adversarial attacks against deep neural networks, where the attacker only has access to the query feedback from the target model. The current state-of-the-art (SOTA) query-efficient attacks usually combine transfer-based and query-based methods by utilizing the gradient or initializations of surrogate models. However, these strategies typically incur significant computational costs and require a large number of queries during the attack process. In this paper, we propose a novel query-efficient method for generating black-box adversarial perturbations, named Vector Quantization based Query-efficient Adversarial Perturbation generation (VQQAP). Specifically, we propose a Nucleus Sampling based Discretization Module (NSDM) to create diverse adversarial examples in the discrete latent space. To directly optimize the latent vector, we formulate the optimization problem as a direct preference optimization (DPO) problem, and iteratively solve this problem based on the target model feedback. Experimental evaluations demonstrate the effectiveness and efficiency of our method.
KW - Black-box adversarial attacks
KW - deep neural networks
KW - direct preference optimization
KW - query-based attacks
UR - https://www.scopus.com/pages/publications/105003032479
U2 - 10.1109/LSP.2025.3553791
DO - 10.1109/LSP.2025.3553791
M3 - 文章
AN - SCOPUS:105003032479
SN - 1070-9908
VL - 32
SP - 1550
EP - 1554
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
ER -