跳到主要导航 跳到搜索 跳到主要内容

Improving Robust Fairness via Balance Adversarial Training

  • Chunyu Sun
  • , Chenye Xu
  • , Chengyuan Yao
  • , Siyuan Liang
  • , Yichao Wu
  • , Ding Liang
  • , Xianglong Liu
  • , Aishan Liu*
  • *此作品的通讯作者
  • SenseTime Group Limited
  • CAS - Institute of Information Engineering
  • Zhongguancun Laboratory
  • Institute of Dataspace

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Adversarial training (AT) methods are effective against adversarial attacks, yet they introduce severe disparity of accuracy and robustness between different classes, known as the robust fairness problem. Previously proposed Fair Robust Learning (FRL) adaptively reweights different classes to improve fairness. However, the performance of the better-performed classes decreases, leading to a strong performance drop. In this paper, we observed two unfair phenomena during adversarial training: different difficulties in generating adversarial examples from each class (source-class fairness) and disparate target class tendencies when generating adversarial examples (target-class fairness). From the observations, we propose Balance Adversarial Training (BAT) to address the robust fairness problem. Regarding source-class fairness, we adjust the attack strength and difficulties of each class to generate samples near the decision boundary for easier and fairer model learning; considering target-class fairness, by introducing a uniform distribution constraint, we encourage the adversarial example generation process for each class with a fair tendency. Extensive experiments conducted on multiple datasets (CIFAR-10, CIFAR-100, and ImageNette) demonstrate that our BAT can significantly outperform other baselines in mitigating the robust fairness problem (+5-10% on the worst class accuracy)(Our codes can be found at https://github.com/silvercherry/Improving-Robust-Fairness-via-Balance-Adversarial-Training).

源语言英语
主期刊名AAAI-23 Special Tracks
编辑Brian Williams, Yiling Chen, Jennifer Neville
出版商AAAI press
15161-15169
页数9
ISBN(电子版)9781577358800
DOI
出版状态已出版 - 27 6月 2023
活动37th AAAI Conference on Artificial Intelligence, AAAI 2023 - Washington, 美国
期限: 7 2月 202314 2月 2023

出版系列

姓名Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
37

会议

会议37th AAAI Conference on Artificial Intelligence, AAAI 2023
国家/地区美国
Washington
时期7/02/2314/02/23

指纹

探究 'Improving Robust Fairness via Balance Adversarial Training' 的科研主题。它们共同构成独一无二的指纹。

引用此