跳到主要导航 跳到搜索 跳到主要内容

Steering control of payoff-maximizing players in adaptive learning dynamics

  • Beijing University of Posts and Telecommunications
  • Dartmouth College

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Evolutionary game theory provides a mathematical foundation for cross-disciplinary fertilization, especially for integrating ideas from artificial intelligence and game theory. Such integration offers a transparent and rigorous approach to complex decision-making problems in a variety of important contexts, ranging from evolutionary computation to machine behavior. Despite the astronomically huge individual behavioral strategy space for interactions in the iterated Prisoner's Dilemma (IPD) games, the so-called Zero-Determinant (ZD) strategies is a set of rather simple memory-one strategies yet can unilaterally set a linear payoff relationship between themselves and their opponent. Although the witting of ZD strategies gives players an upper hand in the IPD games, we find and characterize unbending strategies that can force ZD players to be fair in their own interest. Moreover, our analysis reveals the ubiquity of unbending properties in common IPD strategies which are previously overlooked. In this work, we demonstrate the important steering role of unbending strategies in fostering fairness and cooperation in pairwise interactions. Our results will help bring a new perspective by means of combining game theory and multi-agent learning systems for optimizing winning strategies that are robust to noises, errors, and deceptions in non-zero-sum games.

源语言英语
主期刊名Proceedings of the 35th Chinese Control and Decision Conference, CCDC 2023
出版商Institute of Electrical and Electronics Engineers Inc.
1487-1494
页数8
ISBN(电子版)9798350334722
DOI
出版状态已出版 - 2023
已对外发布
活动35th Chinese Control and Decision Conference, CCDC 2023 - Yichang, 中国
期限: 20 5月 202322 5月 2023

出版系列

姓名Proceedings of the 35th Chinese Control and Decision Conference, CCDC 2023

会议

会议35th Chinese Control and Decision Conference, CCDC 2023
国家/地区中国
Yichang
时期20/05/2322/05/23

指纹

探究 'Steering control of payoff-maximizing players in adaptive learning dynamics' 的科研主题。它们共同构成独一无二的指纹。

引用此