跳到主要导航 跳到搜索 跳到主要内容

Imitation Learning of Linear Noncooperative Multiagent Games: Employing Stability and Equilibrium Constraints

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In this article, imitation learning (IL) methods with stability and equilibrium constraints are proposed for fitting policies of multiagent systems (MASs) from state-control input demonstrations. The collaborative control problem of the MAS is described by a linear quadratic differential game (LQDG). First, the standard IL method for policy fitting is introduced by minimizing a loss function along with a regularization function. Considering the stability requirement of the MAS, a stability constraint is added to the standard IL method. Furthermore, we also incorporate an equilibrium constraint into the standard IL method that requires the control policies to be the equilibrium solution for some LQDG problem. The proposed IL methods with stability and equilibrium constraints require solving an optimization problem with convex cost and bilinear constraints, which can be solved by using the alternating direction method of multipliers (ADMM). Finally, an illustrative example is provided to demonstrate the effectiveness of the proposed methods.

源语言英语
主期刊名Intelligent Robotics and Applications - 17th International Conference, ICIRA 2024, Proceedings
编辑Xuguang Lan, Xuesong Mei, Caigui Jiang, Fei Zhao, Zhiqiang Tian
出版商Springer Science and Business Media Deutschland GmbH
449-461
页数13
ISBN(印刷版)9789819607976
DOI
出版状态已出版 - 2025
活动17th International Conference on Intelligent Robotics and Applications, ICIRA 2024 - Xi'an, 中国
期限: 31 7月 20242 8月 2024

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
15204 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议17th International Conference on Intelligent Robotics and Applications, ICIRA 2024
国家/地区中国
Xi'an
时期31/07/242/08/24

指纹

探究 'Imitation Learning of Linear Noncooperative Multiagent Games: Employing Stability and Equilibrium Constraints' 的科研主题。它们共同构成独一无二的指纹。

引用此