跳到主要导航 跳到搜索 跳到主要内容

Feature-Based Local Ensemble Framework for Multi-Agent Reinforcement Learning

  • Beihang University
  • Zhongguancun Laboratory

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The use of centralized value networks is an important training method in multi-agent reinforcement learning (MARL). Existing methods usually utilize value decomposition to enable agents to achieve localized learning. However, these methods do not take into account sample efficiency, and the introduction of additional networks will also bring more computational costs. Therefore, it is necessary to introduce the ensemble idea of making full use of the replay buffer. The general ensemble learning approach is to introduce sub-models into the value network, and its application in multi-agent systems is not yet mature. In this paper, we firstly implement a multi-agent ensemble technique with locally shared parameters in the network by introducing a feature-based grouping mechanism. Secondly, we propose random and feature-based grouping principles and design a dimension reduction scheme for action-state space. Finally, our approach was evaluated on the benchmark platform, Multi-Agent Particle Environment (MPE), and compared with baseline algorithms. The results demonstrated that our method exhibited notable advantages in terms of achieving higher scores and converging rapidly.

源语言英语
主期刊名Proceedings - 2024 8th International Symposium on Computer Science and Intelligent Control, ISCSIC 2024
出版商Institute of Electrical and Electronics Engineers Inc.
255-260
页数6
ISBN(电子版)9798350380286
DOI
出版状态已出版 - 2024
活动8th International Symposium on Computer Science and Intelligent Control, ISCSIC 2024 - Zhengzhou, 中国
期限: 6 9月 20248 9月 2024

出版系列

姓名Proceedings - 2024 8th International Symposium on Computer Science and Intelligent Control, ISCSIC 2024

会议

会议8th International Symposium on Computer Science and Intelligent Control, ISCSIC 2024
国家/地区中国
Zhengzhou
时期6/09/248/09/24

指纹

探究 'Feature-Based Local Ensemble Framework for Multi-Agent Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此