Skip to main navigation Skip to search Skip to main content

Multi-Agent Reinforcement Learning with Optimal Equivalent Action of Neighborhood

  • Haixing Wang
  • , Yi Yang*
  • , Zhiwei Lin
  • , Tian Wang
  • *Corresponding author for this work
  • Henan Polytechnic University
  • Queen's University Belfast

Research output: Contribution to journalArticlepeer-review

Abstract

In a multi-agent system, the complex interaction among agents is one of the difficulties in making the optimal decision. This paper proposes a new action value function and a learning mechanism based on the optimal equivalent action of the neighborhood (OEAN) of a multi-agent system, in order to obtain the optimal decision from the agents. In the new Q-value function, the OEAN is used to depict the equivalent interaction between the current agent and the others. To deal with the non-stationary environment when agents act, the OEAN of the current agent is inferred simultaneously by the maximum a posteriori based on the hidden Markov random field model. The convergence property of the proposed methodology proved that the Q-value function can approach the global Nash equilibrium value using the iteration mechanism. The effectiveness of the method is verified by the case study of the top-coal caving. The experiment results show that the OEAN can reduce the complexity of the agents’ interaction description, meanwhile, the top-coal caving performance can be improved significantly.

Original languageEnglish
Article number99
JournalActuators
Volume11
Issue number4
DOIs
StatePublished - Apr 2022

Keywords

  • hidden Markov random field
  • multi-agent reinforcement learning
  • optimal decision
  • top-coal caving

Fingerprint

Dive into the research topics of 'Multi-Agent Reinforcement Learning with Optimal Equivalent Action of Neighborhood'. Together they form a unique fingerprint.

Cite this