Research on Game-Playing Agents Based on Deep Reinforcement Learning

Research output: Contribution to journalArticlepeer-review

Abstract

Path planning is a key technology for the autonomous mobility of intelligent robots. However, there are few studies on how to carry out path planning in real time under the confrontation environment. Therefore, based on the deep deterministic policy gradient (DDPG) algorithm, this paper designs the reward function and adopts the incremental training and reward compensation method to improve the training efficiency and obtain the penetration strategy. The Monte Carlo experiment results show that the algorithm can effectively avoid static obstacles, break through the interception, and finally reach the target area. Moreover, the algorithm is also validated in the Webots simulator.

Original languageEnglish
Article number35
JournalRobotics
Volume11
Issue number2
DOIs
StatePublished - Apr 2022

Keywords

  • confrontation environment
  • deep deterministic policy gradient (DDPG)
  • deep reinforcement learning (DRL)
  • dynamic path planning

Fingerprint

Dive into the research topics of 'Research on Game-Playing Agents Based on Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this