TY - GEN
T1 - An application of continuous deep reinforcement learning approach to pursuit-evasion differential game
AU - Wang, Maolin
AU - Wang, Lixin
AU - Yue, Ting
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/3
Y1 - 2019/3
N2 - Pursuit-evasion differential game is a classic decision-making process in continuous domain. Most recently, the reinforcement learning (RL) technique has greatly advanced the research in decision-making field. In this paper, the dynamic model of the game is described and the optimization problem of the purser in the game is addressed. To learn the control strategy with self-learning, reinforcement learning is considered. An actor-critic based, model-free, end-to-end approach Deep Deterministic Policy Gradient (DDPG) Algorithm is applied to train the pursuer. In the first training phase the pursuer is trained only with a given evader's control strategy. In the second training phase, the pursuer and evader are trained simultaneously without any expert knowledge given in advance. The result shows that the pursuer and the evader can learn the control strategy during the training phase.
AB - Pursuit-evasion differential game is a classic decision-making process in continuous domain. Most recently, the reinforcement learning (RL) technique has greatly advanced the research in decision-making field. In this paper, the dynamic model of the game is described and the optimization problem of the purser in the game is addressed. To learn the control strategy with self-learning, reinforcement learning is considered. An actor-critic based, model-free, end-to-end approach Deep Deterministic Policy Gradient (DDPG) Algorithm is applied to train the pursuer. In the first training phase the pursuer is trained only with a given evader's control strategy. In the second training phase, the pursuer and evader are trained simultaneously without any expert knowledge given in advance. The result shows that the pursuer and the evader can learn the control strategy during the training phase.
KW - DDPG
KW - Differential Game
KW - Reinforcement Learning
KW - Self-Learning
UR - https://www.scopus.com/pages/publications/85067892778
U2 - 10.1109/ITNEC.2019.8729310
DO - 10.1109/ITNEC.2019.8729310
M3 - 会议稿件
AN - SCOPUS:85067892778
T3 - Proceedings of 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019
SP - 1150
EP - 1156
BT - Proceedings of 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019
A2 - Xu, Bing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019
Y2 - 15 March 2019 through 17 March 2019
ER -