跳到主要导航 跳到搜索 跳到主要内容

An application of continuous deep reinforcement learning approach to pursuit-evasion differential game

  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Pursuit-evasion differential game is a classic decision-making process in continuous domain. Most recently, the reinforcement learning (RL) technique has greatly advanced the research in decision-making field. In this paper, the dynamic model of the game is described and the optimization problem of the purser in the game is addressed. To learn the control strategy with self-learning, reinforcement learning is considered. An actor-critic based, model-free, end-to-end approach Deep Deterministic Policy Gradient (DDPG) Algorithm is applied to train the pursuer. In the first training phase the pursuer is trained only with a given evader's control strategy. In the second training phase, the pursuer and evader are trained simultaneously without any expert knowledge given in advance. The result shows that the pursuer and the evader can learn the control strategy during the training phase.

源语言英语
主期刊名Proceedings of 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019
编辑Bing Xu
出版商Institute of Electrical and Electronics Engineers Inc.
1150-1156
页数7
ISBN(电子版)9781538662434
DOI
出版状态已出版 - 3月 2019
活动3rd IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019 - Chengdu, 中国
期限: 15 3月 201917 3月 2019

出版系列

姓名Proceedings of 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019

会议

会议3rd IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019
国家/地区中国
Chengdu
时期15/03/1917/03/19

指纹

探究 'An application of continuous deep reinforcement learning approach to pursuit-evasion differential game' 的科研主题。它们共同构成独一无二的指纹。

引用此