Skip to main navigation Skip to search Skip to main content

An application of continuous deep reinforcement learning approach to pursuit-evasion differential game

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Pursuit-evasion differential game is a classic decision-making process in continuous domain. Most recently, the reinforcement learning (RL) technique has greatly advanced the research in decision-making field. In this paper, the dynamic model of the game is described and the optimization problem of the purser in the game is addressed. To learn the control strategy with self-learning, reinforcement learning is considered. An actor-critic based, model-free, end-to-end approach Deep Deterministic Policy Gradient (DDPG) Algorithm is applied to train the pursuer. In the first training phase the pursuer is trained only with a given evader's control strategy. In the second training phase, the pursuer and evader are trained simultaneously without any expert knowledge given in advance. The result shows that the pursuer and the evader can learn the control strategy during the training phase.

Original languageEnglish
Title of host publicationProceedings of 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019
EditorsBing Xu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1150-1156
Number of pages7
ISBN (Electronic)9781538662434
DOIs
StatePublished - Mar 2019
Event3rd IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019 - Chengdu, China
Duration: 15 Mar 201917 Mar 2019

Publication series

NameProceedings of 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019

Conference

Conference3rd IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2019
Country/TerritoryChina
CityChengdu
Period15/03/1917/03/19

Keywords

  • DDPG
  • Differential Game
  • Reinforcement Learning
  • Self-Learning

Fingerprint

Dive into the research topics of 'An application of continuous deep reinforcement learning approach to pursuit-evasion differential game'. Together they form a unique fingerprint.

Cite this