Skip to main navigation Skip to search Skip to main content

Hierarchical reinforcement learning guidance with threat avoidance

  • Bohao Li
  • , Yunjie Wu
  • , Guofei Li*
  • *Corresponding author for this work
  • Beihang University
  • Science and Technology on Aircraft Control Laboratory
  • Northwestern Polytechnical University Xian

Research output: Contribution to journalArticlepeer-review

Abstract

The guidance strategy is an extremely critical factor in determining the striking effect of the missile operation. A novel guidance law is presented by exploiting the deep reinforcement learning (DRL) with the hierarchical deep deterministic policy gradient (DDPG) algorithm. The reward functions are constructed to minimize the line-of-sight (LOS) angle rate and avoid the threat caused by the opposed obstacles. To attenuate the chattering of the acceleration, a hierarchical reinforcement learning structure and an improved reward function with action penalty are put forward. The simulation results validate that the missile under the proposed method can hit the target successfully and keep away from the threatened areas effectively.

Original languageEnglish
Pages (from-to)1173-1185
Number of pages13
JournalJournal of Systems Engineering and Electronics
Volume33
Issue number5
DOIs
StatePublished - 1 Oct 2022

Keywords

  • deep reinforcement learning (DRL)
  • guidance law
  • hierarchical reinforcement learning
  • threat avoidance

Fingerprint

Dive into the research topics of 'Hierarchical reinforcement learning guidance with threat avoidance'. Together they form a unique fingerprint.

Cite this