Skip to main navigation Skip to search Skip to main content

Range-Aware Impact Angle Guidance Law with Deep Reinforcement Meta-Learning

  • Chen Liang*
  • , Weihong Wang
  • , Zhenghua Liu
  • , Chao Lai
  • , Sen Wang
  • *Corresponding author for this work
  • Beihang University
  • Institute of China North Industries Group Corporation

Research output: Contribution to journalArticlepeer-review

Abstract

In this article, a new guidance law is proposed for impact angle constrained missile with time-varying velocity against a maneuvering target. The proposed guidance law is based on model-based deep reinforcement learning (RL) technique where a deep neural network is trained to be a predictive model used in model predictive path integral (MPPI) control. Tube-MPPI, a robust approach utilizing ancillary controller for disturbance rejection, is introduced in guidance law design in this work to deal with the MPPI degradation of robustness when the deep predictive model differs with actual environment. To further improve the performance, meta-learning is utilized to enable the deep neural dynamics adapt to environment changes online. With this approach the model mismatch of the nominal controller is reduced to improve tube-MPPI performance. Furthermore, a range-aware hyperbolic function is proposed as an adaptive function in the MPPI performance index design. Thus, reduced initial acceleration command and increased terminal velocity benefit guidance performance. Numerical simulations under various conditions demonstrate the effectiveness of proposed guidance law.

Original languageEnglish
Article number9170500
Pages (from-to)152093-152104
Number of pages12
JournalIEEE Access
Volume8
DOIs
StatePublished - 2020

Keywords

  • Missile guidance
  • deep reinforcement learning
  • impact angle constraint
  • meta-learning
  • tube model predictive control

Fingerprint

Dive into the research topics of 'Range-Aware Impact Angle Guidance Law with Deep Reinforcement Meta-Learning'. Together they form a unique fingerprint.

Cite this