Robust Incremental Learning of Approximate Dynamic Programming for Nonlinear Terminal Guidance

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Nonlinear optimal guidance problems with terminal constraints are often analytically intractable, and approximate solutions based on reinforcement learning or approximate dynamic programming generally fail to provide stability guarantees due to the presence of inherent approximation errors in neural networks. This paper proposes a robust incremental policy iteration algorithm for nonlinear optimal guidance problems. First, the incremental guidance problem is defined and an incremental policy iteration algorithm is designed to mitigate the initial instability of the classical policy iteration. Then, the boundary of the incremental guidance command is determined by integrating the Lyapunov stability theory into the policy improvement step, which ensures that the entire command is theoretically stable. Simulation results of a specific impact-angle-constrained guidance problem verify advantages of the developed method on efficiency, stability, and optimality.

Original languageEnglish
Title of host publicationAdvances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 16
EditorsLiang Yan, Haibin Duan, Yimin Deng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages439-448
Number of pages10
ISBN (Print)9789819622597
DOIs
StatePublished - 2025
EventInternational Conference on Guidance, Navigation and Control, ICGNC 2024 - Changsha, China
Duration: 9 Aug 202411 Aug 2024

Publication series

NameLecture Notes in Electrical Engineering
Volume1352 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceInternational Conference on Guidance, Navigation and Control, ICGNC 2024
Country/TerritoryChina
CityChangsha
Period9/08/2411/08/24

Keywords

  • Lyapunov stability
  • approximate dynamic programming
  • neural network
  • optimal guidance

Fingerprint

Dive into the research topics of 'Robust Incremental Learning of Approximate Dynamic Programming for Nonlinear Terminal Guidance'. Together they form a unique fingerprint.

Cite this