Skip to main navigation Skip to search Skip to main content

Optimal Guidance for Reusable Launch Vehicle in Reentry Phase Based on Adaptive Dynamic Programming with Experience Replay

  • Beihang University
  • China State Shipbuilding Corporation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

An optimal tracking guidance method for Reusable Launch Vehicles (RLV) in the reentry phase is proposed based on improved Adaptive Dynamic Programming (ADP) with experience replay (ER). An actor-critic ADP with novel network weight tuning algorithms is developed. By introducing the experience replay technique, the persistence of the excitation requirement can be assessed while updating the critic neural network. Therefore, the generalization performance is improved for the ADP-based controller. Simulation of the RLV guidance system under model uncertainty is conducted. Better performances in terms of smoothness and accuracy are achieved compared with noise expansion, demonstrating the effectiveness and advantages of the proposed method for the optimal trajectory-tracking guidance of RLV.

Original languageEnglish
Title of host publicationProceedings of the 44th Chinese Control Conference, CCC 2025
EditorsJian Sun, Hongpeng Yin
PublisherIEEE Computer Society
Pages2808-2813
Number of pages6
ISBN (Electronic)9789887581611
DOIs
StatePublished - 2025
Event44th Chinese Control Conference, CCC 2025 - Chongqing, China
Duration: 28 Jul 202530 Jul 2025

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference44th Chinese Control Conference, CCC 2025
Country/TerritoryChina
CityChongqing
Period28/07/2530/07/25

Keywords

  • Adaptive dynamic programming
  • Experience Replay
  • Online actor-critic learning
  • Reusable Launch Vehicle
  • Trajectory-tracking guidance

Fingerprint

Dive into the research topics of 'Optimal Guidance for Reusable Launch Vehicle in Reentry Phase Based on Adaptive Dynamic Programming with Experience Replay'. Together they form a unique fingerprint.

Cite this