跳到主要导航 跳到搜索 跳到主要内容

Real-time control for fuel-optimal Moon landing based on an interactive deep reinforcement learning algorithm

  • Lin Cheng
  • , Zhenbo Wang
  • , Fanghua Jiang*
  • *此作品的通讯作者
  • Tsinghua University
  • University of Tennessee

科研成果: 期刊稿件文章同行评审

摘要

In this study, a real-time optimal control approach is proposed using an interactive deep reinforcement learning algorithm for the Moon fuel-optimal landing problem. Considering the remote communication restrictions and environmental uncertainties, advanced landing control techniques are demanded to meet the high requirements of real-time performance and autonomy in the Moon landing missions. Deep reinforcement learning (DRL) algorithms have been recently developed for real-time optimal control but suffer the obstacles of slow convergence and difficult reward function design. To address these problems, a DRL algorithm is developed using an actor-indirect method architecture to achieve the optimal control of the Moon landing mission. In this DRL algorithm, an indirect method is employed to generate the optimal control actions for the deep neural network (DNN) learning, while the trained DNNs provide good initial guesses for the indirect method to promote the efficiency of training data generation. Through sufficient learning of the state-action relationship, the trained DNNs can approximate the optimal actions and steer the spacecraft to the target in real time. Additionally, a nonlinear feedback controller is developed to improve the terminal landing accuracy. Numerical simulations are given to verify the effectiveness of the proposed DRL algorithm and demonstrate the performance of the developed optimal landing controller.

源语言英语
页(从-至)375-386
页数12
期刊Astrodynamics
3
4
DOI
出版状态已出版 - 1 12月 2019
已对外发布

指纹

探究 'Real-time control for fuel-optimal Moon landing based on an interactive deep reinforcement learning algorithm' 的科研主题。它们共同构成独一无二的指纹。

引用此