Skip to main navigation Skip to search Skip to main content

Learning-Based 6-DOF Control for Autonomous Proximity Operations under Motion Constraints

  • Qinglei Hu*
  • , Haoyang Yang
  • , Hongyang Dong
  • , Xiaowei Zhao
  • *Corresponding author for this work
  • Beihang University
  • University of Warwick

Research output: Contribution to journalArticlepeer-review

Abstract

This article proposes areinforcement learning (RL)-based six-degree-of-freedom (6-DOF) control scheme for the final-phase proximity operations of spacecraft. The main novelty of the proposed method are from two aspects: 1) The closed-loop performance can be improved in real-time through the RL technique, achieving an online approximate optimal control subject to the full 6-DOF nonlinear dynamics of spacecraft; 2) nontrivial motion constraints of proximity operations are considered and strictly obeyed during the whole control process. As a stepping stone, the dual-quaternion formalism is employed to characterize the 6-DOF dynamics model and motion constraints. Then, an RL-based control scheme is developed under the dual-quaternion algebraic framework to approximate the optimal control solution subject to a cost function and a Hamilton-Jacobi-Bellman equation. In addition, a specially designed barrier function is embedded in the reward function to avoid motion constraint violations. The Lyapunov-based stability analysis guarantees the ultimate boundedness of state errors and the weight of NN estimation errors. Besides, we also show that a PD-like controller under dual-quaternion formulation can be employed as the initial control policy to trigger the online learning process. The boundedness of it is proved by a special Lyapunov strictification method. Simulation results of prototypical spacecraft missions with proximity operations are provided to illustrate the effectiveness of the proposed method.

Original languageEnglish
Pages (from-to)4097-4109
Number of pages13
JournalIEEE Transactions on Aerospace and Electronic Systems
Volume57
Issue number6
DOIs
StatePublished - 1 Dec 2021

Keywords

  • Approximate optimal control
  • Constrained 6-DOF control
  • Reinforcement learning (RL)
  • Spacecraft proximity operations

Fingerprint

Dive into the research topics of 'Learning-Based 6-DOF Control for Autonomous Proximity Operations under Motion Constraints'. Together they form a unique fingerprint.

Cite this