Abstract
This article proposes areinforcement learning (RL)-based six-degree-of-freedom (6-DOF) control scheme for the final-phase proximity operations of spacecraft. The main novelty of the proposed method are from two aspects: 1) The closed-loop performance can be improved in real-time through the RL technique, achieving an online approximate optimal control subject to the full 6-DOF nonlinear dynamics of spacecraft; 2) nontrivial motion constraints of proximity operations are considered and strictly obeyed during the whole control process. As a stepping stone, the dual-quaternion formalism is employed to characterize the 6-DOF dynamics model and motion constraints. Then, an RL-based control scheme is developed under the dual-quaternion algebraic framework to approximate the optimal control solution subject to a cost function and a Hamilton-Jacobi-Bellman equation. In addition, a specially designed barrier function is embedded in the reward function to avoid motion constraint violations. The Lyapunov-based stability analysis guarantees the ultimate boundedness of state errors and the weight of NN estimation errors. Besides, we also show that a PD-like controller under dual-quaternion formulation can be employed as the initial control policy to trigger the online learning process. The boundedness of it is proved by a special Lyapunov strictification method. Simulation results of prototypical spacecraft missions with proximity operations are provided to illustrate the effectiveness of the proposed method.
| Original language | English |
|---|---|
| Pages (from-to) | 4097-4109 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Aerospace and Electronic Systems |
| Volume | 57 |
| Issue number | 6 |
| DOIs | |
| State | Published - 1 Dec 2021 |
Keywords
- Approximate optimal control
- Constrained 6-DOF control
- Reinforcement learning (RL)
- Spacecraft proximity operations
Fingerprint
Dive into the research topics of 'Learning-Based 6-DOF Control for Autonomous Proximity Operations under Motion Constraints'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver