Skip to main navigation Skip to search Skip to main content

Adaptive attitude maneuver control of a rigid-flexible satellite based on deep reinforcement learning

  • Liang Sun*
  • , Zelin Zhao
  • , Xurui Zhao
  • , Yu Liu
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, the attitude maneuver system is studied for a rigid-flexible satellite with a large fixed space net. The rigid-flexible satellite has significant advantages in deep space exploration, space-based power generation and space debris removal. The attitude maneuver model is formulated as a highly nonlinear and coupled system subject to parametric perturbations, external disturbances, flexible vibration and input saturation. To realize large-angle and rapid attitude maneuver control, a dual-loop adaptive attitude maneuver controller based on deep reinforcement learning (DARL) is designed, which consists of a robust observer-based compensator, a dual-loop feedback controller and an adaptive parameter regulator. The robust observer-based compensator is designed to compensate for uncertainties from multiple sources within a finite time. And the dual-loop feedback controller that accounts for input saturation is proposed for large-angle and rapid attitude maneuver. To further enhance the adaptability and robustness of the control system, an adaptive parameter regulator utilizing deep reinforcement learning is designed, which adopts a structured staged training strategy in the training process. Compared with traditional control algorithms, the simulation results demonstrate that the proposed controller not only achieves the high-precision attitude maneuver control, but also suppresses the vibration of the flexible space net effectively.

Original languageEnglish
JournalIEEE Transactions on Aerospace and Electronic Systems
DOIs
StateAccepted/In press - 2026
Externally publishedYes

Fingerprint

Dive into the research topics of 'Adaptive attitude maneuver control of a rigid-flexible satellite based on deep reinforcement learning'. Together they form a unique fingerprint.

Cite this