TY - GEN
T1 - Reinforcement Learning Driven Autonomous Active Debris Removal Strategy Based on Angles-Only Navigation
AU - Chen, Zheng
AU - Zhong, Rui
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - This article proposes a Reinforcement Learning (RL) driven autonomous Active Debris Removal (ADR) strategy based on Angles-Only Navigation (AON) theory. The policy network trained by the RL algorithm receives angles measurements as input, and outputs impulse information, thus combined the orbit determination solely based on angles measurements and autonomous approaching. To solve the low observability problem of AON, impulse maneuver is considered to gain observability. Thus, the network processes the coupled relationship of angles measurements and approaching maneuver, which balances the need to approach target and gain AON observability. This control method based on RL has the advantage of high autonomy and fast computation. Besides, the initial state of training is randomly decided, which effectively enhanced the generalization capabilities of the trained model, so the controller can handle more variable situations. Finally, simulation is performed to examine the performance of the policy network controller. The results showed great effectiveness and robustness in various environments.
AB - This article proposes a Reinforcement Learning (RL) driven autonomous Active Debris Removal (ADR) strategy based on Angles-Only Navigation (AON) theory. The policy network trained by the RL algorithm receives angles measurements as input, and outputs impulse information, thus combined the orbit determination solely based on angles measurements and autonomous approaching. To solve the low observability problem of AON, impulse maneuver is considered to gain observability. Thus, the network processes the coupled relationship of angles measurements and approaching maneuver, which balances the need to approach target and gain AON observability. This control method based on RL has the advantage of high autonomy and fast computation. Besides, the initial state of training is randomly decided, which effectively enhanced the generalization capabilities of the trained model, so the controller can handle more variable situations. Finally, simulation is performed to examine the performance of the policy network controller. The results showed great effectiveness and robustness in various environments.
KW - Active Debris Removal
KW - Angles-Only Navigation
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/86000442208
U2 - 10.1007/978-981-96-2252-8_22
DO - 10.1007/978-981-96-2252-8_22
M3 - 会议稿件
AN - SCOPUS:86000442208
SN - 9789819622511
T3 - Lecture Notes in Electrical Engineering
SP - 210
EP - 221
BT - Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control
A2 - Yan, Liang
A2 - Duan, Haibin
A2 - Deng, Yimin
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Guidance, Navigation and Control, ICGNC 2024
Y2 - 9 August 2024 through 11 August 2024
ER -