TY - JOUR
T1 - Long-term cooperative path planning for stratospheric airships based on hierarchical multi-agent reinforcement learning
AU - Lv, Chao
AU - Zhu, Ming
AU - Guo, Xiao
AU - Ou, Jiajun
AU - Zheng, Baojin
AU - Sun, Liran
N1 - Publisher Copyright:
© 2026 Elsevier Ltd.
PY - 2026/4/1
Y1 - 2026/4/1
N2 - Stratospheric airships are increasingly used for long-term collaborative tasks, requiring efficient path planning for multiple airships. Traditional methods struggle with collaborative optimization and state space explosion in such tasks. To address these issues, this paper presents a hierarchical cooperative airship path planning (HiCAPP). This HiCAPP employs a dual-layer control architecture, with the high-level controller responsible for task allocation and the low-level controller concentrating on path planning. Experimental results show that HiCAPP outperforms traditional multi-agent reinforcement learning methods in two critical metrics: average remaining energy and average distance to the task center. Additionally, through experiments with varying numbers of agents, task durations, and disturbances, HiCAPP has demonstrated robustness and scalability. These results confirm its effectiveness in long-term cooperative monitoring tasks and highlight the advantages of hierarchical decision-making in multi-agent systems.
AB - Stratospheric airships are increasingly used for long-term collaborative tasks, requiring efficient path planning for multiple airships. Traditional methods struggle with collaborative optimization and state space explosion in such tasks. To address these issues, this paper presents a hierarchical cooperative airship path planning (HiCAPP). This HiCAPP employs a dual-layer control architecture, with the high-level controller responsible for task allocation and the low-level controller concentrating on path planning. Experimental results show that HiCAPP outperforms traditional multi-agent reinforcement learning methods in two critical metrics: average remaining energy and average distance to the task center. Additionally, through experiments with varying numbers of agents, task durations, and disturbances, HiCAPP has demonstrated robustness and scalability. These results confirm its effectiveness in long-term cooperative monitoring tasks and highlight the advantages of hierarchical decision-making in multi-agent systems.
KW - Hierarchical reinforcement learning
KW - Multi-agent
KW - Path planning
KW - Stratospheric airship
UR - https://www.scopus.com/pages/publications/105029626680
U2 - 10.1016/j.engappai.2026.114156
DO - 10.1016/j.engappai.2026.114156
M3 - 文章
AN - SCOPUS:105029626680
SN - 0952-1976
VL - 169
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 114156
ER -