TY - GEN
T1 - Trajectory Planning for Cooperative Coverage Missions with Multiple Stratospheric Airships
AU - Luo, Qinchuan
AU - Gong, Weicheng
AU - Sun, Kangwen
AU - Lv, Hui
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - This paper presents a trajectory planning method for cooperative regional coverage missions involving multiple stratospheric airships under dynamic wind field conditions. We propose a De-Identified Centralized Architecture based on the Soft Actor-Critic algorithm (DICA-SAC), which uses a global module to aggregate agent states, extract compact global features, and broadcast them to all airships. By incorporating mean-field theory, the method ensures scalability to varying fleet sizes while maintaining global situational awareness. A convolutional neural network is employed to process high-resolution wind field data, compressing the raw grid into a compact set of features. A task-specific reward function is designed to maximize coverage while accelerating convergence. Simulations using historical wind field data over the South China Sea show that the proposed method achieves over 97 percent coverage in scenarios matching training conditions and maintains above 95 percent when the number of airships differs. These results demonstrate the robustness, adaptability, and scalability of DICA-SAC for large-scale multi-airship regional coverage planning in realistic environments.
AB - This paper presents a trajectory planning method for cooperative regional coverage missions involving multiple stratospheric airships under dynamic wind field conditions. We propose a De-Identified Centralized Architecture based on the Soft Actor-Critic algorithm (DICA-SAC), which uses a global module to aggregate agent states, extract compact global features, and broadcast them to all airships. By incorporating mean-field theory, the method ensures scalability to varying fleet sizes while maintaining global situational awareness. A convolutional neural network is employed to process high-resolution wind field data, compressing the raw grid into a compact set of features. A task-specific reward function is designed to maximize coverage while accelerating convergence. Simulations using historical wind field data over the South China Sea show that the proposed method achieves over 97 percent coverage in scenarios matching training conditions and maintains above 95 percent when the number of airships differs. These results demonstrate the robustness, adaptability, and scalability of DICA-SAC for large-scale multi-airship regional coverage planning in realistic environments.
KW - convolutional neural network
KW - multi-agent reinforcement learning
KW - multi-airship cooperative coverage
KW - Soft Actor-Critic
UR - https://www.scopus.com/pages/publications/105032627115
U2 - 10.1109/AIAC68175.2025.11332425
DO - 10.1109/AIAC68175.2025.11332425
M3 - 会议稿件
AN - SCOPUS:105032627115
T3 - 2025 3rd International Conference on Artificial Intelligence and Automation Control, AIAC 2025
SP - 387
EP - 394
BT - 2025 3rd International Conference on Artificial Intelligence and Automation Control, AIAC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 3rd International Conference on Artificial Intelligence and Automation Control, AIAC 2025
Y2 - 15 October 2025 through 17 October 2025
ER -