TY - GEN
T1 - Small UAV Formation Control in Complex Environments Based on Deep Reinforcement Learning
AU - Ren, Hui
AU - Qu, Zhenge
AU - Guo, Yuxin
AU - Wu, Jiang
N1 - Publisher Copyright:
© 2025 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2025
Y1 - 2025
N2 - In addressing the problem of multi-UAV formation flight in complex environments, we propose a novel formation control framework for small fixed-wing UAVs. The framework integrates deep reinforcement learning (DRL) with traditional control techniques to fully leverage advanced decision-making and computational efficiency. Specifically, we develop a position controller for the leader UAV, combining the traditional Artificial Potential Field (APF) algorithm with the Deep Deterministic Policy Gradient (DDPG) algorithm. We design a potential-based action space and reward function for this controller. In addition, we have also designed a formation controller for the follower UAVs based on consensus theory, including the design of the control law, state switching conditions, and an obstacle and collision avoidance strategy based on Artificial Potential Field. This ensures that the formation safely reaches the target position while maintaining the formation. Finally, simulation experiments validate the effectiveness of the proposed method in enhancing formation efficiency and robustness, thereby offering a novel solution for multi-UAV missions in complex scenarios.
AB - In addressing the problem of multi-UAV formation flight in complex environments, we propose a novel formation control framework for small fixed-wing UAVs. The framework integrates deep reinforcement learning (DRL) with traditional control techniques to fully leverage advanced decision-making and computational efficiency. Specifically, we develop a position controller for the leader UAV, combining the traditional Artificial Potential Field (APF) algorithm with the Deep Deterministic Policy Gradient (DDPG) algorithm. We design a potential-based action space and reward function for this controller. In addition, we have also designed a formation controller for the follower UAVs based on consensus theory, including the design of the control law, state switching conditions, and an obstacle and collision avoidance strategy based on Artificial Potential Field. This ensures that the formation safely reaches the target position while maintaining the formation. Finally, simulation experiments validate the effectiveness of the proposed method in enhancing formation efficiency and robustness, thereby offering a novel solution for multi-UAV missions in complex scenarios.
KW - Artificial potential field
KW - Deep reinforcement learning
KW - Formation control
KW - Multi-UAVs
UR - https://www.scopus.com/pages/publications/105020301009
U2 - 10.23919/CCC64809.2025.11179779
DO - 10.23919/CCC64809.2025.11179779
M3 - 会议稿件
AN - SCOPUS:105020301009
T3 - Chinese Control Conference, CCC
SP - 9071
EP - 9076
BT - Proceedings of the 44th Chinese Control Conference, CCC 2025
A2 - Sun, Jian
A2 - Yin, Hongpeng
PB - IEEE Computer Society
T2 - 44th Chinese Control Conference, CCC 2025
Y2 - 28 July 2025 through 30 July 2025
ER -