TY - GEN
T1 - Cooperative Target Pursuit by Multiple Fixed-wing UAVs Based on Deep Reinforcement Learning and Artificial Potential Field
AU - Zhao, Feng
AU - Hua, Yongzhao
AU - Zheng, Hongwei
AU - Yu, Jianglong
AU - Dong, Xiwang
AU - Li, Qingdong
AU - Ren, Zhang
N1 - Publisher Copyright:
© 2023 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2023
Y1 - 2023
N2 - This paper presents a cooperative target pursuit algorithm based on deep reinforcement learning and artificial potential field for multiple fixed-wing unmanned aerial vehicles (UAVs) in the complex environment. In the proposed algorithm, decentralized deep deterministic policy gradient is employed to learn cooperative target pursuit policies that are adaptive to complex environments including static obstacles and dynamic evader. The artificial potential field method is combined into the learning process to avoid obstacles and collision. In order to cooperatively pursuit, the reward is designed to incentivize the capture of target and encourage a beneficial formation of pursuers. Finally, the simulation shows the feasibility of the proposed algorithm in learning multiple cooperative pursuit strategies.
AB - This paper presents a cooperative target pursuit algorithm based on deep reinforcement learning and artificial potential field for multiple fixed-wing unmanned aerial vehicles (UAVs) in the complex environment. In the proposed algorithm, decentralized deep deterministic policy gradient is employed to learn cooperative target pursuit policies that are adaptive to complex environments including static obstacles and dynamic evader. The artificial potential field method is combined into the learning process to avoid obstacles and collision. In order to cooperatively pursuit, the reward is designed to incentivize the capture of target and encourage a beneficial formation of pursuers. Finally, the simulation shows the feasibility of the proposed algorithm in learning multiple cooperative pursuit strategies.
KW - Artificial Potential Field
KW - Cooperative Target Pursuit
KW - Decentralized Deep Deterministic Policy Gradient
UR - https://www.scopus.com/pages/publications/85175523783
U2 - 10.23919/CCC58697.2023.10241187
DO - 10.23919/CCC58697.2023.10241187
M3 - 会议稿件
AN - SCOPUS:85175523783
T3 - Chinese Control Conference, CCC
SP - 5693
EP - 5698
BT - 2023 42nd Chinese Control Conference, CCC 2023
PB - IEEE Computer Society
T2 - 42nd Chinese Control Conference, CCC 2023
Y2 - 24 July 2023 through 26 July 2023
ER -