TY - GEN
T1 - Collaborative Penetration Algorithm with Dominant Region Analysis Embedded in Deep Reinforcement Learning
AU - Luo, Jiong
AU - Li, Xiaoduo
AU - Yan, Rui
AU - Hua, Yongzhao
AU - Dong, Xiwang
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - The problem of UAV attack-defense confrontation is a hot research direction in the field of unmanned systems at present. However, in an environment with threat areas or obstacles, when the enemy is a defender with higher mobility or pursuit ability, the cooperative penetration problem of multiple UAVs is still lack of effective solutions. Therefore, this paper combines the theoretical analysis of game theory and the advantages of reinforcement learning in complex scenes, and designs an algorithm framework for embedding dominant region analysis into deep reinforcement learning. On the premise of sacrificing strategy, we analytically derive the attacker's dominance region through geometric optimization and integrate this framework into the Deep Deterministic Policy Gradient (DDPG) algorithm by enhancing state space formulation, reward function design, and termination criteria. Numerical simulations demonstrate the algorithm's superior efficacy over baseline reinforcement learning approaches, exhibiting reduced training time (42.8%), increased penetration success rate (31%), and optimized trajectory lengths (4.9%).
AB - The problem of UAV attack-defense confrontation is a hot research direction in the field of unmanned systems at present. However, in an environment with threat areas or obstacles, when the enemy is a defender with higher mobility or pursuit ability, the cooperative penetration problem of multiple UAVs is still lack of effective solutions. Therefore, this paper combines the theoretical analysis of game theory and the advantages of reinforcement learning in complex scenes, and designs an algorithm framework for embedding dominant region analysis into deep reinforcement learning. On the premise of sacrificing strategy, we analytically derive the attacker's dominance region through geometric optimization and integrate this framework into the Deep Deterministic Policy Gradient (DDPG) algorithm by enhancing state space formulation, reward function design, and termination criteria. Numerical simulations demonstrate the algorithm's superior efficacy over baseline reinforcement learning approaches, exhibiting reduced training time (42.8%), increased penetration success rate (31%), and optimized trajectory lengths (4.9%).
UR - https://www.scopus.com/pages/publications/105016229782
U2 - 10.1109/ICCA65672.2025.11129851
DO - 10.1109/ICCA65672.2025.11129851
M3 - 会议稿件
AN - SCOPUS:105016229782
T3 - IEEE International Conference on Control and Automation, ICCA
SP - 606
EP - 611
BT - 2025 IEEE 19th International Conference on Control and Automation, ICCA 2025
PB - IEEE Computer Society
T2 - 19th IEEE International Conference on Control and Automation, ICCA 2025
Y2 - 30 June 2025 through 3 July 2025
ER -