Distributed UAV Swarm Confrontation Decision-Making Based on Reinforcement Learning

  • Hongya Liang
  • , Yao Fan
  • , Jianwei Zhou
  • , Shuai Zheng
  • , Xiaoduo Li
  • , Liang Han*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the rapid development of unmanned aerial vehicles (UAV) technology, the decision-making process in UAV swarm confrontation has become a critical research focus both domestically and internationally. To address the computational challenges posed by centralized decision-making methods, this paper introduces a decentralized framework for strategic decision-making in UAV swarm conflicts. First, this paper constructs a confrontation scenario consisting of multiple homogeneous and equal-numbered UAVs, and allocates a specific strike target to each UAV through a target allocation algorithm, transforming the multi-UAV combat into single UAV combat tasks. Then, the Deep Deterministic Policy Gradient (DDPG) algorithm is employed to train the decision-making model for the solitary UAV engagement. To markedly enhance the model's convergence rate, a strategy integrating reward shaping and curriculum learning is implemented. Moreover, the Artificial Potential Field (APF) method is employed to address the issue of collision prevention in multi-UAV operations. Ultimately, numerical simulation validates the effectiveness and scalability of the proposed approach.

Original languageEnglish
Title of host publicationProceedings of the 37th Chinese Control and Decision Conference, CCDC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5359-5364
Number of pages6
ISBN (Electronic)9798331510565
DOIs
StatePublished - 2025
Event37th Chinese Control and Decision Conference, CCDC 2025 - Xiamen, China
Duration: 16 May 202519 May 2025

Publication series

NameProceedings of the 37th Chinese Control and Decision Conference, CCDC 2025

Conference

Conference37th Chinese Control and Decision Conference, CCDC 2025
Country/TerritoryChina
CityXiamen
Period16/05/2519/05/25

Keywords

  • UAV swarm
  • curriculum learning
  • ddpg
  • reward reshaping

Fingerprint

Dive into the research topics of 'Distributed UAV Swarm Confrontation Decision-Making Based on Reinforcement Learning'. Together they form a unique fingerprint.

Cite this