Skip to main navigation Skip to search Skip to main content

Multi-UAV Path Planning Based on Multi-Agent Deep Reinforcement Learning

  • Zeyang Sun
  • , Xiling Luo*
  • , Xiaohai Ji
  • , Wang Zhao
  • , Yue Lei
  • , Gengyi Bai
  • *Corresponding author for this work
  • Hangzhou Innovation Institute
  • Beihang University
  • Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As the widespread adoption of unmanned aerial vehicles (UAVs) in various fields, the UAVs path planning serves as a prerequisite for ensuring the safety and efficiency of the flight. Existing methods rely on grid airspace and detailed modeling for a specific environment, limiting portability and inflexibility responding to diverse tasks and environments. They utilize discrete actions space, neglecting the dynamics characteristics of UAVs, which leads to planning relatively rough paths. Furthermore, the current Studies lack sufficient consideration of reliance on spatiotemporal data and disregard multi-UAV collisions and unstable training environments, which hinder the realization of collaboration among UAVs. In response to these challenges, this paper propose an Enhanced Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (Enhanced-MATD3). We model a state space according to the 3D scene and design a continuous action space. Further, we devise reward function based on obstacle and collision avoidance principles. By enhancing the actor and critic network from both temporal and spatial dimensions, we broaden UAVs perception range and sharpen their responsiveness to dynamic environmental changes. Our work use different scenarios during the training process, and the experimental results demonstrate the advantages of our algorithm in terms of average path length, planning time, and convergence speed over the baseline.

Original languageEnglish
Title of host publicationIntelligent Transportation Engineering - Proceedings of the 9th International Conference, ICITE 2024
EditorsGuoqiang Mao
PublisherIOS Press BV
Pages882-894
Number of pages13
ISBN (Electronic)9781643686028
DOIs
StatePublished - 17 Jul 2025
Event9th International Conference on Intelligent Transportation Engineering, ICITE 2024 - Xi'an, China
Duration: 18 Oct 202420 Oct 2024

Publication series

NameAdvances in Transdisciplinary Engineering
Volume72
ISSN (Print)2352-751X
ISSN (Electronic)2352-7528

Conference

Conference9th International Conference on Intelligent Transportation Engineering, ICITE 2024
Country/TerritoryChina
CityXi'an
Period18/10/2420/10/24

Keywords

  • Enhanced-MATD3
  • Multi-Agent deep reinforcement learning
  • low-Altitude airspace
  • path planning

Fingerprint

Dive into the research topics of 'Multi-UAV Path Planning Based on Multi-Agent Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this