Skip to main navigation Skip to search Skip to main content

Cooperative Pursuit-Evasion with Low Altitude Wireless Network: A Hierarchical Reinforcement Learning Approach

  • Zhengzhi Yang
  • , Yuanhao Cui
  • , Wenbo Du
  • , Fanbiao Li
  • , Yumeng Li*
  • *Corresponding author for this work
  • Beihang University
  • State Key Laboratory of CNS/ATM
  • Beijing University of Posts and Telecommunications
  • Central South University

Research output: Contribution to journalArticlepeer-review

Abstract

As an emerging countermeasure, cooperative interception by multiple UAVs offers an effective solution to neutralize rogue drones and safeguard low-altitude airspace operations. Effective coordination among counter-UAVs in encircling intruding drones remains challenging. This paper proposes a Hierarchical Cooperative Deep Reinforcement Learning (HCDRL) algorithm to enhance cooperation and efficiency among UAVs pursuing agile targets. The proposed approach decomposes the multi-agent pursuit-evasion scenario into multiple subtasks using a two-layer hierarchical decision-making framework. Specifically, the upper-layer network acts as a meta-strategy, dynamically assessing pursuit scenarios and assigning optimal subtasks. Meanwhile, the lower-layer policy networks of individual agents determine maneuver actions based on local observations and assigned subtasks. Simulation results demonstrate that the proposed algorithm significantly improves multi-agent cooperative encirclement performance, achieving an 11.18% higher success rate and a 9.94% reduction in completion time compared to state-of-the-art methods.

Original languageEnglish
Pages (from-to)5716-5729
Number of pages14
JournalIEEE Transactions on Mobile Computing
Volume25
Issue number4
DOIs
StatePublished - 2026

Keywords

  • Pursuit-evasion game (PEG)
  • cooperative encirclement
  • multi-agent reinforcement learning (MARL)

Fingerprint

Dive into the research topics of 'Cooperative Pursuit-Evasion with Low Altitude Wireless Network: A Hierarchical Reinforcement Learning Approach'. Together they form a unique fingerprint.

Cite this