Enabling Efficient Scheduling in Large-Scale UAV-Assisted Mobile-Edge Computing via Hierarchical Reinforcement Learning

  • Tao Ren
  • , Jianwei Niu
  • , Bin Dai
  • , Xuefeng Liu
  • , Zheyuan Hu
  • , Mingliang Xu
  • , Mohsen Guizani*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Due to the high maneuverability and flexibility, unmanned aerial vehicles (UAVs) have been considered as a promising paradigm to assist mobile edge computing (MEC) in many scenarios including disaster rescue and field operation. Most existing research focuses on the study of trajectory and computation-offloading scheduling for UAV-assisted MEC in stationary environments, and could face challenges in dynamic environments where the locations of UAVs and mobile devices (MDs) vary significantly. Some latest research attempts to develop scheduling policies for dynamic environments by means of reinforcement learning (RL). However, as these need to explore in high-dimensional state and action space, they may fail to cover in large-scale networks where multiple UAVs serve numerous MDs. To address this challenge, we leverage the idea of 'divide-and-conquer' and propose HT3O, a scalable scheduling approach for large-scale UAV-assisted MEC. First, HT3O is built with neural networks via deep RL to obtain real-time scheduling policies for MEC in dynamic environments. More importantly, to make HT3O more scalable, we decompose the scheduling problem into two-layered subproblems and optimize them alternately via hierarchical RL. This not only substantially reduces the complexity of each subproblem, but also improves the convergence efficiency. Experimental results show that HT3O can achieve promising performance improvements over state-of-the-art approaches.

Original languageEnglish
Pages (from-to)7095-7109
Number of pages15
JournalIEEE Internet of Things Journal
Volume9
Issue number10
DOIs
StatePublished - 15 May 2022

Keywords

  • Computation offloading
  • Hierarchical reinforcement learning (HRL)
  • Mobile edge computing (MEC)
  • Trajectory optimization
  • Unmanned aerial vehicle

Fingerprint

Dive into the research topics of 'Enabling Efficient Scheduling in Large-Scale UAV-Assisted Mobile-Edge Computing via Hierarchical Reinforcement Learning'. Together they form a unique fingerprint.

Cite this