跳到主要导航 跳到搜索 跳到主要内容

Energy-Aware Collaborative AAV Target Tracking via Reinforcement Learning-Based Predictive Control with Asynchronous Policy Iteration

  • Yi Xia
  • , Ronghua Zhang
  • , Xiangwang Hou
  • , Xin Xu*
  • , Jingjing Wang
  • , Chunxiao Jiang
  • , Dusit Niyato
  • *此作品的通讯作者
  • National University of Defense Technology
  • Tsinghua University
  • Nanyang Technological University

科研成果: 期刊稿件文章同行评审

摘要

Autonomous aerial vehicle (AAV) target tracking technology is an essential component for enabling diverse low-altitude activities. Due to the constraints on energy and computing resources of AAVs, current approaches face challenges in balancing prolonged flight duration with precise tracking while avoiding high computational complexity. Therefore, this paper proposes an energy-aware formation control algorithm for multiple AAVs to cooperatively track a target while retaining a desired formation pattern. Firstly, to achieve a balanced outcome in terms of tracking performance and control effort, an actor-critic based learning predictive rule is explored to develop a near-optimal control protocol that stabilizes error dynamics and minimizes value functions for discrete-time AAV systems. By decomposing the infinite-horizon target tracking problem into a sequence of finite-horizon sub-problems, the reinforcement learning (RL)-based predictive control algorithm can achieve fast convergence in approximating the solution of Hamilton-Jacobi-Bellman (HJB) equation. Furthermore, by employing a delicately designed asynchronous policy iteration mechanism with adjustable learning intervals in RL, the cumbersome learning process can be effectively mitigated, thereby attaining both high learning efficiency and a reduced computational burden simultaneously. The involved errors are proven to be convergent and simulation results validate the optimality of our method.

源语言英语
期刊IEEE Transactions on Mobile Computing
DOI
出版状态已接受/待刊 - 2025

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 7 - 经济适用的清洁能源
    可持续发展目标 7 经济适用的清洁能源

指纹

探究 'Energy-Aware Collaborative AAV Target Tracking via Reinforcement Learning-Based Predictive Control with Asynchronous Policy Iteration' 的科研主题。它们共同构成独一无二的指纹。

引用此