Skip to main navigation Skip to search Skip to main content

Trajectory Planning of UAV in Unknown Dynamic Environment with Deep Reinforcement Learning

  • Jia Wang*
  • , Weihong Wang
  • , Qian Wu
  • *Corresponding author for this work
  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Providing a collision-free, safe and efficient optimal trajectory for unmanned aerial vehicles (UAVs) in an unknown dynamic environment is one of the most important issues for researchers. In this paper, a trajectory planning approach for UAV in unknown dynamic environment based on deep reinforcement learning (DRL) is proposed. This study models trajectory planning of UAV as a discrete-time, discrete-action problem, and then proposes an improved deep Q network (IDQN) algorithm to solve it. The IDQN algorithm adds the track angle information of UAV to the reward function to speed up the learning process, furthermore, it also improves the action selection strategy and learning rate setting. Besides, in simulation, the paper considers the trajectory constraints of UAV in order to make the obtained trajectory have better practical availability. Simulation results demonstrate the effectiveness of the IDQN algorithm to implement UAV trajectory planning with constraints in unknown dynamic environments. Meanwhile, comparison with the classical DQN (CDQN) algorithm is conducted to further explore the advantage of the method.

Original languageEnglish
Title of host publicationProceedings of 2019 Chinese Intelligent Systems Conference - Volume II
EditorsYingmin Jia, Junping Du, Weicun Zhang
PublisherSpringer Verlag
Pages470-480
Number of pages11
ISBN (Print)9789813296855
DOIs
StatePublished - 2020
EventChinese Intelligent Systems Conference, CISC 2019 - Haikou, China
Duration: 26 Oct 201927 Oct 2019

Publication series

NameLecture Notes in Electrical Engineering
Volume593
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceChinese Intelligent Systems Conference, CISC 2019
Country/TerritoryChina
CityHaikou
Period26/10/1927/10/19

Keywords

  • Improved DQN
  • Trajectory constraints
  • Trajectory planning
  • UAV
  • Unknown dynamic environment

Fingerprint

Dive into the research topics of 'Trajectory Planning of UAV in Unknown Dynamic Environment with Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this