Skip to main navigation Skip to search Skip to main content

3D Path Planning for UAV with Improved Double Deep Q-Network

  • Liping Zhao
  • , Yaofei Ma
  • , Jie Zou*
  • *Corresponding author for this work
  • Beihang University
  • Science and Technology on Electro-Optic Control Laboratory of Luoyang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Unmanned aerial vehicle (UAV) has been widely used in civil and military fields due to its advantages such as zero casualties, low cost and strong maneuverability. Path planning in 3D obstacle environment is one of the fundamental capabilities of UAV for mission performing. In this paper, we propose a 3D path planning algorithm to learn a target-driven end-to-end model based on an improved double deep Q-network (DQN), where a greedy exploration strategy is applied to accelerate learning. The model takes target and obstacle message as input, and moving command of UAV as output. It can realize path planning successfully for UAV in 3D complex environment. Besides, the experimental results show that improved double DQN has better convergence speed compared with DQN and double DQN.

Original languageEnglish
Title of host publicationProceedings of 2020 Chinese Intelligent Systems Conference - Volume II
EditorsYingmin Jia, Weicun Zhang, Yongling Fu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages374-383
Number of pages10
ISBN (Print)9789811584572
DOIs
StatePublished - 2021
EventChinese Intelligent Systems Conference, CISC 2020 - Shenzhen, China
Duration: 24 Oct 202025 Oct 2020

Publication series

NameLecture Notes in Electrical Engineering
Volume706 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceChinese Intelligent Systems Conference, CISC 2020
Country/TerritoryChina
CityShenzhen
Period24/10/2025/10/20

Keywords

  • 3D path planning
  • Greedy exploration strategy
  • Improved double DQN
  • Reinforcement learning
  • UAV

Fingerprint

Dive into the research topics of '3D Path Planning for UAV with Improved Double Deep Q-Network'. Together they form a unique fingerprint.

Cite this