Skip to main navigation Skip to search Skip to main content

Policy-based monocular vision autonomous quadrotor obstacle avoidance method

  • Beihang University

Research output: Contribution to journalConference articlepeer-review

Abstract

Aiming at the obstacle avoidance control problem of small quadrotor, a method of quadrotor obstacle avoidance based on reinforcement learning is proposed. The proposed method can make training converge quickly and has good environmental robustness. The proposed methods include: (1) a framework adopts perception module and decision module to improve the generalization ability of the obstacle avoidance model; (2) An Actor-Critic framework-based Proximal Policy Optimization (PPO) algorithm to provide quadrotor with policy-based decision-making capabilities; The experimental simulation results show that the strategy-based framework converges quickly and has a high success rate, the training time is much lower than that of the value-based framework. The monocular vision observation ability is limited, which leads to deviations between local observation and global state, So LSTM layer is usually added to increase model performance. Policy -based decision can have a good obstacle avoidance effect without adding the LSTM layer, and have good generalization ability after short relearning after changing.

Original languageEnglish
Article number032025
JournalJournal of Physics: Conference Series
Volume2083
Issue number3
DOIs
StatePublished - 2 Dec 2021
Event2021 2nd International Conference on Applied Physics and Computing, ICAPC 2021 - Ottawa, Canada
Duration: 8 Sep 202110 Sep 2021

Keywords

  • Deep reinforcement learning
  • Obstacle avoidance
  • PPO
  • Policy-based

Fingerprint

Dive into the research topics of 'Policy-based monocular vision autonomous quadrotor obstacle avoidance method'. Together they form a unique fingerprint.

Cite this