TY - GEN
T1 - AUV Path Planning with Kinematic Constraints in Unknown Environment Using Reinforcement Learning
AU - Hou, Xuyang
AU - Du, Jun
AU - Wang, Jingjing
AU - Ren, Yong
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2020/6/19
Y1 - 2020/6/19
N2 - Path planning has been considered as a challenging and critical issue in the management of autonomous underwater vehicle (AUV) systems. Although in the past years, the study of AUV path planning has obtained numerous achievements, many researches only considered the optimal path in a known grid-based environment, and ignored nonlinear kinematic characteristics of AUVs, which is unrealistic. In this paper, a reinforcement learning based path planning algorithm is proposed, with the nonlinear kinematic constraints of the AUV in unknown continuous environments. The proposed algorithm utilizes the sonar array to detect the randomly placed obstacles and plans a collision-free path that connects the start and target points even the map data is not known in advance. Extensive analysis and simulations are conducted in unknown continuous environments, and verify the validity and efficiency of the proposed algorithm.
AB - Path planning has been considered as a challenging and critical issue in the management of autonomous underwater vehicle (AUV) systems. Although in the past years, the study of AUV path planning has obtained numerous achievements, many researches only considered the optimal path in a known grid-based environment, and ignored nonlinear kinematic characteristics of AUVs, which is unrealistic. In this paper, a reinforcement learning based path planning algorithm is proposed, with the nonlinear kinematic constraints of the AUV in unknown continuous environments. The proposed algorithm utilizes the sonar array to detect the randomly placed obstacles and plans a collision-free path that connects the start and target points even the map data is not known in advance. Extensive analysis and simulations are conducted in unknown continuous environments, and verify the validity and efficiency of the proposed algorithm.
KW - AUV
KW - path planning
KW - reinforcement learning
KW - unknown environments
UR - https://www.scopus.com/pages/publications/85091576384
U2 - 10.1145/3408127.3408183
DO - 10.1145/3408127.3408183
M3 - 会议稿件
AN - SCOPUS:85091576384
T3 - ACM International Conference Proceeding Series
SP - 274
EP - 278
BT - ICDSP 2020 - 2020 4th International Conference on Digital Signal Processing, Proceedings
PB - Association for Computing Machinery
T2 - 4th International Conference on Digital Signal Processing, ICDSP 2020
Y2 - 19 June 2020 through 21 June 2020
ER -