Skip to main navigation Skip to search Skip to main content

Robust Navigation for Unmanned Surface Vehicle Utilizing Improved Distributional Soft Actor-Critic

  • Jingzehua Xu
  • , Ziqi Jia
  • , Zekai Zhang
  • , Tianyu Xing
  • , Jingjing Wang*
  • , Yong Ren
  • *Corresponding author for this work
  • Tsinghua University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Navigating unmanned surface vehicles (USVs) efficiently and robustly in the presence of obstacles and ocean current interference in marine environments is highly challenging. To achieve robust navigation without environment maps and prior information, we follow the three effective improvements of the distributed algorithm distributional soft actor-critic with three refinements (DSACT) over distributional soft actor-critic (DSAC): expected value substitution, double value distribution learning, and variance-based critic gradient adjustment. In order to further optimize the learning rate of DSACT, we optimize DSACT through loss-adjusted prioritized experience replay (LAP) and propose a local path planner called LAP-DSACT for USV navigation. In order to offset the disturbance of ocean currents and plan a smooth and safe trajectory, the motion compensation is considered in the USV motion model. The experimental results clearly demonstrate that LAP-DSACT algorithm outperforms the comparison algorithms in terms of task time, energy efficiency, and the quality of path.

Original languageEnglish
Title of host publicationArtificial Neural Networks and Machine Learning – ICANN 2024 - 33rd International Conference on Artificial Neural Networks, Proceedings
EditorsMichael Wand, Jürgen Schmidhuber, Michael Wand, Kristína Malinovská, Jürgen Schmidhuber, Igor V. Tetko, Igor V. Tetko
PublisherSpringer Science and Business Media Deutschland GmbH
Pages291-305
Number of pages15
ISBN (Print)9783031723407
DOIs
StatePublished - 2024
Event33rd International Conference on Artificial Neural Networks, ICANN 2024 - Lugano, Switzerland
Duration: 17 Sep 202420 Sep 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15019 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference33rd International Conference on Artificial Neural Networks, ICANN 2024
Country/TerritorySwitzerland
CityLugano
Period17/09/2420/09/24

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy
  2. SDG 14 - Life Below Water
    SDG 14 Life Below Water

Keywords

  • Distributional reinforcement learning
  • Loss-adjusted prioritized experience replay
  • Robust navigation
  • Unmanned surface vehicle

Fingerprint

Dive into the research topics of 'Robust Navigation for Unmanned Surface Vehicle Utilizing Improved Distributional Soft Actor-Critic'. Together they form a unique fingerprint.

Cite this