Skip to main navigation Skip to search Skip to main content

Robot Path Planning Method Based on an Improved TD3 Algorithm

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traditional path planning methods often rely heavily on environmental maps, while deep reinforcement learning (DRL) algorithms face challenges in achieving stable policy convergence in complex environments. To overcome these challenges, this study introduces an enhanced Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for autonomous path planning in unknown environments. First, temporally correlated the algorithm integrates Ornstein-Uhlenbeck (OU) noise to improve action exploration. Second, an Uncertainty-Weighted TD3 (UW-TD3) mechanism is introduced, which fuses Q-value estimates based on confidence weights to enhance stability and accelerate convergence. Finally, Hindsight Experience Replay (HER) is integrated to improve sample efficiency, enabling the agent to learn effectively from unsuccessful experiences. Simulation results under static as well as dynamic obstacle scenarios demonstrate that the proposed method significantly improves planning success rates and training efficiency, outperforming conventional TD3 and DDPG algorithms.

Original languageEnglish
Title of host publication2025 International Conference on Computational Intelligence and Robotics, CIR 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages21-26
Number of pages6
ISBN (Electronic)9798331523824
DOIs
StatePublished - 2025
Event2025 International Conference on Computational Intelligence and Robotics, CIR 2025 - Guangzhou, China
Duration: 12 Sep 202514 Sep 2025

Publication series

Name2025 International Conference on Computational Intelligence and Robotics, CIR 2025

Conference

Conference2025 International Conference on Computational Intelligence and Robotics, CIR 2025
Country/TerritoryChina
CityGuangzhou
Period12/09/2514/09/25

Keywords

  • deep reinforcement learning
  • hindsight experience replay
  • OU noise
  • path planning
  • TD3
  • uncertainty-based Q-value fusion

Fingerprint

Dive into the research topics of 'Robot Path Planning Method Based on an Improved TD3 Algorithm'. Together they form a unique fingerprint.

Cite this