Skip to main navigation Skip to search Skip to main content

Balanced Reward-inspired Reinforcement Learning for Autonomous Vehicle Racing

  • Zhen Tian
  • , Dezong Zhao
  • , Zhihao Lin
  • , David Flynn
  • , Wenjing Zhao
  • , Daxin Tian
  • University of Glasgow
  • Hong Kong Polytechnic University

Research output: Contribution to journalConference articlepeer-review

Abstract

Autonomous vehicle racing has attracted extensive interest due to its great potential in autonomous driving at the extreme limits. Model-based and learning-based methods are being widely used in autonomous racing. However, model-based methods cannot cope with the dynamic environments when only local perception is available. As a comparison, learning-based methods can handle complex environments under local perception. Recently, deep reinforcement learning (DRL) has gained popularity in autonomous racing. DRL outperforms conventional learning-based methods by handling complex situations and leveraging local information. DRL algorithms, such as the proximal policy algorithm, can achieve a good balance between the execution time and safety in autonomous vehicle competition. However, the training outcomes of conventional DRL methods exhibit inconsistent correctness in decision-making. The instability in decision-making introduces safety concerns in autonomous vehicle racing, such as collisions into track boundaries. The proposed algorithm is capable to avoid collisions and improve the training quality. Simulation results on a physical engine demonstrate that the proposed algorithm outperforms other DRL algorithms in in collision avoidance, achieving safer control during sharp bends, and higher training quality among multiple tracks.

Original languageEnglish
Pages (from-to)628-640
Number of pages13
JournalProceedings of Machine Learning Research
Volume242
StatePublished - 2024
Event6th Annual Learning for Dynamics and Control Conference, L4DC 2024 - Oxford, United Kingdom
Duration: 15 Jul 202417 Jul 2024

Keywords

  • Autonomous vehicle racing
  • balanced reward function
  • local planning
  • proximal policy optimization

Fingerprint

Dive into the research topics of 'Balanced Reward-inspired Reinforcement Learning for Autonomous Vehicle Racing'. Together they form a unique fingerprint.

Cite this