TY - JOUR
T1 - Balanced Reward-inspired Reinforcement Learning for Autonomous Vehicle Racing
AU - Tian, Zhen
AU - Zhao, Dezong
AU - Lin, Zhihao
AU - Flynn, David
AU - Zhao, Wenjing
AU - Tian, Daxin
N1 - Publisher Copyright:
© 2024 Z. Tian, D. Zhao, Z. Lin, D. Flynn, W. Zhao & D. Tian.
PY - 2024
Y1 - 2024
N2 - Autonomous vehicle racing has attracted extensive interest due to its great potential in autonomous driving at the extreme limits. Model-based and learning-based methods are being widely used in autonomous racing. However, model-based methods cannot cope with the dynamic environments when only local perception is available. As a comparison, learning-based methods can handle complex environments under local perception. Recently, deep reinforcement learning (DRL) has gained popularity in autonomous racing. DRL outperforms conventional learning-based methods by handling complex situations and leveraging local information. DRL algorithms, such as the proximal policy algorithm, can achieve a good balance between the execution time and safety in autonomous vehicle competition. However, the training outcomes of conventional DRL methods exhibit inconsistent correctness in decision-making. The instability in decision-making introduces safety concerns in autonomous vehicle racing, such as collisions into track boundaries. The proposed algorithm is capable to avoid collisions and improve the training quality. Simulation results on a physical engine demonstrate that the proposed algorithm outperforms other DRL algorithms in in collision avoidance, achieving safer control during sharp bends, and higher training quality among multiple tracks.
AB - Autonomous vehicle racing has attracted extensive interest due to its great potential in autonomous driving at the extreme limits. Model-based and learning-based methods are being widely used in autonomous racing. However, model-based methods cannot cope with the dynamic environments when only local perception is available. As a comparison, learning-based methods can handle complex environments under local perception. Recently, deep reinforcement learning (DRL) has gained popularity in autonomous racing. DRL outperforms conventional learning-based methods by handling complex situations and leveraging local information. DRL algorithms, such as the proximal policy algorithm, can achieve a good balance between the execution time and safety in autonomous vehicle competition. However, the training outcomes of conventional DRL methods exhibit inconsistent correctness in decision-making. The instability in decision-making introduces safety concerns in autonomous vehicle racing, such as collisions into track boundaries. The proposed algorithm is capable to avoid collisions and improve the training quality. Simulation results on a physical engine demonstrate that the proposed algorithm outperforms other DRL algorithms in in collision avoidance, achieving safer control during sharp bends, and higher training quality among multiple tracks.
KW - Autonomous vehicle racing
KW - balanced reward function
KW - local planning
KW - proximal policy optimization
UR - https://www.scopus.com/pages/publications/85203683576
M3 - 会议文章
AN - SCOPUS:85203683576
SN - 2640-3498
VL - 242
SP - 628
EP - 640
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
T2 - 6th Annual Learning for Dynamics and Control Conference, L4DC 2024
Y2 - 15 July 2024 through 17 July 2024
ER -