Skip to main navigation Skip to search Skip to main content

一 种 多 感 知 多 约 束 奖 励 机 制 的 驾 驶策 略 学 习 方 法

Translated title of the contribution: A driving decision⁃making approach based on multi⁃sensing and multi⁃constraints reward function
  • Zhong Li Wang
  • , Hao Wang
  • , Yan Shen
  • , Bai Gen Cai
  • Beijing Jiaotong University

Research output: Contribution to journalArticlepeer-review

Abstract

Due to the complicated and volatile traffic scenes,deep learning-based approaches and most of the deep reinforcement learning approaches cannot satisfy the requirements of real applications. To address these issues,a reinforcement learning-based approach based on multi-sensing and multi-constraint reward function under SAC framework(MSMC-SAC)is proposed. The inputs of the method include front images and LiDAR data,as well as the bird's-eye view information generated from the perception results. The multiple information input is coded by an encoding network to obtain the representation in latent space,and the reconstructed information is used as the input for reinforcement learning module,and a reward function considering various constraints such as transverse-longitudinal error,heading,smoothness,and driving speed is designed. The performance of the proposed method in some typical traffic scenarios is simulated and verified with CARLA. The multi-constraint reward mechanism is analyzed. The simulation results show that the presented approach can generate the driving policies in many traffic scenarios,and the performance is outperformed against the existing SOTA methods.

Translated title of the contributionA driving decision⁃making approach based on multi⁃sensing and multi⁃constraints reward function
Original languageChinese (Traditional)
Pages (from-to)2718-2727
Number of pages10
JournalJilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition)
Volume52
Issue number11
DOIs
StatePublished - 1 Nov 2022
Externally publishedYes

Fingerprint

Dive into the research topics of 'A driving decision⁃making approach based on multi⁃sensing and multi⁃constraints reward function'. Together they form a unique fingerprint.

Cite this