摘要
With the rapid development of traffic intelligence, autonomous driving technology has gradually attracted the interests of researchers. Behavioral decision-making is one of the most important parts of autonomous driving system (ADS). As a common solution, imitation learning (IL) provides a more natural and intuitive way of learning through the prior knowledge of experts. Generative adversarial imitation learning(GAIL), which is a branch of IL, is often used to learn the driving policy because of its robustness and capacity of handling large-scale problems. However, modal collapse caused by GAIL may make the generated policies lack diversity resulting in the failure of multi-task learning. In the paper, we propose an algorithm named as bidirectional generative adversarial imitation learning (BiGAIL) that allows the agent to learn the map between task intentions and driving policies, so as to achieve the goal of learning intention-based driving policy. Through simulation verification, the agent trained with BiGAIL is able to select the appropriate policy based on the current environment and learn different driving policies from multi-task demonstrations.
| 源语言 | 英语 |
|---|---|
| 主期刊名 | Proceedings - 2022 Chinese Automation Congress, CAC 2022 |
| 出版商 | Institute of Electrical and Electronics Engineers Inc. |
| 页 | 893-898 |
| 页数 | 6 |
| ISBN(电子版) | 9781665465335 |
| DOI | |
| 出版状态 | 已出版 - 2022 |
| 活动 | 2022 Chinese Automation Congress, CAC 2022 - Xiamen, 中国 期限: 25 11月 2022 → 27 11月 2022 |
出版系列
| 姓名 | Proceedings - 2022 Chinese Automation Congress, CAC 2022 |
|---|---|
| 卷 | 2022-January |
会议
| 会议 | 2022 Chinese Automation Congress, CAC 2022 |
|---|---|
| 国家/地区 | 中国 |
| 市 | Xiamen |
| 时期 | 25/11/22 → 27/11/22 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 7 经济适用的清洁能源
指纹
探究 'BiGAIL: Learning Intention-based Driving Policy from Multi-task Demonstrations' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver