Abstract
With the rapid development of traffic intelligence, autonomous driving technology has gradually attracted the interests of researchers. Behavioral decision-making is one of the most important parts of autonomous driving system (ADS). As a common solution, imitation learning (IL) provides a more natural and intuitive way of learning through the prior knowledge of experts. Generative adversarial imitation learning(GAIL), which is a branch of IL, is often used to learn the driving policy because of its robustness and capacity of handling large-scale problems. However, modal collapse caused by GAIL may make the generated policies lack diversity resulting in the failure of multi-task learning. In the paper, we propose an algorithm named as bidirectional generative adversarial imitation learning (BiGAIL) that allows the agent to learn the map between task intentions and driving policies, so as to achieve the goal of learning intention-based driving policy. Through simulation verification, the agent trained with BiGAIL is able to select the appropriate policy based on the current environment and learn different driving policies from multi-task demonstrations.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2022 Chinese Automation Congress, CAC 2022 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 893-898 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781665465335 |
| DOIs | |
| State | Published - 2022 |
| Event | 2022 Chinese Automation Congress, CAC 2022 - Xiamen, China Duration: 25 Nov 2022 → 27 Nov 2022 |
Publication series
| Name | Proceedings - 2022 Chinese Automation Congress, CAC 2022 |
|---|---|
| Volume | 2022-January |
Conference
| Conference | 2022 Chinese Automation Congress, CAC 2022 |
|---|---|
| Country/Territory | China |
| City | Xiamen |
| Period | 25/11/22 → 27/11/22 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Autonomous driving
- bidirectional generative adversarial imitation learning
- intention-based policy
- multi-task demonstrations
Fingerprint
Dive into the research topics of 'BiGAIL: Learning Intention-based Driving Policy from Multi-task Demonstrations'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver