跳到主要导航 跳到搜索 跳到主要内容

SAFEDREAMER: SAFE REINFORCEMENT LEARNING WITH WORLD MODEL

  • Weidong Huang
  • , Jiaming Ji
  • , Chunhe Xia
  • , Borong Zhang
  • , Yaodong Yang*
  • *此作品的通讯作者
  • Peking University
  • Beihang University

科研成果: 会议稿件论文同行评审

摘要

The deployment of Reinforcement Learning (RL) in real-world applications is constrained by its failure to satisfy safety criteria. Existing Safe Reinforcement Learning (SafeRL) methods, which rely on cost functions to enforce safety, often fail to achieve zero-cost performance in complex scenarios, especially vision-only tasks. These limitations are primarily due to model inaccuracies and inadequate sample efficiency. The integration of the world model has proven effective in mitigating these shortcomings. In this work, we introduce SafeDreamer, a novel algorithm incorporating Lagrangian-based methods into world model planning processes within the superior Dreamer framework. Our method achieves nearly zero-cost performance on various tasks, spanning low-dimensional and vision-only input, within the Safety-Gymnasium benchmark, showcasing its efficacy in balancing performance and safety in RL tasks. Further details can be found in the code repository: https://github.com/PKU-Alignment/SafeDreamer.

源语言英语
出版状态已出版 - 2024
活动12th International Conference on Learning Representations, ICLR 2024 - Hybrid, Vienna, 奥地利
期限: 7 5月 202411 5月 2024

会议

会议12th International Conference on Learning Representations, ICLR 2024
国家/地区奥地利
Hybrid, Vienna
时期7/05/2411/05/24

指纹

探究 'SAFEDREAMER: SAFE REINFORCEMENT LEARNING WITH WORLD MODEL' 的科研主题。它们共同构成独一无二的指纹。

引用此