跳到主要导航 跳到搜索 跳到主要内容

I4R: Promoting deep reinforcement learning by the indicator for expressive representations

  • Xufang Luo*
  • , Qi Meng
  • , Di He
  • , Wei Chen
  • , Yunhong Wang
  • *此作品的通讯作者
  • Beihang University
  • Microsoft USA
  • Peking University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Learning expressive representations is always crucial for well-performed policies in deep reinforcement learning (DRL). Different from supervised learning, in DRL, accurate targets are not always available, and some inputs with different actions only have tiny differences, which stimulates the demand for learning expressive representations. In this paper, firstly, we empirically compare the representations of DRL models with different performances. We observe that the representations of a better state extractor (SE) are more scattered than a worse one when they are visualized. Thus, we investigate the singular values of representation matrix, and find that, better SEs always correspond to smaller differences among these singular values. Next, based on such observations, we define an indicator of the representations for DRL model, which is the Number of Significant Singular Values (NSSV) of a representation matrix. Then, we propose I4R algorithm, to improve DRL algorithms by adding the corresponding regularization term to enhance the NSSV. Finally, we apply I4R to both policy gradient and value based algorithms on Atari games, and the results show the superiority of our proposed method.1,.

源语言英语
主期刊名Proceedings of the 29th International Joint Conference on Artificial Intelligence, IJCAI 2020
编辑Christian Bessiere
出版商International Joint Conferences on Artificial Intelligence
2669-2675
页数7
ISBN(电子版)9780999241165
出版状态已出版 - 2020
活动29th International Joint Conference on Artificial Intelligence, IJCAI 2020 - Yokohama, 日本
期限: 1 1月 2021 → …

出版系列

姓名IJCAI International Joint Conference on Artificial Intelligence
2021-January
ISSN(印刷版)1045-0823

会议

会议29th International Joint Conference on Artificial Intelligence, IJCAI 2020
国家/地区日本
Yokohama
时期1/01/21 → …

指纹

探究 'I4R: Promoting deep reinforcement learning by the indicator for expressive representations' 的科研主题。它们共同构成独一无二的指纹。

引用此