跳到主要导航 跳到搜索 跳到主要内容

Simoun: Synergizing Interactive Motion-appearance Understanding for Vision-based Reinforcement Learning

  • Yangru Huang
  • , Peixi Peng*
  • , Yifan Zhao
  • , Yunpeng Zhai
  • , Haoran Xu
  • , Yonghong Tian*
  • *此作品的通讯作者
  • Peking University
  • Peng Cheng Laboratory
  • Sun Yat-Sen University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Efficient motion and appearance modeling are critical for vision-based Reinforcement Learning (RL). However, existing methods struggle to reconcile motion and appearance information within the state representations learned from a single observation encoder. To address the problem, we present Synergizing Interactive Motion-appearance Understanding (Simoun), a unified framework for vision-based RL Given consecutive observation frames, Simoun deliberately and interactively learns both motion and appearance features through a dual-path network architecture. The learning process collaborates with a structural interactive module, which explores the latent motion-appearance structures from the two network paths to leverage their complementarity. To promote sample efficiency, we further design a consistency-guided curiosity module to encourage the exploration of under-learned observations. During training, the curiosity module provides intrinsic rewards according to the consistency of environmental temporal dynamics, which are deduced from both motion and appearance network paths. Experiments conducted on Deep-Mind control suite and CARLA automatic driving benchmarks demonstrate the effectiveness of Simoun, where it performs favorably against state-of-the-art methods.

源语言英语
主期刊名Proceedings - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
出版商Institute of Electrical and Electronics Engineers Inc.
176-185
页数10
ISBN(电子版)9798350307184
DOI
出版状态已出版 - 2023
已对外发布
活动2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Paris, 法国
期限: 2 10月 20236 10月 2023

出版系列

姓名Proceedings of the IEEE International Conference on Computer Vision
ISSN(印刷版)1550-5499

会议

会议2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023
国家/地区法国
Paris
时期2/10/236/10/23

指纹

探究 'Simoun: Synergizing Interactive Motion-appearance Understanding for Vision-based Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此