跳到主要导航 跳到搜索 跳到主要内容

Quadruped Reinforcement Learning without Explicit State Estimation

  • Qikai Li
  • , Guiyu Dong
  • , Ripeng Qin
  • , Jiawei Chen
  • , Kun Xu*
  • , Xilun Ding
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Reinforcement learning is a promising approach to developing legged robot locomotion controllers. The gen-eral process of development is: large-scale training in the virtual simulation environment to obtain reliable control policy network, and then the policy network is deployed to real legged robot. In the training procedure, a complete robot state increases the speed of training and the stability of policy. The robot's states like body velocities are easily available in simulation training, but they are difficult to obtain in the real robot, hence specifically designed robot state estimators are needed. However, the development of state estimators requires expert knowledge related to control theory and robotics, limiting the direct application of reinforcement learning to robots. To take advantage of the end-to-end mapping of artificial neural networks, we simplified the existing reinforcement learning process for quadruped robots and propose a training method based on curriculum learning in this work. The proposed method can produce a reliable policy that does not require robot state estimator and only take raw sensors data. The feasibility of the proposed method is verified in simulations and real quadrupedal robot. Video of the quadrupedal robot is available at www.youtube.com/watch?v=-iho4KIlEPw.

源语言英语
主期刊名2022 IEEE International Conference on Robotics and Biomimetics, ROBIO 2022
出版商Institute of Electrical and Electronics Engineers Inc.
1989-1994
页数6
ISBN(电子版)9781665481090
DOI
出版状态已出版 - 2022
活动2022 IEEE International Conference on Robotics and Biomimetics, ROBIO 2022 - Jinghong, 中国
期限: 5 12月 20229 12月 2022

出版系列

姓名2022 IEEE International Conference on Robotics and Biomimetics, ROBIO 2022

会议

会议2022 IEEE International Conference on Robotics and Biomimetics, ROBIO 2022
国家/地区中国
Jinghong
时期5/12/229/12/22

指纹

探究 'Quadruped Reinforcement Learning without Explicit State Estimation' 的科研主题。它们共同构成独一无二的指纹。

引用此