Skip to main navigation Skip to search Skip to main content

Multi-Stage Deep Reinforcement Learning-Based Design and Optimization of Walking policy for Humanoid Robots

  • Haomiao Qiu
  • , Jian Fu*
  • , Kaibo Yang
  • , Ke Li
  • , Zhiyuan Yu
  • , Jiawei Zhao
  • *Corresponding author for this work
  • Beihang University
  • Beijing Institute of Precision Mechatronics and Controls

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the continuous advancement of research on humanoid robots, the optimization of motion control strategies has become a crucial research direction in this field. Walking, as the fundamental motion for humanoid robots to accomplish various tasks, its control policy directly affects the practical application of humanoid robots. Currently, the problems of low efficiency in policy training and high energy consumption in motion have gradually become prominent, which are key challenges to be solved urgently. To address this, a multi-stage deep reinforcement learning (DRL) framework is proposed for the generation and optimization of walking motion control policy. By sequentially solving the problems of motion policy generation and optimization, the results of simulation verify that the proposed pipeline can accelerate the policy training convergence and effectively improve motion energy efficiency under different walking speeds.

Original languageEnglish
Title of host publicationProceedings of 2025 8th International Conference on Computer Information Science and Artificial Intelligence, CISAI 2025
PublisherAssociation for Computing Machinery, Inc
Pages868-874
Number of pages7
ISBN (Electronic)9798400718748
DOIs
StatePublished - 19 Dec 2025
Event2025 8th International Conference on Computer Information Science and Artificial Intelligence, CISAI 2025 - Wuhan, China
Duration: 12 Sep 202514 Sep 2025

Publication series

NameProceedings of 2025 8th International Conference on Computer Information Science and Artificial Intelligence, CISAI 2025

Conference

Conference2025 8th International Conference on Computer Information Science and Artificial Intelligence, CISAI 2025
Country/TerritoryChina
CityWuhan
Period12/09/2514/09/25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Energy Efficiency Optimization
  • Humanoid Robots
  • Motion policy
  • Multi-Stage Deep Reinforcement Learning

Fingerprint

Dive into the research topics of 'Multi-Stage Deep Reinforcement Learning-Based Design and Optimization of Walking policy for Humanoid Robots'. Together they form a unique fingerprint.

Cite this