跳到主要导航 跳到搜索 跳到主要内容

Hydraulic-Supports Alignment by TD3 with Segmented Experience Pool

  • Yi Yang*
  • , Yapeng Dai*
  • , Tian Wang
  • , Wei Qian
  • *此作品的通讯作者
  • Henan Polytechnic University
  • Henan Key Laboratory of Intelligent Detection and Control of Coal Mine Equipment

科研成果: 期刊稿件文章同行评审

摘要

Hydraulic-supports alignment is to keep the coal mining face in line and is heavily influenced by the various geological states. The experiences produced by the moving process are unbalanced, which leads to the agent not learning important knowledge from the rare samples. This paper is the first to introduce the reinforcement learning to the hydraulic-supports alignment, and establish the Markov optimal decision model by TD3 algorithm. Aiming at the imbalance issue of the experience, this paper proposes a segmented experience pool and three sampling replay mechanisms according to the characteristics of the moving process with various geological states. Experimental results show that the improved TD3, utilizing a segmented experience pool with three different replay mechanisms, could effectively identify the optimal moving policy and achieve significant convergence in cases involving both normal movement and insufficient movement of hydraulic-supports. In contrast, the TD3 performs inadequately and struggles to find the optimal policy.

源语言英语
文章编号35
期刊Neural Processing Letters
57
2
DOI
出版状态已出版 - 4月 2025

指纹

探究 'Hydraulic-Supports Alignment by TD3 with Segmented Experience Pool' 的科研主题。它们共同构成独一无二的指纹。

引用此