
Hierarchical reinforcement learning-based optimization method for crane scheduling (基于分层强化学习的天车调度优化方法)

  • Zhang Sheng Su
  • Sheng Long Jiang*
  • Gong Zhuang Peng
  • Yue Yong Liang
  • *Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Cranes are key heavy-duty material handling equipment widely used in workshops, warehouses, ports, and other industrial settings. Crane scheduling significantly affects transportation efficiency and the achievement of production goals. To address the crane scheduling problem with time windows (CSP-TW), a mixed-integer linear programming model based on spatio-temporal discretization is developed. Based on the characteristics of the model, a hierarchical reinforcement learning (HRL) decision-making framework is designed. The high-level decision network assigns transportation tasks to appropriate cranes, while the low-level network plans a path for each crane to complete its assigned task. During learning, action tabu rules are introduced to avoid ineffective actions and guide the decision networks toward the dominant policy space. An external experience pool and the dueling double deep Q-network strategy are then adopted to train the decision networks. Tests were run on the logistics simulation platform of a company's steel plant. Ablation experiments show that introducing action tabu rules improves learning efficiency. Training comparisons indicate that HRL converges better than the end-to-end framework. Comparative experiments demonstrate that HRL outperforms several methods, including multi-rule combinations, meta-heuristic algorithms, the end-to-end framework, and deep Q-network, while meeting the response-time requirements of practical applications, which are on the order of seconds.
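The abstract does not give the tabu rules or network details, so the following is only a minimal sketch of how action tabu masking might combine with a dueling double deep Q-network target. All function names, the boolean tabu mask, and the toy numbers are illustrative assumptions, not the authors' implementation.

```python
from typing import List, Sequence


def dueling_q(value: float, advantages: Sequence[float]) -> List[float]:
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean(A(s, .))."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def masked_greedy(q_values: Sequence[float], tabu: Sequence[bool]) -> int:
    """Return the highest-valued action that is not marked tabu.

    `tabu[i]` is True when action i is forbidden (hypothetical mask format).
    """
    allowed = [i for i, banned in enumerate(tabu) if not banned]
    if not allowed:
        raise ValueError("every action is tabu")
    return max(allowed, key=lambda i: q_values[i])


def double_dqn_target(
    reward: float,
    gamma: float,
    q_online_next: Sequence[float],
    q_target_next: Sequence[float],
    tabu_next: Sequence[bool],
    done: bool,
) -> float:
    """Double DQN target: the online network selects the next action,
    the target network evaluates it. Tabu actions are never selected."""
    if done:
        return reward
    best_next = masked_greedy(q_online_next, tabu_next)
    return reward + gamma * q_target_next[best_next]


# Toy example with three candidate actions; action 1 is tabu.
q_online = dueling_q(1.0, [0.5, 2.0, -1.0])
tabu_mask = [False, True, False]
chosen = masked_greedy(q_online, tabu_mask)
target = double_dqn_target(1.0, 0.9, q_online, [0.2, 0.9, 0.4], tabu_mask, False)
```

In this toy case the unmasked maximum would be action 1, but the mask forces the choice among the remaining actions. In a hierarchical setup, a routine of this kind could plausibly be applied at both levels, once over crane assignments and once over crane moves, each with its own mask.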

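Translated title of the contribution: Hierarchical reinforcement learning-based optimization method for crane scheduling
Original language: Chinese (Traditional)
Pages (from-to): 2261-2273
Number of pages: 13
Journal: Kongzhi Lilun Yu Yingyong/Control Theory and Applications
Volume: 42
Issue number: 11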
DOI
Publication status: Published - 2025

Keywords

  • action tabu
  • crane scheduling
  • hierarchical reinforcement learning
  • path planning
  • task assignment
