跳到主要导航 跳到搜索 跳到主要内容

Solving job scheduling problems in a resource preemption environment with multi-agent reinforcement learning

  • Xiaohan Wang
  • , Lin Zhang*
  • , Tingyu Lin
  • , Chun Zhao
  • , Kunyu Wang
  • , Zhen Chen
  • *此作品的通讯作者
  • Beihang University
  • CAS - Institute of Electronics
  • Beijing Information Science & Technology University

科研成果: 期刊稿件文章同行评审

摘要

In smart manufacturing, robots gradually replace traditional machines as new processing units, which have significantly liberated laborers and reduced manufacturing expenditure. However, manufacturing resources are usually limited so that the preemption relationship exists among robots. Under this circumstance, job scheduling puts forward higher requirements on accuracy and generalization. To this end, this paper proposes a scheduling algorithm to solve job scheduling problems in a resource preemption environment with multi-agent reinforcement learning. The resource preemption environment is modeled as a decentralized partially observable Markov decision process, where each job is regarded as an intelligent agent that chooses an available robot according to its current partial observation. Based on this modeling, a multi-agent scheduling architecture is constructed to handle the high-dimension action space issue caused by multi-task simultaneous scheduling. Besides, multi-agent reinforcement learning is employed to learn both the decision-making policy of each agent and the cooperation between job agents. This paper is novel in addressing the scheduling problem in a resource preemption environment and solving the job shop scheduling problem with multi-agent reinforcement learning. The experiments of the case study indicate that our proposed method outperforms the traditional rule-based methods and the distributed-agent reinforcement learning method in total makespan, training stability, and model generalization.

源语言英语
文章编号102324
期刊Robotics and Computer-Integrated Manufacturing
77
DOI
出版状态已出版 - 10月 2022

指纹

探究 'Solving job scheduling problems in a resource preemption environment with multi-agent reinforcement learning' 的科研主题。它们共同构成独一无二的指纹。

引用此