跳到主要导航 跳到搜索 跳到主要内容

ROSE: Cluster resource scheduling via speculative over-subscription

  • Beihang University
  • University of Leeds
  • Lancaster University
  • Alibaba Group Holding Ltd.

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

A long-standing challenge in cluster scheduling is to achieve a high degree of utilization of heterogeneous resources in a cluster. In practice there exists a substantial disparity between perceived and actual resource utilization. A scheduler might regard a cluster as fully utilized if a large resource request queue is present, but the actual resource utilization of the cluster can be in fact very low. This disparity results in the formation of idle resources, leading to inefficient resource usage and incurring high operational costs and an inability to provision services. In this paper we present a new cluster scheduling system, ROSE, that is based on a multi-layered scheduling architecture with an ability to over-subscribe idle resources to accommodate unfulfilled resource requests. ROSE books idle resources in a speculative manner: instead of waiting for resource allocation to be confirmed by the centralized scheduler, it requests intelligently to launch tasks within machines according to their suitability to oversubscribe resources. A threshold control with timely task rescheduling ensures fully-utilized cluster resources without generating potential task stragglers. Experimental results show that ROSE can almost double the average CPU utilization, from 36.37% to 65.10%, compared with a centralized scheduling scheme, and reduce the workload makespan by 30.11%, with an 8.23% disk utilization improvement over other scheduling strategies.

源语言英语
主期刊名Proceedings - 2018 IEEE 38th International Conference on Distributed Computing Systems, ICDCS 2018
出版商Institute of Electrical and Electronics Engineers Inc.
949-960
页数12
ISBN(电子版)9781538668719
DOI
出版状态已出版 - 19 7月 2018
活动38th IEEE International Conference on Distributed Computing Systems, ICDCS 2018 - Vienna, 奥地利
期限: 2 7月 20185 7月 2018

出版系列

姓名Proceedings - International Conference on Distributed Computing Systems
2018-July

会议

会议38th IEEE International Conference on Distributed Computing Systems, ICDCS 2018
国家/地区奥地利
Vienna
时期2/07/185/07/18

指纹

探究 'ROSE: Cluster resource scheduling via speculative over-subscription' 的科研主题。它们共同构成独一无二的指纹。

引用此