跳到主要导航 跳到搜索 跳到主要内容

Predator-An experience guided configuration optimizer for Hadoop MapReduce

  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

MapReduce is a distributed computing programming framework which provides an effective solution to the data processing challenge. As an open-source implementation of MapReduce, Hadoop has been widely used in practice. The performance of Hadoop MapReduce heavily depends on its configuration settings, so tuning these configuration parameters could be an effective way to improve its performance. However, picking out the optimal configuration settings is not easy for the time consuming nature of MapReduce together with the high dimensional and nonlinear features of its configuration optimization. In this paper, we introduce Predator, an experience guided configuration optimizer, which does not treat the optimization problem as a pure black-box problem but utilizes useful experience learnt from Hadoop MapReduce configuration practice to assist the optimizing process. The optimizer uses job execution time estimated by a practical MapReduce cost model as the objective function, and classifies Hadoop MapReduce parameters into different groups by their different tunable levels to shrink search space. Furthermore, the optimization algorithm of the optimizer uses the idea of subspace division to prevent local optimum problem, and it could also reduce the searching time by cutting down the cost in visiting unpromising points in search space. Experiments on Hadoop clusters demonstrate the effectiveness and efficiency of the optimizer.

源语言英语
主期刊名CloudCom 2012 - Proceedings
主期刊副标题2012 4th IEEE International Conference on Cloud Computing Technology and Science
出版商IEEE Computer Society
419-426
页数8
ISBN(印刷版)9781467345095
DOI
出版状态已出版 - 2012
活动4th IEEE International Conference on Cloud Computing Technology and Science, CloudCom 2012 - Taipei, 中国台湾
期限: 3 12月 20126 12月 2012

出版系列

姓名CloudCom 2012 - Proceedings: 2012 4th IEEE International Conference on Cloud Computing Technology and Science

会议

会议4th IEEE International Conference on Cloud Computing Technology and Science, CloudCom 2012
国家/地区中国台湾
Taipei
时期3/12/126/12/12

指纹

探究 'Predator-An experience guided configuration optimizer for Hadoop MapReduce' 的科研主题。它们共同构成独一无二的指纹。

引用此