跳到主要导航 跳到搜索 跳到主要内容

PMS: An effective approximation approach for distributed large-scale graph data processing and mining

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recently, large-scale graph data processing and mining has drawn great attention, and many distributed graph processing systems [6-8, 10, 11, 13, 14, 17, 20] have been proposed. However, large-scale graph processing remains a challenging problem. Because the computation time in some cases is still unacceptable especially when the time is limited. As illustrated in Table 1, nearly three hours are needed when running Single-Source Shortest Path algorithm on the USA-road dataset 1 using performant open-source distributed graph processing systems. In this paper, we propose an effective priority-based message sampling (PMS) approach to further improve the performance of distributed graph processing at the cost of some accuracy loss. Noticing that the passing and processing of messages dominates the computation time, our approach works by eliminating those less useful messages directly without passing them which can effectively reduce the computation overhead.We implement our approach basing on Apache Giraph [6], a popular open-source implementation of Google's Pregel [14] and report the primary results of our prototype system. The experimental results show that our approach can achieve reasonable accuracy with much less computation time.

源语言英语
主期刊名CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management
出版商Association for Computing Machinery
1999-2002
页数4
ISBN(电子版)9781450349185
DOI
出版状态已出版 - 6 11月 2017
活动26th ACM International Conference on Information and Knowledge Management, CIKM 2017 - Singapore, 新加坡
期限: 6 11月 201710 11月 2017

出版系列

姓名International Conference on Information and Knowledge Management, Proceedings
Part F131841

会议

会议26th ACM International Conference on Information and Knowledge Management, CIKM 2017
国家/地区新加坡
Singapore
时期6/11/1710/11/17

指纹

探究 'PMS: An effective approximation approach for distributed large-scale graph data processing and mining' 的科研主题。它们共同构成独一无二的指纹。

引用此