跳到主要导航 跳到搜索 跳到主要内容

CrowdTC: Crowdsourced taxonomy construction

  • Hong Kong University of Science and Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recently, taxonomy has attracted much attention. Both automatic construction solutions and human-based computation approaches have been proposed. The automatic methods suffer from the problem of either low precision or low recall and human computation, on the other hand, is not suitable for large scale tasks. Motivated by the shortcomings of both approaches, we present a hybrid framework, which combines the power of machine-based approaches and human computation (the crowd) to construct a more complete and accurate taxonomy. Specifically, our framework consists of two steps: we first construct a complete but noisy taxonomy automatically, then crowd is introducedto adjust the entity positions in the constructed taxonomy. However, the adjustment is challenging as the budget (money) for asking the crowd is often limited. In our work, we formulatethe problem of finding the optimal adjustment as an entityselection optimization (ESO) problem, which is proved to beNP-hard. We then propose an exact algorithm and a moreefficient approximation algorithm with an approximation ratioof 1/2(1-1/e). We conduct extensive experiments on real datasets, the results show that our hybrid approach largely improves the recall of the taxonomy with little impairment for precision.

源语言英语
主期刊名Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015
编辑Charu Aggarwal, Zhi-Hua Zhou, Alexander Tuzhilin, Hui Xiong, Xindong Wu
出版商Institute of Electrical and Electronics Engineers Inc.
913-918
页数6
ISBN(电子版)9781467395038
DOI
出版状态已出版 - 5 1月 2016
活动15th IEEE International Conference on Data Mining, ICDM 2015 - Atlantic City, 美国
期限: 14 11月 201517 11月 2015

出版系列

姓名Proceedings - IEEE International Conference on Data Mining, ICDM
2016-January
ISSN(印刷版)1550-4786

会议

会议15th IEEE International Conference on Data Mining, ICDM 2015
国家/地区美国
Atlantic City
时期14/11/1517/11/15

指纹

探究 'CrowdTC: Crowdsourced taxonomy construction' 的科研主题。它们共同构成独一无二的指纹。

引用此