摘要
The name disambiguation task is designed to solve the name ambiguity problem of documents of multiple persons who have the same name with one another. The task aims to partition all the publications belonging to multiple person with the same name and realize that each decomposed partition is composed of publications of a unique person. Many works on name disambiguation task have a common feature that clustering method is usually used in the last step. The paper presents a complementary study to these works from another point of view. Based on the idea that documents with strong association relationships are likely to belong to the same author, this paper proposes a method of discovering meta clusters by graph partition with a heuristic rule to improve these clustering-based works. Specially, different from these works, this work uses clustering ensemble method instead of clustering method in the last step. Experimental results on a real-life dataset show that the improved method has satisfactory performance compared with the clustering-based baseline method.
| 源语言 | 英语 |
|---|---|
| 页(从-至) | 1559-1568 |
| 页数 | 10 |
| 期刊 | Journal of Intelligent and Fuzzy Systems |
| 卷 | 38 |
| 期 | 2 Fuzzy Systems in Distributed Sensing Applications |
| DOI | |
| 出版状态 | 已出版 - 6 2月 2020 |
指纹
探究 'Name disambiguation using meta clusters and clustering ensemble' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver