TY - JOUR
T1 - Automatic blockchain whitepapers analysis via heterogeneous graph neural network
AU - Liu, Lin
AU - Tsai, Wei Tek
AU - Bhuiyan, Md Zakirul Alam
AU - Yang, Dong
N1 - Publisher Copyright:
© 2020 Elsevier Inc.
PY - 2020/11
Y1 - 2020/11
N2 - The blockchain whitepaper contains detailed technical and business information, so its analysis is important for blockchain text mining. Previous works focus on analyze homogeneous objects and relations. The main problem, however, is these works do not take into account the heterogeneity of information. This paper presents a new methodology for whitepapers analysis by designing heterogeneous graph neural network, named S-HGNN. In detail, this paper first builds a Heterogeneous Information Network (HIN) using heterogeneous objects and relationships extracted from the whitepaper to obtain similarity measures, then uses Graph Convolutional Network (GCN) and Graph Attention Network (GAT) to integrate both structural information and internal semantic into the whitepaper embedding. Compared with the previous models, this model improves 0.96%∼33.34% in terms of F1-score for classification task, and 4.94%∼14.14% in terms of purity for clustering task, and gets stable results on different tasks. The results show the effectiveness and robustness of this model for whitepapers analysis.
AB - The blockchain whitepaper contains detailed technical and business information, so its analysis is important for blockchain text mining. Previous works focus on analyze homogeneous objects and relations. The main problem, however, is these works do not take into account the heterogeneity of information. This paper presents a new methodology for whitepapers analysis by designing heterogeneous graph neural network, named S-HGNN. In detail, this paper first builds a Heterogeneous Information Network (HIN) using heterogeneous objects and relationships extracted from the whitepaper to obtain similarity measures, then uses Graph Convolutional Network (GCN) and Graph Attention Network (GAT) to integrate both structural information and internal semantic into the whitepaper embedding. Compared with the previous models, this model improves 0.96%∼33.34% in terms of F1-score for classification task, and 4.94%∼14.14% in terms of purity for clustering task, and gets stable results on different tasks. The results show the effectiveness and robustness of this model for whitepapers analysis.
KW - Blockchain
KW - Classification
KW - Clustering
KW - Heterogeneous graph neural network
KW - Heterogeneous information networks
UR - https://www.scopus.com/pages/publications/85086720033
U2 - 10.1016/j.jpdc.2020.05.014
DO - 10.1016/j.jpdc.2020.05.014
M3 - 文章
AN - SCOPUS:85086720033
SN - 0743-7315
VL - 145
SP - 1
EP - 12
JO - Journal of Parallel and Distributed Computing
JF - Journal of Parallel and Distributed Computing
ER -