TY - GEN
T1 - Construction of Social Community Knowledge Graph from Wikipedia
AU - Li, Shimei
AU - Xu, Tongge
AU - Liu, Yueqi
AU - Yang, Liqun
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Knowledge graphs have become a fundamental trend in the fields of healthcare, education, finance, ICT, science, engineering, society, politics, and tourism, etc. The community knowledge graph aims to explore the composition of communities and their relationships through visual social network analysis. However, the process of unstructured data like text raises challenges for knowledge extraction. To automatically build a knowledge graph, this study applies natural language processing and technology of knowledge graph creation. This paper proposes a novel method, Augmented Word Vectors (AWV), which is a word vector representation with lexical features and part-of-speech (POS) features. This work evaluates the performance of AWV on various deep learning models, and the experimental results show that AWV is effective for improving word embedding in the task of named entity recognition. Furthermore, an unsupervised approach based on rules and dependency semantics was applied to extract relations. Specifically, this method achieves high productivity by mapping sentences as dependency trees to extract relations mediated by verbs or nouns. We build a community knowledge graph for Hong Kong political groups by applying our method to the Chinese Wikipedia, which validates the effectiveness of the proposed method.
AB - Knowledge graphs have become a fundamental trend in the fields of healthcare, education, finance, ICT, science, engineering, society, politics, and tourism, etc. The community knowledge graph aims to explore the composition of communities and their relationships through visual social network analysis. However, the process of unstructured data like text raises challenges for knowledge extraction. To automatically build a knowledge graph, this study applies natural language processing and technology of knowledge graph creation. This paper proposes a novel method, Augmented Word Vectors (AWV), which is a word vector representation with lexical features and part-of-speech (POS) features. This work evaluates the performance of AWV on various deep learning models, and the experimental results show that AWV is effective for improving word embedding in the task of named entity recognition. Furthermore, an unsupervised approach based on rules and dependency semantics was applied to extract relations. Specifically, this method achieves high productivity by mapping sentences as dependency trees to extract relations mediated by verbs or nouns. We build a community knowledge graph for Hong Kong political groups by applying our method to the Chinese Wikipedia, which validates the effectiveness of the proposed method.
KW - knowledge graph
KW - named entity recognition
KW - social network analysis
KW - word embedding
UR - https://www.scopus.com/pages/publications/85152234722
U2 - 10.1109/HPCC-DSS-SmartCity-DependSys57074.2022.00341
DO - 10.1109/HPCC-DSS-SmartCity-DependSys57074.2022.00341
M3 - 会议稿件
AN - SCOPUS:85152234722
T3 - Proceedings - 24th IEEE International Conference on High Performance Computing and Communications, 8th IEEE International Conference on Data Science and Systems, 20th IEEE International Conference on Smart City and 8th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022
SP - 2308
EP - 2315
BT - Proceedings - 24th IEEE International Conference on High Performance Computing and Communications, 8th IEEE International Conference on Data Science and Systems, 20th IEEE International Conference on Smart City and 8th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th IEEE International Conference on High Performance Computing and Communications, 8th IEEE International Conference on Data Science and Systems, 20th IEEE International Conference on Smart City and 8th IEEE International Conference on Dependability in Sensor, Cloud and Big Data Systems and Application, HPCC/DSS/SmartCity/DependSys 2022
Y2 - 18 December 2022 through 20 December 2022
ER -