TY - GEN
T1 - Microgroup mining on TSina via network structure and user attribute
AU - Xiong, Xiaobing
AU - Niu, Xiang
AU - Zhou, Gang
AU - Xu, Ke
AU - Huang, Yongzhong
PY - 2011
Y1 - 2011
N2 - In this paper, we focus on the problem of community detection on TSina: the most popular microblogging network in China. By characterizing the structure and content of microgroup (community) on TSina in detail, we reveal that different from ordinary social networks, the degree assortativity coefficients are negative on most microgroups. In addition, we find that users from the same microgroup likely exhibit some similar attributes (e.g., sharing many followers, tags and topics). Inspired by these new findings, we propose a united method for microgroup detection without losing the information of link structure and user attribute. First, the link direction is converted to the weight by giving higher value to the more surprising link, while attribute similarity between two users is measured by the Jaccard coefficient of common features like followers, tags, and topics. Then, above two factors are uniformly converted to the edge weight of a newly generated network. Finally, many frequently used community detection algorithms that support weighted network would be employed. Extensive experiments on real social networks show that the factors of link structure and user attribute play almost equally important roles in microgroup detection on TSina. Our newly proposed method significantly outperforms the traditional methods with average accuracy being improved by 25%, and the number of unrecognized users decreasing by about 75%.
AB - In this paper, we focus on the problem of community detection on TSina: the most popular microblogging network in China. By characterizing the structure and content of microgroup (community) on TSina in detail, we reveal that different from ordinary social networks, the degree assortativity coefficients are negative on most microgroups. In addition, we find that users from the same microgroup likely exhibit some similar attributes (e.g., sharing many followers, tags and topics). Inspired by these new findings, we propose a united method for microgroup detection without losing the information of link structure and user attribute. First, the link direction is converted to the weight by giving higher value to the more surprising link, while attribute similarity between two users is measured by the Jaccard coefficient of common features like followers, tags, and topics. Then, above two factors are uniformly converted to the edge weight of a newly generated network. Finally, many frequently used community detection algorithms that support weighted network would be employed. Extensive experiments on real social networks show that the factors of link structure and user attribute play almost equally important roles in microgroup detection on TSina. Our newly proposed method significantly outperforms the traditional methods with average accuracy being improved by 25%, and the number of unrecognized users decreasing by about 75%.
KW - Community Detection
KW - Microblogging
KW - Microgroup Mining
KW - United Method
UR - https://www.scopus.com/pages/publications/84255160758
U2 - 10.1007/978-3-642-25856-5_11
DO - 10.1007/978-3-642-25856-5_11
M3 - 会议稿件
AN - SCOPUS:84255160758
SN - 9783642258558
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 138
EP - 151
BT - Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings
T2 - 7th International Conference on Advanced Data Mining and Applications, ADMA 2011
Y2 - 17 December 2011 through 19 December 2011
ER -