TY - JOUR
T1 - China’s policy similarity evaluation using LDA model
T2 - An experimental analysis in Hebei province
AU - Zhang, Junhuan
AU - Gui, Wanbing
AU - Wen, Jiaqi
N1 - Publisher Copyright:
© The Author(s) 2022.
PY - 2024/4
Y1 - 2024/4
N2 - This article proposes a combination model, which is composed of latent Dirichlet allocation model, TF-IDF feature extraction algorithm and Euclidean distance measurement method, to identify and judge whether the similarities between multiple policy texts exist or not. With the help of actual data result, this will drive the relevant government agencies to figure out problems in a timely manner and provide a decision-making basis for them to formulate and optimise appropriate economic policies. To this end, this article analyses and studies the four types of economic texts that are classified as Insurance, Banking, Tax and Finance from the Central Government of Hebei province and Shijiazhuang city levels. Also, we consider Beijing, Shanghai and Guangdong. Experimental results show that (1) the combination model can quickly and effectively recognise and determine whether there are similarities between multiple economic policy texts; (2) similarities exist or not between the central, provincial and municipal level policy texts depending on the comparison of the distance values across them; (3) the smaller the distance value between economic policy texts of the same kind, the higher the similarity in them; and (4) the distance values between the six policy texts in Finance, Insurance, Bank and Tax categories are ranked from low to high. In terms of similarity, the Finance category is the highest, followed by Insurance and Bank, and the Tax category is the lowest.
AB - This article proposes a combination model, which is composed of latent Dirichlet allocation model, TF-IDF feature extraction algorithm and Euclidean distance measurement method, to identify and judge whether the similarities between multiple policy texts exist or not. With the help of actual data result, this will drive the relevant government agencies to figure out problems in a timely manner and provide a decision-making basis for them to formulate and optimise appropriate economic policies. To this end, this article analyses and studies the four types of economic texts that are classified as Insurance, Banking, Tax and Finance from the Central Government of Hebei province and Shijiazhuang city levels. Also, we consider Beijing, Shanghai and Guangdong. Experimental results show that (1) the combination model can quickly and effectively recognise and determine whether there are similarities between multiple economic policy texts; (2) similarities exist or not between the central, provincial and municipal level policy texts depending on the comparison of the distance values across them; (3) the smaller the distance value between economic policy texts of the same kind, the higher the similarity in them; and (4) the distance values between the six policy texts in Finance, Insurance, Bank and Tax categories are ranked from low to high. In terms of similarity, the Finance category is the highest, followed by Insurance and Bank, and the Tax category is the lowest.
KW - Economic policies
KW - Euclidean distance
KW - LDA model
KW - information management
KW - similarity
UR - https://www.scopus.com/pages/publications/85134063883
U2 - 10.1177/01655515221097858
DO - 10.1177/01655515221097858
M3 - 文章
AN - SCOPUS:85134063883
SN - 0165-5515
VL - 50
SP - 515
EP - 530
JO - Journal of Information Science
JF - Journal of Information Science
IS - 2
ER -