TY - GEN
T1 - LBPSC
T2 - 2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022
AU - Bi, Jing
AU - Ni, Kun
AU - Yuan, Haitao
AU - Qiao, Junfei
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Recognizing key entities on texts of water environment accurately and rapidly can not only extract important information of water environment, but also improve the water quality. In recent years, Chinese named entity recognition becomes a research focus and many methods based on neural networks have been proven effective on entity recognition. This work proposes an improved hybrid prediction model named LBPSC for Chinese named entity recognition for the water environment data, which combines Lattice structure, Bi-directional long short-term memory (BiLSTM), Positional feature encoding, Sentence self-attention and conditional random field (CRF). LBPSC employs a three-phase end-to-end methodology for Chinese named entity recognition. It first adopts a BiLSTM with lattice structure to extract both character and word features from two directions, thereby avoiding word segmentation errors. It then innovatively combines a sentence self-attention mechanism with positional feature encoding to better handle sentences and add the position information to the trained features after BiLSTM. Then, a CRF layer is adopted to decode features and finally output the predicted tag of the data. Experimental results with real-life dataset demonstrate that LBPSC outperforms other deep learning algorithms in terms of prediction accuracy.
AB - Recognizing key entities on texts of water environment accurately and rapidly can not only extract important information of water environment, but also improve the water quality. In recent years, Chinese named entity recognition becomes a research focus and many methods based on neural networks have been proven effective on entity recognition. This work proposes an improved hybrid prediction model named LBPSC for Chinese named entity recognition for the water environment data, which combines Lattice structure, Bi-directional long short-term memory (BiLSTM), Positional feature encoding, Sentence self-attention and conditional random field (CRF). LBPSC employs a three-phase end-to-end methodology for Chinese named entity recognition. It first adopts a BiLSTM with lattice structure to extract both character and word features from two directions, thereby avoiding word segmentation errors. It then innovatively combines a sentence self-attention mechanism with positional feature encoding to better handle sentences and add the position information to the trained features after BiLSTM. Then, a CRF layer is adopted to decode features and finally output the predicted tag of the data. Experimental results with real-life dataset demonstrate that LBPSC outperforms other deep learning algorithms in terms of prediction accuracy.
KW - CRF
KW - lattice LSTM
KW - Named entity recognition
KW - positional feature encoding
KW - sentence self-attention
UR - https://www.scopus.com/pages/publications/85142721729
U2 - 10.1109/SMC53654.2022.9945417
DO - 10.1109/SMC53654.2022.9945417
M3 - 会议稿件
AN - SCOPUS:85142721729
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 223
EP - 228
BT - 2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 9 October 2022 through 12 October 2022
ER -