Skip to main navigation Skip to search Skip to main content

Hierarchical Lexicon Embedding Architecture for Chinese Named Entity Recognition

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Named entity recognition (NER) is one of the most fundamental tasks in a variety of natural language applications. Due to the lack of delimiters in the Chinese language, Chinese NER task has been suffering from the shortage of word boundary information. Recently, incorporating word information has been proven an effective mechanism to alleviate this problem. However, how to integrate word information into the character-based model more effectively and efficiently is still a challenge. In this work, we propose a hierarchical lexicon embedding architecture for Chinese NER task. The words matched by the input sentence are divided into two categories, i.e., main words and auxiliary words, to help the model better capture useful information. In addition, the modification mainly lies in the embedding layer, as such it can be easily incorporated with different sequence modeling architectures. Experimental studies on four Chinese NER datasets have shown our method’s promising potential.

Original languageEnglish
Title of host publicationArtificial Neural Networks and Machine Learning – ICANN 2021 - 30th International Conference on Artificial Neural Networks, Proceedings
EditorsIgor Farkaš, Paolo Masulli, Sebastian Otte, Stefan Wermter
PublisherSpringer Science and Business Media Deutschland GmbH
Pages345-356
Number of pages12
ISBN (Print)9783030863821
DOIs
StatePublished - 2021
Event30th International Conference on Artificial Neural Networks, ICANN 2021 - Virtual, Online, Slovakia
Duration: 14 Sep 202117 Sep 2021

Publication series

NameLecture Notes in Computer Science
Volume12895 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference30th International Conference on Artificial Neural Networks, ICANN 2021
Country/TerritorySlovakia
CityVirtual, Online
Period14/09/2117/09/21

Keywords

  • Boundary information
  • Chinese named entity recognition
  • Lexicon

Fingerprint

Dive into the research topics of 'Hierarchical Lexicon Embedding Architecture for Chinese Named Entity Recognition'. Together they form a unique fingerprint.

Cite this