Skip to main navigation Skip to search Skip to main content

Domain-Invariant Feature Progressive Distillation with Adversarial Adaptive Augmentation for Low-Resource Cross-Domain NER

  • Tao Zhang
  • , Congying Xia
  • , Zhiwei Liu
  • , Shu Zhao
  • , Hao Peng*
  • , Philip Yu
  • *Corresponding author for this work
  • University of Illinois at Chicago
  • School of Computer Science and Technology, Anhui University

Research output: Contribution to journalArticlepeer-review

Abstract

Considering the expensive annotation in Named Entity Recognition (NER), Cross-domain NER enables NER in low-resource target domains with few or without labeled data, by transferring the knowledge of high-resource domains. However, the discrepancy between different domains causes the domain shift problem and hampers the performance of cross-domain NER in low-resource scenarios. In this article, we first propose an adversarial adaptive augmentation, where we integrate the adversarial strategy into a multi-task learner to augment and qualify domain adaptive data. We extract domain-invariant features of the adaptive data to bridge the cross-domain gap and alleviate the label-sparsity problem simultaneously. Therefore, another important component in this article is the progressive domain-invariant feature distillation framework. A multi-grained MMD (Maximum Mean Discrepancy) approach in the framework to extract the multi-level domain invariant features and enable knowledge transfer across domains through the adversarial adaptive data. Advanced Knowledge Distillation (KD) schema processes progressively domain adaptation through the powerful pre-trained language models and multi-level domain invariant features. Extensive comparative experiments over four English and two Chinese benchmarks show the importance of adversarial augmentation and effective adaptation from high-resource domains to low-resource target domains. Comparison with two vanilla and four latest baselines indicates the state-of-the-art performance and superiority confronted with both zero-resource and minimal-resource scenarios.

Original languageEnglish
Article number3570502
JournalACM Transactions on Asian and Low-Resource Language Information Processing
Volume22
Issue number3
DOIs
StatePublished - 14 Apr 2023

Keywords

  • Additional Key Words and PhrasesNER
  • adversarial augmentation
  • cross-domain
  • domain adaptation
  • knowledge distillation
  • low-resource

Fingerprint

Dive into the research topics of 'Domain-Invariant Feature Progressive Distillation with Adversarial Adaptive Augmentation for Low-Resource Cross-Domain NER'. Together they form a unique fingerprint.

Cite this