跳到主要导航 跳到搜索 跳到主要内容

Discriminative Feature Adaptation via Conditional Mean Discrepancy for Cross-Domain Text Classification

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper concerns the problem of Unsupervised Domain Adaptation (UDA) in text classification, aiming to transfer the knowledge from a source domain to a different but related target domain. Previous methods learn the discriminative feature of target domain in terms of noisy pseudo labels, which inevitably produces negative effects on training a robust model. In this paper, we propose a novel criterion Conditional Mean Discrepancy (CMD) to learn the discriminative features by matching the conditional distributions across domains. CMD embeds both the conditional distributions of source and target domains into tensor-product Hilbert space and computes Hilbert-Schmidt norm instead. We shed a new light on discriminative feature adaptation: the collective knowledge of discriminative features of different domains is naturally discovered by minimizing CMD. We propose Aligned Adaptation Networks (AAN) to learn the domain-invariant and discriminative features simultaneously based on Maximum Mean Discrepancy (MMD) and CMD. Meanwhile, to trade off between the marginal and conditional distributions, we further maximize both MMD and CMD criterions using adversarial strategy to make the features of AAN more discrepancy-invariant. To the best of our knowledge, this is the first work to definitely evaluate the shifts in the conditional distributions across domains. Experiments on cross-domain text classification demonstrate that AAN achieves better classification accuracy but less convergence time compared to the state-of-the-art deep methods.

源语言英语
主期刊名Database Systems for Advanced Applications - 26th International Conference, DASFAA 2021, Proceedings
编辑Christian S. Jensen, Ee-Peng Lim, De-Nian Yang, Chia-Hui Chang, Jianliang Xu, Wen-Chih Peng, Jen-Wei Huang, Chih-Ya Shen
出版商Springer Science and Business Media Deutschland GmbH
104-119
页数16
ISBN(印刷版)9783030731960
DOI
出版状态已出版 - 2021
活动26th International Conference on Database Systems for Advanced Applications, DASFAA 2021 - Taipei, 中国台湾
期限: 11 4月 202114 4月 2021

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12682 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议26th International Conference on Database Systems for Advanced Applications, DASFAA 2021
国家/地区中国台湾
Taipei
时期11/04/2114/04/21

指纹

探究 'Discriminative Feature Adaptation via Conditional Mean Discrepancy for Cross-Domain Text Classification' 的科研主题。它们共同构成独一无二的指纹。

引用此