TY - JOUR
T1 - Softly Associative Transfer Learning for Cross-Domain Classification
AU - Wang, Deqing
AU - Lu, Chenwei
AU - Wu, Junjie
AU - Liu, Hongfu
AU - Zhang, Wenjie
AU - Zhuang, Fuzhen
AU - Zhang, Hui
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2020/11
Y1 - 2020/11
N2 - The main challenge of cross-domain text classification is to train a classifier in a source domain while applying it to a different target domain. Many transfer learning-based algorithms, for example, dual transfer learning, triplex transfer learning, etc., have been proposed for cross-domain classification, by detecting a shared low-dimensional feature representation for both source and target domains. These methods, however, often assume that the word clusters matrix or the clusters association matrix as knowledge transferring bridges are exactly the same across different domains, which is actually unrealistic in real-world applications and, therefore, could degrade classification performance. In light of this, in this paper, we propose a softly associative transfer learning algorithm for cross-domain text classification. Specifically, we integrate two non-negative matrix tri-factorizations into a joint optimization framework, with approximate constraints on both word clusters matrices and clusters association matrices so as to allow proper diversity in knowledge transfer, and with another approximate constraint on class labels in source domains in order to handle noisy labels. An iterative algorithm is then proposed to solve the above problem, with its convergence verified theoretically and empirically. Extensive experimental results on various text datasets demonstrate the effectiveness of our algorithm, even with the presence of abundant state-of-the-art competitors.
AB - The main challenge of cross-domain text classification is to train a classifier in a source domain while applying it to a different target domain. Many transfer learning-based algorithms, for example, dual transfer learning, triplex transfer learning, etc., have been proposed for cross-domain classification, by detecting a shared low-dimensional feature representation for both source and target domains. These methods, however, often assume that the word clusters matrix or the clusters association matrix as knowledge transferring bridges are exactly the same across different domains, which is actually unrealistic in real-world applications and, therefore, could degrade classification performance. In light of this, in this paper, we propose a softly associative transfer learning algorithm for cross-domain text classification. Specifically, we integrate two non-negative matrix tri-factorizations into a joint optimization framework, with approximate constraints on both word clusters matrices and clusters association matrices so as to allow proper diversity in knowledge transfer, and with another approximate constraint on class labels in source domains in order to handle noisy labels. An iterative algorithm is then proposed to solve the above problem, with its convergence verified theoretically and empirically. Extensive experimental results on various text datasets demonstrate the effectiveness of our algorithm, even with the presence of abundant state-of-the-art competitors.
KW - Cross-domain text classification
KW - non-negative matrix tri-factorizations (NMTFs)
KW - softly associative transfer learning (sa-TL)
UR - https://www.scopus.com/pages/publications/85094931942
U2 - 10.1109/TCYB.2019.2891577
DO - 10.1109/TCYB.2019.2891577
M3 - 文章
C2 - 30703057
AN - SCOPUS:85094931942
SN - 2168-2267
VL - 50
SP - 4709
EP - 4721
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 11
M1 - 8626759
ER -