TY - GEN
T1 - Transfer learning by linking similar feature clusters for sentiment classification
AU - Zhang, Wenjie
AU - Zhang, Hui
AU - Wang, Deqing
AU - Liu, Rui
AU - Zhang, He
AU - Jiang, Xianlin
AU - Chen, Yong
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2017/1/11
Y1 - 2017/1/11
N2 - Transfer learning aims to extract knowledge from a label-rich source domain to improve the predictive model of a target domain. Previous methods transfer knowledge by detecting a low-dimensional feature representation shared by the source and target domains. Along this line, many algorithms, e.g., dual transfer learning (DTL) and triplex transfer learning (TRi-TL), have been proposed and widely used for text classification. However, we argue that it is difficult for existing algorithms to identify exactly the common or identical concepts across domains, since the source and target domains are related but different. We therefore propose to use similar feature clusters as the vehicle for knowledge transfer; that is, we require only that common word clusters be approximately similar across domains, rather than exactly the same. Under this assumption, the derived association matrices between word clusters and document classes should also differ slightly to account for the variations in word clusters. To take these assumptions into account, we propose a novel Nonnegative Matrix Tri-Factorization based transfer learning method that links similar feature clusters (LSF-TL) for sentiment classification, in which an approximate constraint between the matrices of similar word clusters allows differences while preserving knowledge transfer. LSF-TL also applies the same approximate constraint to the derived cluster association matrices. We then employ an iterative updating algorithm, with a sound theoretical proof, to find a locally optimal solution. Finally, we evaluate our method through extensive experiments on Amazon product reviews. The results show that our approach achieves better classification accuracy than state-of-the-art methods on both cross-lingual sentiment classification (CLSC) and cross-lingual cross-domain sentiment classification (CLCDSC) tasks.
KW - Domain Adaptation
KW - Non-negative Matrix Tri-factorization
KW - Sentiment Classification
KW - Similar Concept
KW - Transfer Learning
UR - https://www.scopus.com/pages/publications/85013647008
U2 - 10.1109/ICTAI.2016.0154
DO - 10.1109/ICTAI.2016.0154
M3 - Conference contribution
AN - SCOPUS:85013647008
T3 - Proceedings - 2016 IEEE 28th International Conference on Tools with Artificial Intelligence, ICTAI 2016
SP - 1019
EP - 1026
BT - Proceedings - 2016 IEEE 28th International Conference on Tools with Artificial Intelligence, ICTAI 2016
A2 - Esposito, Anna
A2 - Alamaniotis, Miltos
A2 - Mali, Amol
A2 - Bourbakis, Nikolaos
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 28th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2016
Y2 - 6 November 2016 through 8 November 2016
ER -