Triplex transfer learning: Exploiting both shared and distinct concepts for text classification

  • Fuzhen Zhuang*
  • , Ping Luo
  • , Changying Du
  • , Qing He
  • , Zhongzhi Shi
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Transfer learning focuses on the learning scenarios when the test data from target domains and the training data from source domains are drawn from similar but different data distribution with respect to the raw features. Some recent studies argued that the high-level concepts (e.g. word clusters) can help model the data distribution difference, and thus are more appropriate for classification. Specifically, these methods assume that all the data domains have the same set of shared concepts, which are used as the bridge for knowledge transfer. However, besides these shared concepts each domain may have its own distinct concepts. To address this point, we propose a general transfer learning framework based on non-negative matrix tri-factorization which allows to explore both shared and distinct concepts among all the domains simultaneously. Since this model provides more flexibility in fitting the data it may lead to better classification accuracy. To solve the proposed optimization problem we develop an iterative algorithm and also theoretically analyze its convergence. Finally, extensive experiments show the significant superiority of our model over the baseline methods. In particular, we show that our method works much better in the more challenging tasks when distinct concepts may exist.

Original languageEnglish
Title of host publicationWSDM 2013 - Proceedings of the 6th ACM International Conference on Web Search and Data Mining
Pages425-434
Number of pages10
DOIs
StatePublished - 2013
Externally publishedYes
Event6th ACM International Conference on Web Search and Data Mining, WSDM 2013 - Rome, Italy
Duration: 4 Feb 20138 Feb 2013

Publication series

NameWSDM 2013 - Proceedings of the 6th ACM International Conference on Web Search and Data Mining

Conference

Conference6th ACM International Conference on Web Search and Data Mining, WSDM 2013
Country/TerritoryItaly
CityRome
Period4/02/138/02/13

Keywords

  • common concept
  • distinct concept
  • distribution mismatch
  • non-negative matrix tri-factorization
  • triplex transfer learning

Fingerprint

Dive into the research topics of 'Triplex transfer learning: Exploiting both shared and distinct concepts for text classification'. Together they form a unique fingerprint.

Cite this