跳到主要导航 跳到搜索 跳到主要内容

Multimodal Contrastive Transformer for Explainable Recommendation

  • Beihang University
  • Ludwig Maximilian University of Munich

科研成果: 期刊稿件文章同行评审

摘要

Explanations play an essential role in helping users evaluate results from recommender systems. Various natural language generation methods have been proposed to generate explanations for the recommendation. However, they usually suffer from two problems. First, since user-provided review text contains noisy data, the generated explanations may be irrelevant to the recommended items. Second, as lacking some supervision signals, most of the generated sentences are similar, which cannot meet the diversity and personalized needs of users. To tackle these problems, we propose a multimodal contrastive transformer (MMCT) model for an explainable recommendation, which incorporates multimodal information into the learning process, including sentiment features, item features, item images, and refined user reviews. Meanwhile, we propose a dynamic fusion mechanism during the decoding stage, which generates supervision signals to guide the explanation generation. Additionally, we develop a contrastive objective to generate diverse explainable texts. Comprehensive experiments on two real-world datasets show that the proposed model outperforms comparable explainable recommendation baselines in terms of explanation performance and recommendation performance. Efficiency analysis and robustness analysis verify the advantages of the proposed model. While ablation analysis establishes the relative contributions of the respective components and various modalities, the case study shows the working of our model from an intuitive sense.

源语言英语
页(从-至)2632-2643
页数12
期刊IEEE Transactions on Computational Social Systems
11
2
DOI
出版状态已出版 - 1 4月 2024

指纹

探究 'Multimodal Contrastive Transformer for Explainable Recommendation' 的科研主题。它们共同构成独一无二的指纹。

引用此