
Semantic Modeling of Textual Relationships in Cross-modal Retrieval

  • Jing Yu
  • Chenghao Yang
  • Zengchang Qin*
  • Zhuoqian Yang
  • Yue Hu
  • Zhiguo Shi
  • *Corresponding author for this work
  • CAS - Institute of Information Engineering
  • Beihang University
  • University of Science and Technology Beijing

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

Feature modeling of different modalities is a basic problem in current research on cross-modal information retrieval. Existing models typically project texts and images into one embedding space, in which semantically similar information has a shorter distance. Semantic modeling of textual relationships is notoriously difficult. In this paper, we propose an approach to model texts using a featured graph by integrating multi-view textual relationships, including semantic relationships, statistical co-occurrence, and prior relationships in a knowledge base. A dual-path neural network is adopted to jointly learn multi-modal representations of information and a cross-modal similarity measure. We use a Graph Convolutional Network (GCN) for generating relation-aware text representations, and a Convolutional Neural Network (CNN) with non-linearities for image representations. The cross-modal similarity measure is learned by distance metric learning. Experimental results show that, by leveraging the rich relational semantics in texts, our model can outperform the state-of-the-art models by 3.4% and 6.3% in accuracy on two benchmark datasets.
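To make the text path described in the abstract more concrete, here is a minimal sketch in plain Python of one graph convolution over a small word graph, followed by a similarity score against an image vector. It is not the authors' implementation. The symmetric normalization follows the commonly used GCN formulation, and the mean pooling, the cosine similarity (standing in for the learned distance metric), the toy graph, the weights, and all function names are assumptions made for illustration only.

```python
import math

def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2} for a square adjacency matrix."""
    n = len(adj)
    # Add self-loops so each node keeps its own features.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def gcn_layer(adj_norm, features, weight):
    """One graph convolution: ReLU(adj_norm @ features @ weight)."""
    out = matmul(matmul(adj_norm, features), weight)
    return [[max(0.0, v) for v in row] for row in out]

def mean_pool(node_features):
    """Average node vectors into one text vector (assumed pooling)."""
    n = len(node_features)
    return [sum(col) / n for col in zip(*node_features)]

def cosine_similarity(u, v):
    """Cosine similarity; a stand-in for the learned similarity measure."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

# Toy word graph: three word nodes, edges 0-1 and 1-2 (hypothetical).
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
features = [[1.0, 0.0, 0.0],
            [0.0, 1.0, 0.0],
            [0.0, 0.0, 1.0]]          # one-hot node features
weight = [[0.5, -0.2],
          [0.1, 0.3],
          [-0.4, 0.6]]                # hypothetical fixed weights, not trained

adj_norm = normalize_adjacency(adj)
text_vec = mean_pool(gcn_layer(adj_norm, features, weight))
image_vec = [0.2, 0.4]                # stands in for a CNN image embedding
score = cosine_similarity(text_vec, image_vec)
```

In a trained dual-path model the weights and the similarity function would be learned jointly from matched text-image pairs; here they are fixed only to show the data flow.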

Original language: English
Title of host publication: Knowledge Science, Engineering and Management - 12th International Conference, KSEM 2019, Proceedings
Editors: Christos Douligeris, Dimitris Apostolou, Dimitris Karagiannis
Publisher: Springer
Pages: 24-32
Number of pages: 9
ISBN (Print): 9783030295509
DOI
Publication status: Published - 2019
Event: 12th International Conference on Knowledge Science, Engineering and Management, KSEM 2019 - Athens, Greece
Duration: 28 Aug 2019 to 30 Aug 2019

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 11775 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 12th International Conference on Knowledge Science, Engineering and Management, KSEM 2019
Country/Territory: Greece
City: Athens
Period: 28/08/19 to 30/08/19
