跳到主要导航 跳到搜索 跳到主要内容

Online cross-modal scene retrieval by binary representation and semantic graph

  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In recent years, cross-modal scene retrieval has atracted more atention. However, most existing approaches neglect the semantic relationship between objects in a scene together with the embedded spatial layouts. Moreover, these methods mostly apply the batch learning strategy, which is not suitable for processing streaming data. To address the aforementioned problems, we propose a new framework for online cross-modal scene retrieval based on binary representations and semantic graph. Specially, we adopt the cross-modal hashing based on the quantization loss of different modalities. By introducing the semantic graph, we are able to extract wealthy semantics and measure their correlation across different modalities. Further more, we propose a two-step optimization procedure based on stochastic gradient descent for online update. Experimental results on four datasets show the superiority of our approach over the state-of-the-art.

源语言英语
主期刊名MM 2017 - Proceedings of the 2017 ACM Multimedia Conference
出版商Association for Computing Machinery, Inc
744-752
页数9
ISBN(电子版)9781450349062
DOI
出版状态已出版 - 23 10月 2017
活动25th ACM International Conference on Multimedia, MM 2017 - Mountain View, 美国
期限: 23 10月 201727 10月 2017

出版系列

姓名MM 2017 - Proceedings of the 2017 ACM Multimedia Conference

会议

会议25th ACM International Conference on Multimedia, MM 2017
国家/地区美国
Mountain View
时期23/10/1727/10/17

指纹

探究 'Online cross-modal scene retrieval by binary representation and semantic graph' 的科研主题。它们共同构成独一无二的指纹。

引用此