跳到主要导航 跳到搜索 跳到主要内容

Discriminate Cross-modal Quantization for Efficient Retrieval

  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Efficient cross-modal retrieval involves searching similar items across different modalities, e.g., using an image(text) to search for texts(images). To speed up cross-modal retrieval, hashing-based methods threshold continuous embeddings into binary codes, inducing substantial loss of accuracy retrieval. To further improve retrieval performance, several quantization-based methods quantize embeddings into real-valued codewords to maximumlly preserve inter-modal and intra-modal similarity relation, while the discrimination between dissimilar data is ignored. To address these challenges, we propose, for the first time, a novel discriminate cross-modal quantization(DCMQ) which nonlinearly maps different modalities into a common space where ir-relevant data points are semantically separable: The points belonging to a class lie in a cluster that is not overlapped with other clusters corresponding to other classes. An effective optimization algorithm is developed for the proposed method to jointly learn the modality-specific mapping functions, the sharing codebooks, the unified binary codes and a linear classifier. Experimental comparison with state-of-the-art algorithms over three benchmark datasets demonstrates that DCMQ achieves significant improvement in search accuracy.

源语言英语
主期刊名2018 24th International Conference on Pattern Recognition, ICPR 2018
出版商Institute of Electrical and Electronics Engineers Inc.
3328-3334
页数7
ISBN(电子版)9781538637883
DOI
出版状态已出版 - 26 11月 2018
活动24th International Conference on Pattern Recognition, ICPR 2018 - Beijing, 中国
期限: 20 8月 201824 8月 2018

出版系列

姓名Proceedings - International Conference on Pattern Recognition
2018-August
ISSN(印刷版)1051-4651

会议

会议24th International Conference on Pattern Recognition, ICPR 2018
国家/地区中国
Beijing
时期20/08/1824/08/18

指纹

探究 'Discriminate Cross-modal Quantization for Efficient Retrieval' 的科研主题。它们共同构成独一无二的指纹。

引用此