TY - JOUR
T1 - Diversity Regularized Latent Semantic Match for Hashing
AU - Chen, Yong
AU - Zhang, Hui
AU - Tong, Yongxin
AU - Lu, Ming
N1 - Publisher Copyright:
© 2016
PY - 2017/3/22
Y1 - 2017/3/22
N2 - Hashing based approximate nearest neighbors (ANN) search has drawn considerable attraction owing to its low-memory storage and hardware-level logical computing which is doomed to be greatly applicable to quantities of large-scale and practical scenarios, such as information retrieval, computer vision and natural language processing. However, most existing hashing methods concentrate either on images only or on pairwise image-texts (labels, short documents) and rarely utilize more common sentences. In this paper, we propose D iversity R egularized L atent S emantic M atch for H ashing (DRLSMH), a new multimodal hashing method that projects images and sentences into a shared latent semantic space with label-supervised semantic constraints to proceed on multimodal retrieval. Notably, soft orthogonality is induced as a novel regularizer to preserve diverse hashing functions for compact and accurate representations; what's more, this kind of regularization also benefits the derivations of closed-form solutions with some proper relaxations under iterative optimization framework. Extensive experiments on two public datasets demonstrate the advantages of our method over some state-of-the-art baselines under cross-modal retrieval both on image-query-image, image-query-text and text-query-image tasks.
AB - Hashing based approximate nearest neighbors (ANN) search has drawn considerable attraction owing to its low-memory storage and hardware-level logical computing which is doomed to be greatly applicable to quantities of large-scale and practical scenarios, such as information retrieval, computer vision and natural language processing. However, most existing hashing methods concentrate either on images only or on pairwise image-texts (labels, short documents) and rarely utilize more common sentences. In this paper, we propose D iversity R egularized L atent S emantic M atch for H ashing (DRLSMH), a new multimodal hashing method that projects images and sentences into a shared latent semantic space with label-supervised semantic constraints to proceed on multimodal retrieval. Notably, soft orthogonality is induced as a novel regularizer to preserve diverse hashing functions for compact and accurate representations; what's more, this kind of regularization also benefits the derivations of closed-form solutions with some proper relaxations under iterative optimization framework. Extensive experiments on two public datasets demonstrate the advantages of our method over some state-of-the-art baselines under cross-modal retrieval both on image-query-image, image-query-text and text-query-image tasks.
KW - Approximate nearest neighbors
KW - Diversity regularizations
KW - Learning to match
KW - Multimodal retrieval
KW - Representation learning
KW - Soft orthogonality
UR - https://www.scopus.com/pages/publications/85008626024
U2 - 10.1016/j.neucom.2016.11.057
DO - 10.1016/j.neucom.2016.11.057
M3 - 文章
AN - SCOPUS:85008626024
SN - 0925-2312
VL - 230
SP - 77
EP - 87
JO - Neurocomputing
JF - Neurocomputing
ER -