跳到主要导航 跳到搜索 跳到主要内容

Landmark image retrieval by jointing feature refinement and multimodal classifier learning

  • Nanjing University of Aeronautics and Astronautics

科研成果: 期刊稿件文章同行评审

摘要

Landmark retrieval is to return a set of images with their landmarks similar to those of the query images. Existing studies on landmark retrieval focus on exploiting the geometries of landmarks for visual similarity matches. However, the visual content of social images is of large diversity in many landmarks, and also some images share common patterns over different landmarks. On the other side, it has been observed that social images usually contain multimodal contents, i.e., visual content and text tags, and each landmark has the unique characteristic of both visual content and text content. Therefore, the approaches based on similarity matching may not be effective in this environment. In this paper, we investigate whether the geographical correlation among the visual content and the text content could be exploited for landmark retrieval. In particular, we propose an effective multimodal landmark classification paradigm to leverage the multimodal contents of social image for landmark retrieval, which integrates feature refinement and landmark classifier with multimodal contents by a joint model. The geo-Tagged images are automatically labeled for classifier learning. Visual features are refined based on low rank matrix recovery, and multimodal classification combined with group sparse is learned from the automatically labeled images. Finally, candidate images are ranked by combining classification result and semantic consistence measuring between the visual content and text content. Experiments on real-world datasets demonstrate the superiority of the proposed approach as compared to existing methods.

源语言英语
页(从-至)1682-1695
页数14
期刊IEEE Transactions on Cybernetics
48
6
DOI
出版状态已出版 - 6月 2018

指纹

探究 'Landmark image retrieval by jointing feature refinement and multimodal classifier learning' 的科研主题。它们共同构成独一无二的指纹。

引用此