跳到主要导航 跳到搜索 跳到主要内容

Augmenting image descriptions using structured prediction output

  • Yahong Han*
  • , Xingxing Wei
  • , Xiaochun Cao
  • , Yi Yang
  • , Xiaofang Zhou
  • *此作品的通讯作者
  • Tianjin University
  • CAS - Institute of Information Engineering
  • University of Queensland

科研成果: 期刊稿件文章同行评审

摘要

The need for richer descriptions of images arises in a wide spectrum of applications ranging from image understanding to image retrieval. While the Automatic Image Annotation (AIA) has been extensively studied, image descriptions with the output labels lack sufficient information. This paper proposes to augment image descriptions using structured prediction output. We define a hierarchical tree-structured semantic unit to describe images, from which we can obtain not only the class and subclass one image belongs to, but also the attributes one image has. After defining a new feature map function of structured SVM, we decompose the loss function into every node of the hierarchical tree-structured semantic unit and then predict the tree-structured semantic unit for testing images. In the experiments, we evaluate the performance of the proposed method on two open benchmark datasets and compare with the state-of-the-art methods. Experimental results show the better prediction performance of the proposed method and demonstrate the strength of augmenting image descriptions.

源语言英语
文章编号6810013
页(从-至)1665-1676
页数12
期刊IEEE Transactions on Multimedia
16
6
DOI
出版状态已出版 - 1 10月 2014
已对外发布

指纹

探究 'Augmenting image descriptions using structured prediction output' 的科研主题。它们共同构成独一无二的指纹。

引用此