
Measuring Visual Surprise Jointly from Intrinsic and Extrinsic Contexts for Image Saliency Estimation

  • Jia Li*
  • Yonghong Tian
  • Xiaowu Chen
  • Tiejun Huang
  • *Corresponding author for this work
  • Peking University
  • Cooperative Medianet Innovation Center

Research output: Contribution to journal, article, peer-reviewed

Abstract

Detecting conspicuous image content is a challenging task in the field of computer vision. In existing studies, most approaches focus on estimating saliency only with the cues from the input image. However, such “intrinsic” cues are often insufficient to distinguish targets and distractors that may share some common visual attributes. To address this problem, we present an approach to estimate image saliency by measuring the joint visual surprise from intrinsic and extrinsic contexts. In this approach, a hierarchical context model is first built on a database of 31.2 million images, where a Gaussian mixture model (GMM) is trained for each leaf node to encode the prior knowledge on “what is where” in a specific scene. For a testing image that shares similar spatial layout within a scene, the pre-trained GMM can serve as an extrinsic context model to measure the “surprise” of an image patch. Since human attention may quickly shift between different surprising locations, we adopt a Markov chain to model a surprise-driven attention-shifting process so as to infer the salient patches that can best capture human attention. Experiments show that our approach outperforms 19 state-of-the-art methods in fixation prediction.
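The abstract does not give implementation details, so the following is only a minimal sketch of the two ideas it names: scoring a patch's "surprise" under a pre-trained GMM, and ranking patches with a Markov chain over attention shifts. It assumes a diagonal-covariance GMM, surprise defined as negative log-likelihood, and a transition rule that favours nearby, surprising patches. All function names, parameters, and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np

def gmm_log_likelihood(x, weights, means, variances):
    """Log-density of each row of x under a diagonal-covariance GMM (assumed form)."""
    x = np.atleast_2d(x)                                    # (n, d)
    diff = x[:, None, :] - means[None, :, :]                # (n, k, d)
    log_comp = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                       + np.sum(np.log(2 * np.pi * variances), axis=1))
    log_weighted = log_comp + np.log(weights)               # (n, k)
    m = log_weighted.max(axis=1, keepdims=True)             # log-sum-exp for stability
    return (m + np.log(np.exp(log_weighted - m).sum(axis=1, keepdims=True))).ravel()

def surprise(x, weights, means, variances):
    """Assumption: surprise is the negative log-likelihood under the context GMM."""
    return -gmm_log_likelihood(x, weights, means, variances)

def attention_stationary(scores, positions, sigma=1.0, n_iter=200):
    """Stationary distribution of a hypothetical surprise-driven attention chain."""
    s = scores - scores.min() + 1e-6                        # strictly positive weights
    d2 = np.sum((positions[:, None, :] - positions[None, :, :]) ** 2, axis=2)
    # Transition i -> j is more likely when j is close to i and surprising.
    affinity = np.exp(-d2 / (2 * sigma ** 2)) * s[None, :]
    P = affinity / affinity.sum(axis=1, keepdims=True)      # row-stochastic matrix
    pi = np.full(len(s), 1.0 / len(s))
    for _ in range(n_iter):                                 # power iteration
        pi = pi @ P
    return pi / pi.sum()

# Toy example: one-component "context" GMM and four patches on a 2x2 grid.
weights = np.array([1.0])
means = np.array([[0.0, 0.0]])
variances = np.array([[1.0, 1.0]])
features = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0]])
positions = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])

scores = surprise(features, weights, means, variances)
pi = attention_stationary(scores, positions)
print("surprise:", scores)
print("stationary saliency:", pi)
```

In this toy setup the fourth patch lies far from the GMM mean, so it receives the largest surprise and the largest share of the stationary distribution. With a trained model, a library routine such as scikit-learn's `GaussianMixture.score_samples` could stand in for the hand-written density.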

Original language: English
Pages (from-to): 44-60
Number of pages: 17
Journal: International Journal of Computer Vision
Volume: 120
Issue: 1
DOI
Publication status: Published - 1 Oct 2016
