Skip to main navigation Skip to search Skip to main content

An Object-Level High-Order Contextual Descriptor Based on Semantic, Spatial, and Scale Cues

  • Tianjin University
  • CAS - Institute of Information Engineering

Research output: Contribution to journalArticlepeer-review

Abstract

Context has been playing an increasingly important role in areas such as object detection, scene understanding, and image segmentation. Although many different types of contextual cues have been successfully explored, most of them only consider the pair-wise relationship between objects or parts. Several models utilize the high-order relationship for encoding contextual information. However, they mainly use a single contextual cue. In this paper, we present a novel high-order contextual descriptor (HOOD) to measure the strength of interactions among objects within an image. Heterogeneous contextual cues like semantic, spatial, and scale contexts are jointly integrated into HOOD to define the high-order interactions. The strength of these interactions are inferred by applying Bayes' rule on the pure dependence of the involved objects. As a result, an object-level graph is constructed to represent the contextually consistent interactions. Moreover, we propose a HOOD based object localization framework to verify the effectiveness of HOOD. Experimental results on two benchmark datasets including SUN09 and PASCAL2007 show that our framework outperforms the state-of-the-art context based object localization methods. Finally, we apply HOOD on two multimedia applications: structured image retrieval and out-of-context object detection, which demonstrates the potential usages of HOOD.

Original languageEnglish
Article number6891238
Pages (from-to)1327-1339
Number of pages13
JournalIEEE Transactions on Cybernetics
Volume45
Issue number7
DOIs
StatePublished - 1 Jul 2015

Keywords

  • Contextual descriptor
  • object localization
  • out of context
  • structured image retrieval

Fingerprint

Dive into the research topics of 'An Object-Level High-Order Contextual Descriptor Based on Semantic, Spatial, and Scale Cues'. Together they form a unique fingerprint.

Cite this