跳到主要导航 跳到搜索 跳到主要内容

Tower of Knowledge for scene interpretation: A survey

  • Beihang University

科研成果: 期刊稿件文章同行评审

摘要

The past few decades have witnessed a wealth of promising work in making machines interpret the scenes around us. However, scene interpretation is still in its infancy, in comparison with human cognition. As such, human language, a highly developed output of human cognition, can be seen as an important cue towards scene interpretation. We survey in this paper Tower of Knowledge (ToK) approaches, which take advantage of human language, for scene interpretation. The core of ToK approaches is a multi-layer architecture, namely ToK architecture, aiming to establish the information flow of scene interpretation. In general, ToK architecture can be applied in scene interpretation by exploiting its either vertical or horizontal connections. First, we focus on the approaches with respect to the vertical connections in ToK architecture. In such approaches, the optimal label is assigned to each identified object in a scene, on the basis of verifying whether the object has the right characteristics to fulfil the functions a label implies. Second, we discuss the approaches on utilising the horizontal connections of ToK architecture to interpret a scene, according to the asymmetric spatial relationships of the objects. In retrospect of what has been achieved so far, we finally outlook what the future may hold for ToK.

源语言英语
页(从-至)42-48
页数7
期刊Pattern Recognition Letters
48
DOI
出版状态已出版 - 15 10月 2014

指纹

探究 'Tower of Knowledge for scene interpretation: A survey' 的科研主题。它们共同构成独一无二的指纹。

引用此