跳到主要导航 跳到搜索 跳到主要内容

Attentive relational networks for mapping images to scene graphs

  • Mengshi Qi
  • , Weijian Li
  • , Zhengyuan Yang
  • , Yunhong Wang*
  • , Jiebo Luo
  • *此作品的通讯作者
  • Beihang University
  • Beijing Advanced Innovation Center for Big Data and Brain Computing
  • University of Rochester

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Scene graph generation refers to the task of automatically mapping an image into a semantic structural graph, which requires correctly labeling each extracted object and their interaction relationships. Despite the recent success in object detection using deep learning techniques, inferring complex contextual relationships and structured graph representations from visual data remains a challenging topic. In this study, we propose a novel Attentive Relational Network that consists of two key modules with an object detection backbone to approach this problem. The first module is a semantic transformation module utilized to capture semantic embedded relation features, by translating visual features and linguistic features into a common semantic space. The other module is a graph self-attention module introduced to embed a joint graph representation through assigning various importance weights to neighboring nodes. Finally, accurate scene graphs are produced by the relation inference module to recognize all entities and corresponding relations. We evaluate our proposed method on the widely-adopted Visual Genome Dataset, and the results demonstrate the effectiveness and superiority of our model.

源语言英语
主期刊名Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
出版商IEEE Computer Society
3952-3961
页数10
ISBN(电子版)9781728132938
DOI
出版状态已出版 - 6月 2019
活动32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019 - Long Beach, 美国
期限: 16 6月 201920 6月 2019

出版系列

姓名Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
2019-June
ISSN(印刷版)1063-6919

会议

会议32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
国家/地区美国
Long Beach
时期16/06/1920/06/19

指纹

探究 'Attentive relational networks for mapping images to scene graphs' 的科研主题。它们共同构成独一无二的指纹。

引用此