跳到主要导航 跳到搜索 跳到主要内容

Relational attention network for crowd counting

  • Anran Zhang
  • , Jiayi Shen
  • , Zehao Xiao
  • , Fan Zhu
  • , Xiantong Zhen
  • , Xianbin Cao*
  • , Ling Shao
  • *此作品的通讯作者
  • Beihang University
  • Inception Institute of Artificial Intelligence
  • BUAA-CCMU Advanced Innovation Center for Big Data-based Precision Medicine

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Crowd counting is receiving rapidly growing research interests due to its potential application value in numerous real-world scenarios. However, due to various challenges such as occlusion, insufficient resolution and dynamic backgrounds, crowd counting remains an unsolved problem in computer vision. Density estimation is a popular strategy for crowd counting, where conventional density estimation methods perform pixel-wise regression without explicitly accounting the interdependence of pixels. As a result, independent pixel-wise predictions can be noisy and inconsistent. In order to address such an issue, we propose a Relational Attention Network (RANet) with a self-attention mechanism for capturing interdependence of pixels. The RANet enhances the self-attention mechanism by accounting both short-range and long-range interdependence of pixels, where we respectively denote these implementations as local self-attention (LSA) and global self-attention (GSA). We further introduce a relation module to fuse LSA and GSA to achieve more informative aggregated feature representations. We conduct extensive experiments on four public datasets, including ShanghaiTech A, ShanghaiTech B, UCF-CC-50 and UCF-QNRF. Experimental results on all datasets suggest RANet consistently reduces estimation errors and surpasses the state-of-the-art approaches by large margins.

源语言英语
主期刊名Proceedings - 2019 International Conference on Computer Vision, ICCV 2019
出版商Institute of Electrical and Electronics Engineers Inc.
6787-6796
页数10
ISBN(电子版)9781728148038
DOI
出版状态已出版 - 10月 2019
活动17th IEEE/CVF International Conference on Computer Vision, ICCV 2019 - Seoul, 韩国
期限: 27 10月 20192 11月 2019

出版系列

姓名Proceedings of the IEEE International Conference on Computer Vision
ISSN(印刷版)1550-5499

会议

会议17th IEEE/CVF International Conference on Computer Vision, ICCV 2019
国家/地区韩国
Seoul
时期27/10/192/11/19

指纹

探究 'Relational attention network for crowd counting' 的科研主题。它们共同构成独一无二的指纹。

引用此