跳到主要导航 跳到搜索 跳到主要内容

R-SSD: refined single shot multibox detector for pedestrian detection

  • Chaoqi Yan
  • , Hong Zhang
  • , Xuliang Li
  • , Ding Yuan*
  • *此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Pedestrian detection is a critical task in the field of computer vision, and it has made considerable progress with the help of Convnets. However, a persistent crucial problem is that small-scale pedestrians are notoriously difficult to detect because of the introduction of weak contrast and blurred boundaries in real-world scenarios. In this paper, we present a simple and compact detection method for detecting multi-scale pedestrians, which is especially suitable for detecting small-scale pedestrians that are not easily recognized in images or videos. We first interpret convolutional neural network (CNN) channel features, explore the detection performance of different feature fusion methods, and propose a novel two-level feature fusion strategy specially designed for small-scale pedestrians. Moreover, a sub-network named “prediction module” is injected into the framework to improve the general performance without any bells and whistles. In addition, we propose an adaptive loss that adds an adaptive adjustment coefficient to the Smooth L1 loss function to enhance its robustness to pedestrian detection tasks. Using these methods synthetically, we achieve state-of-the-art detection performance on the Caltech pedestrian dataset under three evaluation protocols; particularly, the performance of small-scale pedestrians under “Far” evaluation setting is improved (miss rate decreases from 70.97% to 60.09%). Further, the proposed method achieves a competitive speed-accuracy trade-off with 0.31 second per image of 1024×2048 pixels on the CityPersons dataset.

源语言英语
页(从-至)10430-10447
页数18
期刊Applied Intelligence
52
9
DOI
出版状态已出版 - 7月 2022

指纹

探究 'R-SSD: refined single shot multibox detector for pedestrian detection' 的科研主题。它们共同构成独一无二的指纹。

引用此