TY - JOUR
T1 - MSAGNet
T2 - Multi-Stream Attribute-Guided Network for Occluded Pedestrian Detection
AU - Zhang, Hong
AU - Yan, Chaoqi
AU - Li, Xuliang
AU - Yang, Yifan
AU - Yuan, Ding
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2022
Y1 - 2022
N2 - Pedestrian detection plays an indispensable role in human-centric applications. Although having enjoyed the merits of generic object detectors based on deep learning frameworks, pedestrian detection is still a persistent crucial task since the pedestrians often gather together and occlude each other. In this study, we propose a simple yet effective Multi-Stream Attribute-Guided Network (MSAGNet) to regard occluded pedestrian detection as a standard central point and height estimation problem. Specifically, we focus on searching for the central points of the pedestrians and predicting the scales and offsets of the corresponding pedestrians. Meanwhile, an adaptive weighting parameter, i.e., Intersection over the Visible part region of ground truth (IoV), is utilized to conduct accurate bounding box regression. Furthermore, a novel nonlinear Non-Maximum Suppression (NMS) is proposed to flexibly prune false positives and decrease the miss rate of adjacent overlapping pedestrians. Experimental results on Caltech-USA, CityPersons, CrowdHuman and WiderPerson pedestrian datasets show that the proposed MSAGNet can obtain significant performance boosts, while maintaining a reasonable run-time speed.
AB - Pedestrian detection plays an indispensable role in human-centric applications. Although having enjoyed the merits of generic object detectors based on deep learning frameworks, pedestrian detection is still a persistent crucial task since the pedestrians often gather together and occlude each other. In this study, we propose a simple yet effective Multi-Stream Attribute-Guided Network (MSAGNet) to regard occluded pedestrian detection as a standard central point and height estimation problem. Specifically, we focus on searching for the central points of the pedestrians and predicting the scales and offsets of the corresponding pedestrians. Meanwhile, an adaptive weighting parameter, i.e., Intersection over the Visible part region of ground truth (IoV), is utilized to conduct accurate bounding box regression. Furthermore, a novel nonlinear Non-Maximum Suppression (NMS) is proposed to flexibly prune false positives and decrease the miss rate of adjacent overlapping pedestrians. Experimental results on Caltech-USA, CityPersons, CrowdHuman and WiderPerson pedestrian datasets show that the proposed MSAGNet can obtain significant performance boosts, while maintaining a reasonable run-time speed.
KW - Pedestrian detection
KW - bounding box regression
KW - deep learning
KW - false positives
KW - non-maximum suppression
UR - https://www.scopus.com/pages/publications/85140749751
U2 - 10.1109/LSP.2022.3215920
DO - 10.1109/LSP.2022.3215920
M3 - 文章
AN - SCOPUS:85140749751
SN - 1070-9908
VL - 29
SP - 2163
EP - 2167
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
ER -