TY - GEN
T1 - Learning deep appearance feature for multi-Target tracking
AU - Li, Hexi
AU - Jiang, Na
AU - Sun, Chenxin
AU - Zhou, Zhong
AU - Wu, Wei
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - Multi-Target tracking is a worthy studying issue in computer vision. For surveillance video, frequent occlusion and dense crowds complicate the issue. To resolve these difficulties, this paper proposes an effective algorithm of multi-Target tracking in videos. Firstly, the faster Rcnn is proposed with the residual network to extract the objects of pedestrians in surveillance videos. The proposedment can effectively eliminate invalid target detection frames, separate peer targets and resist partial occlusions. Then, this paper put forward an accurate and efficient appearance-feature matching network model that is inspired by pedestrian re-identification theory. The deep learning feature-extraction module is composed of the stem Cnn and the Resnet blocks, therefore it can load res-50 caffemodel as pretraining model to increase the accuracy of the featureextraction. Meanwhile, the proposed network can decrease the time of train and test comparing with Resnet. Finally, the obtained multiple target tracking trajectories are further optimized by the strategy of occlusion distinction, deduplication and merging. The experiment results of the 2D MOT 2015 benchmark, KITTI dataset indicate that this proposed algorithm outperforms alternative multiple objects trackers in terms of multiple indicators.
AB - Multi-Target tracking is a worthy studying issue in computer vision. For surveillance video, frequent occlusion and dense crowds complicate the issue. To resolve these difficulties, this paper proposes an effective algorithm of multi-Target tracking in videos. Firstly, the faster Rcnn is proposed with the residual network to extract the objects of pedestrians in surveillance videos. The proposedment can effectively eliminate invalid target detection frames, separate peer targets and resist partial occlusions. Then, this paper put forward an accurate and efficient appearance-feature matching network model that is inspired by pedestrian re-identification theory. The deep learning feature-extraction module is composed of the stem Cnn and the Resnet blocks, therefore it can load res-50 caffemodel as pretraining model to increase the accuracy of the featureextraction. Meanwhile, the proposed network can decrease the time of train and test comparing with Resnet. Finally, the obtained multiple target tracking trajectories are further optimized by the strategy of occlusion distinction, deduplication and merging. The experiment results of the 2D MOT 2015 benchmark, KITTI dataset indicate that this proposed algorithm outperforms alternative multiple objects trackers in terms of multiple indicators.
KW - Appearance match
KW - Multi-Target tracking
KW - Target detection
KW - Trajectory optimization
UR - https://www.scopus.com/pages/publications/85067046762
U2 - 10.1109/ICVRV.2017.00011
DO - 10.1109/ICVRV.2017.00011
M3 - 会议稿件
AN - SCOPUS:85067046762
T3 - Proceedings - 2017 International Conference on Virtual Reality and Visualization, ICVRV 2017
SP - 7
EP - 12
BT - Proceedings - 2017 International Conference on Virtual Reality and Visualization, ICVRV 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Virtual Reality and Visualization, ICVRV 2017
Y2 - 21 October 2017 through 22 October 2017
ER -