TY - GEN
T1 - Multi-Scale Feature Fusion Network for VideoBased Person Re-Identification
AU - Liu, Penggao
AU - Ai, Mingjing
AU - Shan, Guozhi
N1 - Publisher Copyright:
©2021 IEEE.
PY - 2021
Y1 - 2021
N2 - In recent years, person re-identification technology has been greatly developed. Image-based person reidentification algorithms have achieved excellent performance on open source datasets. In contrast, the development of videobased person re-identification technology is relatively backward. At present, the main research work of video-based person re-identification algorithms is focused on the processing of temporal information in the picture sequence. Complex appearance features are not effective when performing temporal fusion, so the frame-level features used are almost based on global features. This paper proposes a video person re-identification model based on multi-scale feature fusion. The multi-scale feature fusion of the model is embodied in the design of the frame-level feature extraction module. This module extracts the frame-level features of different scales, and then catenates them together into vectors, which not only improves the feature discrimination degree, but also makes the catenated frame-level features carry out effective temporal fusion, and the test results on the Mars dataset have reached a competitive level. At the same time, a series of comparative experiments were carried out on the model parameters to achieve further optimization of performance.
AB - In recent years, person re-identification technology has been greatly developed. Image-based person reidentification algorithms have achieved excellent performance on open source datasets. In contrast, the development of videobased person re-identification technology is relatively backward. At present, the main research work of video-based person re-identification algorithms is focused on the processing of temporal information in the picture sequence. Complex appearance features are not effective when performing temporal fusion, so the frame-level features used are almost based on global features. This paper proposes a video person re-identification model based on multi-scale feature fusion. The multi-scale feature fusion of the model is embodied in the design of the frame-level feature extraction module. This module extracts the frame-level features of different scales, and then catenates them together into vectors, which not only improves the feature discrimination degree, but also makes the catenated frame-level features carry out effective temporal fusion, and the test results on the Mars dataset have reached a competitive level. At the same time, a series of comparative experiments were carried out on the model parameters to achieve further optimization of performance.
KW - Multi-scale features
KW - Person re-identification
KW - Temporal fusion
UR - https://www.scopus.com/pages/publications/85125738888
U2 - 10.1109/ICETCI53161.2021.9563438
DO - 10.1109/ICETCI53161.2021.9563438
M3 - 会议稿件
AN - SCOPUS:85125738888
T3 - 2021 IEEE International Conference on Electronic Technology, Communication and Information, ICETCI 2021
SP - 228
EP - 232
BT - 2021 IEEE International Conference on Electronic Technology, Communication and Information, ICETCI 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 IEEE International Conference on Electronic Technology, Communication and Information, ICETCI 2021
Y2 - 27 August 2021 through 29 August 2021
ER -