
Semi-Supervised Cross-View Projection-Based Dictionary Learning for Video-Based Person Re-Identification

  • Xiaoke Zhu
  • Xiao Yuan Jing*
  • Liang Yang
  • Xinge You
  • Dan Chen
  • Guangwei Gao
  • Yunhong Wang

* Corresponding author for this work

  • Wuhan University
  • Henan University
  • Nanjing University of Posts and Telecommunications
  • Huazhong University of Science and Technology

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Video-based person re-identification (re-id) has attracted considerable research interest. As the volume of new pedestrian video grows rapidly, existing video-based person re-id methods usually need large quantities of labeled pedestrian videos to train a discriminative model. In practice, labeling pedestrian videos at that scale is costly and time-consuming, which limits the application of these methods in real environments. It is therefore valuable to investigate how to learn a discriminative re-id model from a limited number of labeled training videos. In this paper, we propose a semi-supervised cross-view projection-based dictionary learning (SCPDL) approach for video-based person re-id. Specifically, SCPDL jointly learns a pair of feature projection matrices and a pair of dictionaries by integrating the information contained in labeled and unlabeled pedestrian videos. The learned feature projection matrices reduce the influence of within-video variations on re-id. The learned dictionary pair converts pedestrian videos from two different cameras into coding coefficients in a common representation space, bridging the differences between the cameras. During learning, the labeled pedestrian videos ensure that the learned dictionaries are discriminative, while the large quantity of unlabeled pedestrian videos lets SCPDL better capture the variations between pedestrian videos, giving the learned dictionaries stronger representational capability. Experiments on two public pedestrian sequence data sets (iLIDS-VID and PRID 2011) demonstrate the effectiveness of the proposed approach.
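
The abstract describes matching through coding coefficients in a common space. The following is a minimal sketch of how such a matching step could look once a projection matrix and a dictionary per camera are available. It is not the SCPDL learning procedure, which the abstract does not specify. The function names (`encode`, `rank_gallery`), the mean pooling over frames, the ridge-regularized coding, and the Euclidean matching are all assumptions made for illustration.

```python
import numpy as np


def encode(video_feats, W, D, lam=0.1):
    """Map one video to a coding coefficient vector (illustrative only).

    video_feats: (n_frames, d) per-frame features of one video
    W:           (d, p) feature projection matrix for this camera
    D:           (p, k) dictionary for this camera
    lam:         ridge weight (assumed; not taken from the paper)
    """
    projected = video_feats @ W          # (n_frames, p)
    pooled = projected.mean(axis=0)      # assumed pooling over frames
    k = D.shape[1]
    # Closed-form solution of argmin_a ||pooled - D a||^2 + lam * ||a||^2
    return np.linalg.solve(D.T @ D + lam * np.eye(k), D.T @ pooled)


def rank_gallery(probe_code, gallery_codes):
    """Return gallery indices sorted by Euclidean distance to the probe code."""
    dists = np.linalg.norm(gallery_codes - probe_code, axis=1)
    return np.argsort(dists)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d, p, k = 16, 8, 5
    # Random stand-ins for learned per-camera parameters (hypothetical values).
    W_a, W_b = rng.normal(size=(d, p)), rng.normal(size=(d, p))
    D_a, D_b = rng.normal(size=(p, k)), rng.normal(size=(p, k))

    probe = rng.normal(size=(20, d))                         # camera A video
    gallery = [rng.normal(size=(20, d)) for _ in range(3)]   # camera B videos

    probe_code = encode(probe, W_a, D_a)
    gallery_codes = np.stack([encode(g, W_b, D_b) for g in gallery])
    print(rank_gallery(probe_code, gallery_codes))
```

With random parameters the ranking is meaningless; the point is only the data flow: each camera's videos go through that camera's projection and dictionary, and comparison happens between the resulting coefficient vectors.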

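Original language: English
Article number: 7954687
Pages (from-to): 2599-2611
Number of pages: 13
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 28
Issue number: 10
Publication status: Published - Oct 2018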
