Skip to main navigation Skip to search Skip to main content

An unsupervised real-time tracking and recognition framework in videos

  • Huafeng Wang*
  • , Yunhong Wang
  • , Jin Huang
  • , Fan Wang
  • , Zhaoxiang Zhang
  • *Corresponding author for this work
  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

Abstract

A novel framework for unsupervised face tracking and recognition is built on Detection-Tracking-Refinement-Recognition (DTRR) approach. This framework proposed a hybrid face detector for real-time face tracking which is robust to occlusions, facial expression and posture changes. After a posture correction and face alignment, the tracked face is featured by the Local Ternary Pattern (LTP) operator. Then these faces are clustered into several groups according to the distance between feature vectors. During the next step, those groups which each contains a series of faces can be further merged by the Scale-invariant feature transform (SIFT) operator. Due to extreme computing time consumption by SIFT, a multithreaded refinement process was given. After the refinement process, the relevant faces are put together which is of much importance for face recognition in videos. The framework is validated both on several videos collected in unconstrained condition (8 min each.) and on Honda/UCSD database. These experiments demonstrated that the framework is capable of tracking the face and automatically grouping a serial faces for a single human-being object in an unlabeled video robustly.

Original languageEnglish
Title of host publicationThe Era of Interactive Media
PublisherSpringer New York
Pages447-457
Number of pages11
Volume9781461435013
ISBN (Electronic)9781461435013
ISBN (Print)1461435005, 9781461435006
DOIs
StatePublished - 1 Oct 2013

Keywords

  • Face recognition
  • Real-time
  • Refinement
  • Tracking
  • Unsupervised
  • Video

Fingerprint

Dive into the research topics of 'An unsupervised real-time tracking and recognition framework in videos'. Together they form a unique fingerprint.

Cite this