Learning Dynamic GMM for Attention Distribution on Single-Face Videos

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The past decade has witnessed the popularity of video conferencing, such as FaceTime and Skype. In video conferencing, almost every frame has a human face. Hence, it is necessary to predict attention on face videos by saliency detection, as saliency can be used as a guidance of regionof- interest (ROI) for the content-based applications. To this end, this paper proposes a novel approach for saliency detection in single-face videos. From the data-driven perspective, we first establish an eye tracking database which contains fixations of 70 single-face videos viewed by 40 subjects. Through analysis on our database, we investigate that most attention is attracted by face in videos, and that attention distribution within a face varies with regard to face size and mouth movement. Inspired by the previous work which applies Gaussian mixture model (GMM) for face saliency detection in still images, we propose to model visual attention on face region for videos by dynamic GMM (DGMM), the variation of which relies on face size, mouth movement and facial landmarks. Then, we develop a long shortterm memory (LSTM) neural network in estimating DGMM for saliency detection of single-face videos, so called LSTM-DGMM. Finally, the experimental results show that our approach outperforms other state-of-the-art approaches in saliency detection of single-face videos.

Original languageEnglish
Title of host publicationProceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017
PublisherIEEE Computer Society
Pages1632-1641
Number of pages10
ISBN (Electronic)9781538607336
DOIs
StatePublished - 22 Aug 2017
Event30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017 - Honolulu, United States
Duration: 21 Jul 201726 Jul 2017

Publication series

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
Volume2017-July
ISSN (Print)2160-7508
ISSN (Electronic)2160-7516

Conference

Conference30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017
Country/TerritoryUnited States
CityHonolulu
Period21/07/1726/07/17

Fingerprint

Dive into the research topics of 'Learning Dynamic GMM for Attention Distribution on Single-Face Videos'. Together they form a unique fingerprint.

Cite this