Visual saliency based aerial video summarization by online scene classification

  • Jiewei Wang*
  • , Yunhong Wang
  • , Zhaoxiang Zhang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Compared with traditional video summarization approaches, aerial video summarization is a new and challenging issue for its particular characteristics. Aerial video data is a massive data stream, without pre-edit structures such as sports or news video data, lack of camera motion such as zoom and pan. On account of these characteristics, we proposed a novel approach for summarization. First, we extract GIST features for each frame as the holistic scene representation. Then, we divide aerial video into temporal segments representing a visual scene using on-line clustering method by examine GIST features of each frame only once. Finally, we select several key frames from each scene for summarization according to visual saliency index (VSI) of each frame computed from their visual saliency map. In the paper, we proposed new criterion for estimation of temporal segmentation of streaming video. Experimental observations show the success of our approach on aerial video summarization.

Original languageEnglish
Title of host publicationProceedings - 6th International Conference on Image and Graphics, ICIG 2011
Pages777-782
Number of pages6
DOIs
StatePublished - 2011
Event6th International Conference on Image and Graphics, ICIG 2011 - Hefei, Anhui, China
Duration: 12 Aug 201115 Aug 2011

Publication series

NameProceedings - 6th International Conference on Image and Graphics, ICIG 2011

Conference

Conference6th International Conference on Image and Graphics, ICIG 2011
Country/TerritoryChina
CityHefei, Anhui
Period12/08/1115/08/11

Keywords

  • Aerial video summarization
  • Online clustering
  • Saliency
  • Scene classification
  • Visual attention

Fingerprint

Dive into the research topics of 'Visual saliency based aerial video summarization by online scene classification'. Together they form a unique fingerprint.

Cite this